一本色道综合久久欧美日韩精品_国产一区二区黑人_91人妻精品丰满熟妇区九色_久久99久久99精品免费看动漫_最新激情中文字幕第一页_久久无码专区外国精品_久久国产精品免费一区下载

Evan L. Ray,Logan C. Brooks,Jacob Bien,Matthew Biggerstaff,Nikos I. Bosse,Johannes Bracher,Estee Y. Cramer,Sebastian Funk,Aaron Gerding,Michael A. Johansson,Aaron Rumack,Yijin Wang,Martha Zorn,Ryan J. Tibshirani,Nicholas G. Reich

The U.S. COVID-19 Forecast Hub aggregates forecasts of the short-term burden of COVID-19 in the United States from many contributing teams. We study methods for building an ensemble that combines forecasts from these teams. These experiments have informed the ensemble methods used by the Hub. To be most useful to policy makers, ensemble forecasts must have stable performance in the presence of two key characteristics of the component forecasts: (1) occasional misalignment with the reported data, and (2) instability in the relative performance of component forecasters over time. Our results indicate that in the presence of these challenges, an untrained and robust approach to ensembling using an equally weighted median of all component forecasts is a good choice to support public health decision makers. In settings where some contributing forecasters have a stable record of good performance, trained ensembles that give those forecasters higher weight can also be helpful.

相關內容

COVID-19

關注 0

基 · 優化器 · Performer · 無偏 · 相關系數 ·

2022 年 4 月 20 日

Optimal reconciliation with immutable forecasts

Bohan Zhang,Yanfei Kang,Anastasios Panagiotelis,Feng Li

The practical importance of coherent forecasts in hierarchical forecasting has inspired many studies on forecast reconciliation. Under this approach, so-called base forecasts are produced for every series in the hierarchy and are subsequently adjusted to be coherent in a second reconciliation step. Reconciliation methods have been shown to improve forecast accuracy, but will, in general, adjust the base forecast of every series. However, in an operational context, it is sometimes necessary or beneficial to keep forecasts of some variables unchanged after forecast reconciliation. In this paper, we formulate reconciliation methodology that keeps forecasts of a pre-specified subset of variables unchanged or "immutable". In contrast to existing approaches, these immutable forecasts need not all come from the same level of a hierarchy, and our method can also be applied to grouped hierarchies. We prove that our approach preserves unbiasedness in base forecasts. Our method can also account for correlations between base forecasting errors and ensure non-negativity of forecasts. We also perform empirical experiments, including an application to sales of a large scale online retailer, to assess the impacts of our proposed methodology.

MoDELS · 表示 · 語言模型化 · 子空間 · CASE ·

2022 年 4 月 20 日

Analyzing Gender Representation in Multilingual Models

Hila Gonen,Shauli Ravfogel,Yoav Goldberg

from arxiv, Published at RepL4NLP 2022

Multilingual language models were shown to allow for nontrivial transfer across scripts and languages. In this work, we study the structure of the internal representations that enable this transfer. We focus on the representation of gender distinctions as a practical case study, and examine the extent to which the gender concept is encoded in shared subspaces across different languages. Our analysis shows that gender representations consist of several prominent components that are shared across languages, alongside language-specific components. The existence of language-independent and language-specific components provides an explanation for an intriguing empirical observation we make: while gender classification transfers well across languages, interventions for gender removal, trained on a single language, do not transfer easily to others.

entity · 數據集 · 命名實體識別 · 類別 · 數據獲取 ·

2022 年 4 月 19 日

Named Entity Recognition for Partially Annotated Datasets

Michael Strobl,Amine Trabelsi,Osmar Zaiane

from arxiv, Long version of our short paper accepted at NLDB 2022

The most common Named Entity Recognizers are usually sequence taggers trained on fully annotated corpora, i.e. the class of all words for all entities is known. Partially annotated corpora, i.e. some but not all entities of some types are annotated, are too noisy for training sequence taggers since the same entity may be annotated one time with its true type but not another time, misleading the tagger. Therefore, we are comparing three training strategies for partially annotated datasets and an approach to derive new datasets for new classes of entities from Wikipedia without time-consuming manual data annotation. In order to properly verify that our data acquisition and training approaches are plausible, we manually annotated test datasets for two new classes, namely food and drugs.

Performer · 模型評估 · 有偏 · 控制器 · 注意力機制 ·

2022 年 4 月 18 日

Feature-based intermittent demand forecast combinations: bias, accuracy and inventory implications

Li Li,Yanfei Kang,Fotios Petropoulos,Feng Li

Intermittent demand forecasting is a ubiquitous and challenging problem in operations and supply chain management. There has been a growing focus on developing forecasting approaches for intermittent demand from academic and practical perspectives in recent years. However, limited attention has been given to forecast combination methods, which have been proved to achieve competitive performance in forecasting fast-moving time series. The current study aims to examine the empirical outcomes of some existing forecast combination methods, and propose a generalized feature-based framework for intermittent demand forecasting. We conduct a simulation study to perform a large-scale comparison of a series of combination methods based on an intermittent demand classification scheme. Further, a real data set is used to investigate the forecasting performance and offer insights with regards the inventory performance of the proposed framework by considering some complementary error measures. The proposed framework leads to a significant improvement in forecast accuracy and offers the potential of flexibility and interpretability in inventory control.

視頻描述生成（Video Caption） · 可辨認的 · 端到端 · Integration · MoDELS ·

2022 年 4 月 18 日

End-to-end Dense Video Captioning as Sequence Generation

Wanrong Zhu,Bo Pang,Ashish Thapliyal,William Yang Wang,Radu Soricut

Dense video captioning aims to identify the events of interest in an input video, and generate descriptive captions for each event. Previous approaches usually follow a two-stage generative process, which first proposes a segment for each event, then renders a caption for each identified segment. Recent advances in large-scale sequence generation pretraining have seen great success in unifying task formulation for a great variety of tasks, but so far, more complex tasks such as dense video captioning are not able to fully utilize this powerful paradigm. In this work, we show how to model the two subtasks of dense video captioning jointly as one sequence generation task, and simultaneously predict the events and the corresponding descriptions. Experiments on YouCook2 and ViTT show encouraging results and indicate the feasibility of training complex tasks such as end-to-end dense video captioning integrated into large-scale pre-trained models.

語言模型化 · MoDELS · 去噪 · 自編碼器 · 縮放 ·

2022 年 4 月 16 日

METRO: Efficient Denoising Pretraining of Large Scale Autoencoding Language Models with Model Generated Signals

Payal Bajaj,Chenyan Xiong,Guolin Ke,Xiaodong Liu,Di He,Saurabh Tiwary,Tie-Yan Liu,Paul Bennett,Xia Song,Jianfeng Gao

from arxiv, Update details in scaled initialization and add acknowledgement

We present an efficient method of pretraining large-scale autoencoding language models using training signals generated by an auxiliary model. Originated in ELECTRA, this training strategy has demonstrated sample-efficiency to pretrain models at the scale of hundreds of millions of parameters. In this work, we conduct a comprehensive empirical study, and propose a recipe, namely "Model generated dEnoising TRaining Objective" (METRO), which incorporates some of the best modeling techniques developed recently to speed up, stabilize, and enhance pretrained language models without compromising model effectiveness. The resultant models, METRO-LM, consisting of up to 5.4 billion parameters, achieve new state-of-the-art on the GLUE, SuperGLUE, and SQuAD benchmarks. More importantly, METRO-LM are efficient in that they often outperform previous large models with significantly smaller model sizes and lower pretraining cost.

秩 · Weight · 集成 · 匯聚 · MoDELS ·

2022 年 4 月 15 日

Two-Step Meta-Learning for Time-Series Forecasting Ensemble

Evaldas Vaiciukynas,Paulius Danenas,Vilius Kontrimas,Rimantas Butleris

from arxiv, Accepted to IEEE Access journal in April 22, 2021

Amounts of historical data collected increase and business intelligence applicability with automatic forecasting of time series are in high demand. While no single time series modeling method is universal to all types of dynamics, forecasting using an ensemble of several methods is often seen as a compromise. Instead of fixing ensemble diversity and size, we propose to predict these aspects adaptively using meta-learning. Meta-learning here considers two separate random forest regression models, built on 390 time-series features, to rank 22 univariate forecasting methods and recommend ensemble size. The forecasting ensemble is consequently formed from methods ranked as the best, and forecasts are pooled using either simple or weighted average (with a weight corresponding to reciprocal rank). The proposed approach was tested on 12561 micro-economic time-series (expanded to 38633 for various forecasting horizons) of M4 competition where meta-learning outperformed Theta and Comb benchmarks by relative forecasting errors for all data types and horizons. Best overall results were achieved by weighted pooling with a symmetric mean absolute percentage error of 9.21% versus 11.05% obtained using the Theta method.

state-of-the-art · 值域 · 多樣性 · 峰值 · MoDELS ·

2022 年 1 月 5 日

Forecasting: theory and practice

Fotios Petropoulos,Daniele Apiletti,Vassilios Assimakopoulos,Mohamed Zied Babai,Devon K. Barrow,Souhaib Ben Taieb,Christoph Bergmeir,Ricardo J. Bessa,Jakub Bijak,John E. Boylan,Jethro Browell,Claudio Carnevale,Jennifer L. Castle,Pasquale Cirillo,Michael P. Clements,Clara Cordeiro,Fernando Luiz Cyrino Oliveira,Shari De Baets,Alexander Dokumentov,Joanne Ellison,Piotr Fiszeder,Philip Hans Franses,David T. Frazier,Michael Gilliland,M. Sinan G?nül,Paul Goodwin,Luigi Grossi,Yael Grushka-Cockayne,Mariangela Guidolin,Massimo Guidolin,Ulrich Gunter,Xiaojia Guo,Renato Guseo,Nigel Harvey,David F. Hendry,Ross Hollyman,Tim Januschowski,Jooyoung Jeon,Victor Richmond R. Jose,Yanfei Kang,Anne B. Koehler,Stephan Kolassa,Nikolaos Kourentzes,Sonia Leva,Feng Li,Konstantia Litsiou,Spyros Makridakis,Gael M. Martin,Andrew B. Martinez,Sheik Meeran,Theodore Modis,Konstantinos Nikolopoulos,Dilek ?nkal,Alessia Paccagnini,Anastasios Panagiotelis,Ioannis Panapakidis,Jose M. Pavía,Manuela Pedio,Diego J. Pedregal,Pierre Pinson,Patrícia Ramos,David E. Rapach,J. James Reade,Bahman Rostami-Tabar,Micha? Rubaszek,Georgios Sermpinis,Han Lin Shang,Evangelos Spiliotis,Aris A. Syntetos,Priyanga Dilini Talagala,Thiyanga S. Talagala,Len Tashman,Dimitrios Thomakos,Thordis Thorarinsdottir,Ezio Todini,Juan Ramón Trapero Arenas,Xiaoqian Wang,Robert L. Winkler,Alisa Yusupova,Florian Ziel

Forecasting has always been at the forefront of decision making and planning. The uncertainty that surrounds the future is both exciting and challenging, with individuals and organisations seeking to minimise risks and maximise utilities. The large number of forecasting applications calls for a diverse set of forecasting methods to tackle real-life challenges. This article provides a non-systematic review of the theory and the practice of forecasting. We provide an overview of a wide range of theoretical, state-of-the-art models, methods, principles, and approaches to prepare, produce, organise, and evaluate forecasts. We then demonstrate how such theoretical concepts are applied in a variety of real-life contexts. We do not claim that this review is an exhaustive list of methods and applications. However, we wish that our encyclopedic presentation will offer a point of reference for the rich work that has been undertaken over the last decades, with some key insights for the future of forecasting theory and practice. Given its encyclopedic nature, the intended mode of reading is non-linear. We offer cross-references to allow the readers to navigate through the various topics. We complement the theoretical concepts and applications covered by large lists of free or open-source software implementations and publicly-available databases.

估計/估計量 · 估計誤差 · MoDELS · 學成 · 無偏 ·

2020 年 12 月 17 日

The Causal Learning of Retail Delinquency

Yiyan Huang,Cheuk Hang Leung,Xing Yan,Qi Wu,Nanbo Peng,Dongdong Wang,Zhixiang Huang

from arxiv, This paper was accepted and will be published in the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI-21)

This paper focuses on the expected difference in borrower's repayment when there is a change in the lender's credit decisions. Classical estimators overlook the confounding effects and hence the estimation error can be magnificent. As such, we propose another approach to construct the estimators such that the error can be greatly reduced. The proposed estimators are shown to be unbiased, consistent, and robust through a combination of theoretical analysis and numerical testing. Moreover, we compare the power of estimating the causal quantities between the classical estimators and the proposed estimators. The comparison is tested across a wide range of models, including linear regression models, tree-based models, and neural network-based models, under different simulated datasets that exhibit different levels of causality, different degrees of nonlinearity, and different distributional properties. Most importantly, we apply our approaches to a large observational dataset provided by a global technology firm that operates in both the e-commerce and the lending business. We find that the relative reduction of estimation error is strikingly substantial if the causal effects are accounted for correctly.

簇 · Performer · 數據集 · MoDELS · DBSCAN ·

2019 年 10 月 30 日

Meta-Learning to Cluster

Yibo Jiang,Nakul Verma

Clustering is one of the most fundamental and wide-spread techniques in exploratory data analysis. Yet, the basic approach to clustering has not really changed: a practitioner hand-picks a task-specific clustering loss to optimize and fit the given data to reveal the underlying cluster structure. Some types of losses---such as k-means, or its non-linear version: kernelized k-means (centroid based), and DBSCAN (density based)---are popular choices due to their good empirical performance on a range of applications. Although every so often the clustering output using these standard losses fails to reveal the underlying structure, and the practitioner has to custom-design their own variation. In this work we take an intrinsically different approach to clustering: rather than fitting a dataset to a specific clustering loss, we train a recurrent model that learns how to cluster. The model uses as training pairs examples of datasets (as input) and its corresponding cluster identities (as output). By providing multiple types of training datasets as inputs, our model has the ability to generalize well on unseen datasets (new clustering tasks). Our experiments reveal that by training on simple synthetically generated datasets or on existing real datasets, we can achieve better clustering performance on unseen real-world datasets when compared with standard benchmark clustering techniques. Our meta clustering model works well even for small datasets where the usual deep learning models tend to perform worse.