We propose a novel prediction interval method for uncertainty quantification in regression tasks that learns the prediction mean and the lower and upper bounds of prediction intervals from three independently trained neural networks, using only the standard mean squared error (MSE) loss. Our method requires no distributional assumptions on the data and introduces no unusual hyperparameters to either the neural network models or the loss function. Moreover, it can effectively identify out-of-distribution samples and reasonably quantify their uncertainty. Numerical experiments on benchmark regression problems show that our method outperforms state-of-the-art methods with respect to predictive uncertainty quality, robustness, and identification of out-of-distribution samples.
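As a sketch of how such a three-network construction might look, the snippet below trains a mean network on the targets and two auxiliary networks on the magnitudes of the positive and negative residuals, all with plain MSE. The specific residual targets, architectures, and the calibration step that would scale the bounds to a desired coverage level are assumptions, not the paper's exact procedure.

```python
# Three independently trained networks, all with plain MSE (a hedged sketch).
import numpy as np
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(2000, 1))
y = np.sin(X[:, 0]) + 0.3 * rng.standard_normal(2000)

# Network 1: the prediction mean, fit directly to the targets.
mean_net = MLPRegressor(hidden_layer_sizes=(64,), max_iter=2000).fit(X, y)
resid = y - mean_net.predict(X)

# Networks 2 and 3: fit to the magnitudes of positive / negative residuals.
up_mask, lo_mask = resid >= 0, resid < 0
upper_net = MLPRegressor(hidden_layer_sizes=(64,), max_iter=2000).fit(X[up_mask], resid[up_mask])
lower_net = MLPRegressor(hidden_layer_sizes=(64,), max_iter=2000).fit(X[lo_mask], -resid[lo_mask])

x_test = np.linspace(-3, 3, 5).reshape(-1, 1)
mu = mean_net.predict(x_test)
pi = np.stack([mu - lower_net.predict(x_test), mu + upper_net.predict(x_test)], axis=1)
print(pi)  # per-point [lower, upper]; a calibration step would scale these to a target coverage
```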
Efficient and theoretically sound uncertainty quantification is crucial for building trust in deep learning models for critical real-world applications, yet it remains challenging. Useful uncertainty information is expected to have two key properties: it should be valid (guaranteeing coverage) and discriminative (more uncertain when the expected risk is high). Moreover, when combined with deep learning (DL) methods, it should be scalable and affect DL model performance minimally. Most existing Bayesian methods lack frequentist coverage guarantees and usually degrade model performance. The few available frequentist methods are rarely discriminative and/or violate coverage guarantees due to unrealistic assumptions. Moreover, many methods are expensive or require substantial modifications to the base neural network. Building upon recent advances in conformal prediction [13, 32] and leveraging the classical idea of kernel regression, we propose Locally Valid and Discriminative prediction intervals (LVD), a simple, efficient, and lightweight method to construct discriminative prediction intervals (PIs) for almost any DL model. With no assumptions on the data distribution, such PIs also offer finite-sample local coverage guarantees (in contrast to the simpler marginal coverage). We verify empirically, using diverse datasets, that besides being the only locally valid method for DL, LVD also exceeds or matches the performance (including coverage rate and prediction accuracy) of existing uncertainty quantification methods, while offering additional benefits in scalability and flexibility.
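A minimal sketch of the kernel-localized conformal idea, assuming absolute residuals as conformity scores and a Gaussian kernel over inputs; LVD's actual construction and its local validity guarantee are more involved than this.

```python
# Kernel-weighted split-conformal interval around a point prediction (a sketch).
import numpy as np

def kernel_conformal_interval(x, X_cal, scores, mu_hat, alpha=0.1, h=0.5):
    # Gaussian kernel weights between the query x and calibration inputs.
    w = np.exp(-np.sum((X_cal - x) ** 2, axis=1) / (2 * h ** 2))
    w /= w.sum()
    # Weighted (1 - alpha) quantile of the calibration conformity scores.
    order = np.argsort(scores)
    cum = np.cumsum(w[order])
    idx = min(np.searchsorted(cum, 1 - alpha), len(scores) - 1)
    q = scores[order][idx]
    return mu_hat - q, mu_hat + q

rng = np.random.default_rng(0)
X_cal = rng.uniform(-2, 2, size=(500, 1))
y_cal = X_cal[:, 0] ** 2 + (0.1 + np.abs(X_cal[:, 0])) * rng.standard_normal(500)
mu = X_cal[:, 0] ** 2                  # stand-in for the DL model's predictions
scores = np.abs(y_cal - mu)            # conformity scores on the calibration set
print(kernel_conformal_interval(np.array([0.0]), X_cal, scores, 0.0))
print(kernel_conformal_interval(np.array([1.8]), X_cal, scores, 1.8 ** 2))
```

On this toy heteroscedastic example, the interval returned near x = 1.8 (high noise) comes out wider than the one near x = 0 (low noise), illustrating the discriminative behavior that localization buys over a single marginal quantile.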
Wireless applications that use high-reliability low-latency links depend critically on the system's capability to predict link quality. This dependence is especially acute at the high carrier frequencies used by mmWave and THz systems, where links are susceptible to blockages. Predicting blockages with high reliability requires a large number of data samples to train effective machine learning models. With the aim of mitigating data requirements, we introduce a framework based on meta-learning, whereby data from distinct deployments are leveraged to optimize a shared initialization that decreases the dataset size necessary for any new deployment. Predictors of two different events are studied: (1) at least one blockage occurs in a time window, and (2) the link is blocked for the entire time window. The results show that an RNN-based predictor trained using meta-learning is able to predict blockages after observing fewer samples than predictors trained using standard methods.
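To make the shared-initialization idea concrete, here is a minimal Reptile-style sketch; the paper's exact meta-learning algorithm, RNN architecture, and data are not reproduced, so the feedforward model and toy "deployments" below are placeholders.

```python
# Reptile-style meta-learning of a shared initialization (a hedged sketch).
import copy
import torch

def sgd_steps(model, loss_fn, data, lr=1e-2, steps=5):
    """Inner-loop adaptation: a few SGD steps on one deployment's data."""
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    x, y = data
    for _ in range(steps):
        opt.zero_grad()
        loss_fn(model(x), y).backward()
        opt.step()

meta_model = torch.nn.Sequential(torch.nn.Linear(8, 32), torch.nn.ReLU(),
                                 torch.nn.Linear(32, 1))
loss_fn = torch.nn.BCEWithLogitsLoss()
meta_lr = 0.1

for _ in range(100):                                  # outer loop over deployments
    x = torch.randn(64, 8)                            # stand-in per-deployment features
    y = (x.sum(dim=1, keepdim=True) > 0).float()      # stand-in blockage labels
    task_model = copy.deepcopy(meta_model)
    sgd_steps(task_model, loss_fn, (x, y))            # adapt to this deployment
    with torch.no_grad():                             # Reptile update: move the shared
        for p_m, p_t in zip(meta_model.parameters(),  # init toward the adapted weights
                            task_model.parameters()):
            p_m += meta_lr * (p_t - p_m)
```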
This paper concerns the construction of confidence intervals in standard seroprevalence surveys. In particular, we discuss methods for constructing confidence intervals for the proportion of individuals in a population infected with a disease, using a sample of antibody test results and measurements of the test's false positive and false negative rates. We begin by documenting erratic behavior in the coverage probabilities of standard Wald and percentile bootstrap intervals when applied to this problem. We then consider two alternative sets of intervals constructed by test inversion. The first set of intervals is approximate, using either an asymptotic or a bootstrap approximation to the finite-sample distribution of a chosen test statistic. We consider several choices of test statistic, including maximum likelihood estimators and generalized likelihood ratio statistics. We show through simulation that, at empirically relevant parameter values and sample sizes, the coverage probabilities of these intervals are close to their nominal level and approximately equi-tailed. The second set of intervals is shown to contain the true parameter value with probability at least equal to the nominal level, but can be conservative in finite samples.
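A minimal sketch of the test-inversion construction, assuming known error rates and an exact two-sided binomial test as the inverted test (the paper studies several test statistics and bootstrap variants): the implied positive-test rate is p(π) = π(1 − FNR) + (1 − π)·FPR, and the interval collects every prevalence π that the test fails to reject.

```python
# Test-inversion confidence interval for prevalence pi (a hedged sketch).
import numpy as np
from scipy.stats import binomtest

def prevalence_ci(k, n, fpr, fnr, alpha=0.05):
    grid = np.linspace(0, 1, 2001)
    kept = []
    for pi in grid:
        p_pos = pi * (1 - fnr) + (1 - pi) * fpr     # implied positive-test rate
        if binomtest(k, n, p_pos).pvalue >= alpha:  # keep pi if not rejected
            kept.append(pi)
    return (min(kept), max(kept)) if kept else (np.nan, np.nan)

# e.g. 50 positives out of 3,000 tests, with a 0.5% FPR and a 10% FNR
print(prevalence_ci(50, 3000, fpr=0.005, fnr=0.10))
```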
The success of deep learning techniques over the last decades has opened up a new avenue of research for weather forecasting. Here, we take the novel approach of using a neural network to predict full probability density functions at each point in space and time rather than a single output value, thus producing a probabilistic weather forecast. This enables the calculation of both uncertainty and skill metrics for the neural network predictions and overcomes the common difficulty of inferring uncertainty from such predictions. The approach is data-driven, and the neural network is trained on the WeatherBench dataset (processed ERA5 data) to forecast geopotential and temperature 3 and 5 days ahead. Data exploration leads to the identification of the most important input variables, which are also found to agree with physical reasoning, thereby validating our approach. To further increase computational efficiency, each neural network is trained on a small subset of these variables. The outputs are then combined through a stacked neural network, the first time such a technique has been applied to weather data. Our approach is found to be more accurate than some numerical weather prediction models and as accurate as more complex alternative neural networks, with the added benefit of providing the key probabilistic information necessary for making informed weather forecasts.
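One way to realize per-point predictive densities is to discretize the target and train with cross-entropy, as in the minimal sketch below; whether the paper uses binned or parametric densities, and the toy inputs standing in for gridded fields, are assumptions.

```python
# Predicting a full (binned) probability density per output (a hedged sketch).
import torch

n_features, n_bins = 16, 50
bin_edges = torch.linspace(-3, 3, n_bins + 1)

net = torch.nn.Sequential(torch.nn.Linear(n_features, 64), torch.nn.ReLU(),
                          torch.nn.Linear(64, n_bins))
opt = torch.optim.Adam(net.parameters(), lr=1e-3)

x = torch.randn(256, n_features)                 # stand-in for gridded inputs
y = x[:, 0] + 0.2 * torch.randn(256)             # stand-in target (e.g. temperature anomaly)
target_bin = torch.bucketize(y, bin_edges).clamp(1, n_bins) - 1

for _ in range(200):
    opt.zero_grad()
    loss = torch.nn.functional.cross_entropy(net(x), target_bin)
    loss.backward()
    opt.step()

pdf = torch.softmax(net(x[:1]), dim=-1)          # predicted density over bins
print(pdf.sum().item(), pdf.argmax().item())     # sums to 1; modal bin
```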
Accurate forecasting is one of the fundamental focuses of the econometric time-series literature. Practitioners and policy makers often want to predict outcomes over an entire future time horizon rather than just a single $k$-step-ahead prediction. These series, apart from their own possible non-linear dependence, are often also influenced by many external predictors. In this paper, we construct prediction intervals for time-aggregated forecasts in a high-dimensional regression setting. Our approach is based on quantiles of residuals obtained by the popular LASSO routine. We allow for general heavy-tailed, long-memory, and nonlinear stationary error processes and stochastic predictors. Through a series of systematically arranged consistency results, we provide theoretical guarantees for our proposed quantile-based method in all of these scenarios. After validating our approach using simulations, we also propose a novel bootstrap-based method that can boost the coverage of the theoretical intervals. Finally, analyzing EPEX Spot data, we construct prediction intervals for hourly electricity prices over horizons spanning 17 weeks and contrast them with selected Bayesian and bootstrap interval forecasts.
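A minimal sketch of the quantile-of-residuals construction on a toy high-dimensional regression with heavy-tailed errors; the split, tuning parameter, and i.i.d. evaluation below ignore the time-aggregation and dependence structure that the paper's theory handles.

```python
# LASSO fit, then intervals from residual quantiles (a hedged sketch).
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)
n, p = 500, 100
X = rng.standard_normal((n, p))                      # high-dimensional predictors
beta = np.zeros(p)
beta[:5] = 1.0                                       # sparse ground truth
y = X @ beta + rng.standard_t(df=4, size=n)          # heavy-tailed errors

model = Lasso(alpha=0.1).fit(X[:400], y[:400])
resid = y[:400] - model.predict(X[:400])
lo, hi = np.quantile(resid, [0.05, 0.95])            # residual quantiles

y_hat = model.predict(X[400:])
intervals = np.stack([y_hat + lo, y_hat + hi], axis=1)
cover = np.mean((y[400:] >= intervals[:, 0]) & (y[400:] <= intervals[:, 1]))
print(f"empirical coverage on held-out points: {cover:.2f}")
```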
Cross-lingual language models are typically pretrained with masked language modeling on multilingual text or parallel sentences. In this paper, we introduce denoising word alignment as a new cross-lingual pretraining task. Specifically, the model first self-labels word alignments for parallel sentences. Then we randomly mask tokens in a bitext pair. Given a masked token, the model uses a pointer network to predict the aligned token in the other language. We alternately perform these two steps in an expectation-maximization manner. Experimental results show that our method improves cross-lingual transferability on various datasets, especially on token-level tasks such as question answering and structured prediction. Moreover, the model can serve as a pretrained word aligner, achieving reasonably low error rates on alignment benchmarks. The code and pretrained parameters are available at https://github.com/CZWin32768/XLM-Align.
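A minimal sketch of the pointer step alone: the masked token's hidden state attends over the other language's token states, and a cross-entropy loss is taken against a self-labeled alignment. The encoder, masking procedure, and EM alternation of the actual pretraining are omitted, and all tensors below are stand-ins.

```python
# Pointer-network loss for one masked token (a hedged sketch).
import torch

d, src_len, tgt_len = 64, 7, 9
src_h = torch.randn(src_len, d, requires_grad=True)   # stand-in encoder states (masked side)
tgt_h = torch.randn(tgt_len, d)                       # stand-in encoder states (other language)

masked_pos = 3
aligned_pos = torch.tensor([5])                       # self-labeled alignment for that token

logits = tgt_h @ src_h[masked_pos]                    # pointer scores over target positions
loss = torch.nn.functional.cross_entropy(logits.unsqueeze(0), aligned_pos)
loss.backward()                                       # gradients flow back into the encoder
print(loss.item())
```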
The dominant paradigm for relation prediction in knowledge graphs involves learning and operating on latent representations (i.e., embeddings) of entities and relations. However, these embedding-based methods do not explicitly capture the compositional logical rules underlying the knowledge graph, and they are limited to the transductive setting, where the full set of entities must be known during training. Here, we propose GraIL, a graph neural network based relation prediction framework that reasons over local subgraph structures and has a strong inductive bias toward learning entity-independent relational semantics. Unlike embedding-based models, GraIL is naturally inductive and can generalize to unseen entities and graphs after training. We provide theoretical proof and strong empirical evidence that GraIL can represent a useful subset of first-order logic, and we show that GraIL outperforms existing rule-induction baselines in the inductive setting. We also demonstrate significant gains from ensembling GraIL with various knowledge graph embedding methods in the transductive setting, highlighting the complementary inductive bias of our method.
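A minimal sketch of GraIL's first step: extracting the enclosing subgraph around a candidate (head, tail) pair and attaching double-radius distance labels, which a GNN would then score. The networkx toy graph is an assumption, and the GNN itself is omitted.

```python
# Enclosing-subgraph extraction around a candidate link (a hedged sketch).
import networkx as nx

def enclosing_subgraph(G, head, tail, k=2):
    near_head = nx.single_source_shortest_path_length(G, head, cutoff=k)
    near_tail = nx.single_source_shortest_path_length(G, tail, cutoff=k)
    nodes = set(near_head) & set(near_tail)           # intersection of the k-hop balls
    sub = G.subgraph(nodes).copy()
    # Entity-independent node labels: (distance to head, distance to tail).
    labels = {n: (near_head[n], near_tail[n]) for n in nodes}
    return sub, labels

G = nx.karate_club_graph()                            # stand-in for a knowledge graph
sub, labels = enclosing_subgraph(G, head=0, tail=33, k=2)
print(sub.number_of_nodes(), labels[0], labels[33])
```

Because the labels encode only distances to the candidate endpoints, nothing in the scored input depends on entity identity, which is what lets the method generalize to entirely unseen entities.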
RNN models have achieved state-of-the-art performance in a wide range of text mining tasks. However, these models are often regarded as black boxes and are criticized for their lack of interpretability. In this paper, we enhance the interpretability of RNNs by providing interpretable rationales for their predictions. Interpreting RNNs, however, is a challenging problem. First, unlike existing methods that rely on local approximation, we aim to provide rationales that are more faithful to the decision-making process of RNN models. Second, a flexible interpretation method should be able to assign contribution scores to text segments of varying lengths, instead of only to individual words. To tackle these challenges, we propose a novel attribution method, called REAT, to provide interpretations of RNN predictions. REAT decomposes the final prediction of an RNN into additive contributions of each word in the input text. This additive decomposition enables REAT to further obtain phrase-level attribution scores. In addition, REAT is generally applicable to various RNN architectures, including GRU, LSTM, and their bidirectional versions. Experimental results demonstrate the faithfulness and interpretability of the proposed attribution method. Comprehensive analysis shows that our attribution method can unveil the useful linguistic knowledge captured by RNNs. Further analysis demonstrates that our method can be utilized as a debugging tool to examine the vulnerabilities and failure modes of RNNs, which may suggest several promising directions for improving the generalization ability of RNNs.
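To illustrate what an additive decomposition of an RNN prediction looks like, the sketch below scores each word by the change it induces in the decoded hidden-state sequence, so the per-word contributions telescope to the final prediction (and sums over contiguous words give phrase-level scores). This is an illustrative scheme, not REAT's exact attribution.

```python
# Additive word-level decomposition of an RNN prediction (a hedged sketch).
import torch

d, vocab, seq_len = 32, 100, 6
emb = torch.nn.Embedding(vocab, d)
gru = torch.nn.GRU(d, d, batch_first=True)
out = torch.nn.Linear(d, 1)                          # e.g. a sentiment logit

tokens = torch.randint(0, vocab, (1, seq_len))
h_seq, _ = gru(emb(tokens))                          # hidden state after each word
logits = out(h_seq).squeeze()                        # decoded score after each prefix

# Contribution of word t = score after word t minus score after word t-1;
# these deltas sum exactly to the final prediction.
contrib = logits - torch.cat([torch.zeros(1), logits[:-1]])
print(contrib.tolist())
print(contrib.sum().item(), logits[-1].item())       # identical by construction
```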
There has been much recent work on training neural attention models at the sequence level, using either reinforcement learning-style methods or beam optimization. In this paper, we survey a range of classical objective functions that have been widely used to train linear models for structured prediction and apply them to neural sequence-to-sequence models. Our experiments show that these losses can perform surprisingly well, slightly outperforming beam search optimization in a like-for-like setup. We also report new state-of-the-art results on both IWSLT'14 German-English translation and Gigaword abstractive summarization. On the larger WMT'14 English-French translation task, sequence-level training achieves 41.5 BLEU, which is on par with the state of the art.
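One classical objective in this family is expected risk minimization over a candidate set; a minimal sketch follows, with candidate generation and the cost function (e.g., 1 − sentence BLEU) stubbed out as fixed numbers.

```python
# Expected-risk sequence-level loss over a candidate set (a hedged sketch).
import torch

model_scores = torch.tensor([2.1, 1.3, 0.2], requires_grad=True)  # log-scores of 3 candidates
costs = torch.tensor([0.1, 0.4, 0.9])                             # e.g. 1 - BLEU per candidate

probs = torch.softmax(model_scores, dim=0)        # distribution over the candidate set
expected_risk = (probs * costs).sum()             # sequence-level loss to minimize
expected_risk.backward()
print(expected_risk.item(), model_scores.grad)    # gradient shifts mass toward low-cost candidates
```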
Topic models have been widely explored as probabilistic generative models of documents. Traditional inference methods have sought closed-form derivations for updating the models; however, as the expressiveness of these models grows, so does the difficulty of performing fast and accurate inference over their parameters. This paper presents alternative neural approaches to topic modelling that provide parameterisable distributions over topics, permitting training by backpropagation in the framework of neural variational inference. In addition, with the help of a stick-breaking construction, we propose a recurrent network that is able to discover a notionally unbounded number of topics, analogous to Bayesian non-parametric topic models. Experimental results on the MXM Song Lyrics, 20NewsGroups, and Reuters News datasets demonstrate the effectiveness and efficiency of these neural topic models.
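A minimal sketch of the stick-breaking construction referenced above, with break fractions obtained by sigmoid-transforming Gaussian draws as one might in neural variational inference; the truncation level and the toy draws are assumptions.

```python
# Stick-breaking topic proportions from Gaussian draws (a hedged sketch).
import numpy as np

rng = np.random.default_rng(0)
K = 8                                             # truncation, for illustration only
eta = 1 / (1 + np.exp(-rng.standard_normal(K)))   # break fractions in (0, 1)

# pi_k = eta_k * prod_{j<k} (1 - eta_j): each topic takes a fraction of the
# remaining stick, so more topics can be added without renormalizing.
remaining = np.concatenate([[1.0], np.cumprod(1 - eta)[:-1]])
theta = eta * remaining
print(theta, theta.sum())                         # leftover mass can fund new topics
```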