Debiased machine learning is a meta-algorithm based on bias correction and sample splitting that computes confidence intervals for functionals (i.e., scalar summaries) of machine learning algorithms. For example, an analyst may want a confidence interval for a treatment effect estimated with a neural network. We provide a nonasymptotic debiased machine learning theorem that encompasses any global or local functional of any machine learning algorithm that satisfies a few simple, interpretable conditions. Formally, we prove consistency, Gaussian approximation, and semiparametric efficiency by finite-sample arguments. The rate of convergence is $n^{-1/2}$ for global functionals, and it degrades gracefully for local functionals. Our results culminate in a simple set of conditions that an analyst can use to translate modern learning theory rates into traditional statistical inference. The conditions reveal a general double robustness property for ill-posed inverse problems.
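As a rough illustration of the bias-correction and sample-splitting recipe described above, the sketch below applies cross-fitting to a partially linear model, a standard special case rather than the general theorem of the abstract. The function names, the synthetic data, and the use of ordinary least squares as a stand-in for "any machine learning algorithm" are all assumptions made for this example.

```python
import numpy as np

def ols_predict(X_train, y_train, X_test):
    """Stand-in nuisance learner (assumption): least squares with an intercept."""
    A_train = np.column_stack([np.ones(len(X_train)), X_train])
    A_test = np.column_stack([np.ones(len(X_test)), X_test])
    coef, *_ = np.linalg.lstsq(A_train, y_train, rcond=None)
    return A_test @ coef

def dml_partially_linear(Y, D, X, n_folds=5, seed=0):
    """Cross-fitted estimate of theta in Y = theta * D + g(X) + noise."""
    n = len(Y)
    folds = np.array_split(np.random.default_rng(seed).permutation(n), n_folds)
    res_Y, res_D = np.empty(n), np.empty(n)
    for test_idx in folds:
        train = np.ones(n, dtype=bool)
        train[test_idx] = False
        # Sample splitting: nuisances are fit on the other folds only.
        res_Y[test_idx] = Y[test_idx] - ols_predict(X[train], Y[train], X[test_idx])
        res_D[test_idx] = D[test_idx] - ols_predict(X[train], D[train], X[test_idx])
    # Bias correction: regress outcome residuals on treatment residuals.
    theta = (res_D @ res_Y) / (res_D @ res_D)
    score = res_D * (res_Y - theta * res_D)
    se = np.sqrt(np.mean(score**2) / np.mean(res_D**2) ** 2 / n)
    return theta, (theta - 1.96 * se, theta + 1.96 * se)

# Hypothetical synthetic data with a true effect of 1.0.
rng = np.random.default_rng(42)
n = 2000
X = rng.normal(size=(n, 3))
D = X @ np.array([0.5, -0.3, 0.2]) + rng.normal(size=n)
Y = 1.0 * D + X @ np.array([1.0, 2.0, -1.0]) + rng.normal(size=n)
theta_hat, (ci_lo, ci_hi) = dml_partially_linear(Y, D, X)
print(f"theta_hat={theta_hat:.3f}, 95% CI=({ci_lo:.3f}, {ci_hi:.3f})")
```

Any learner with a fit-and-predict interface could replace `ols_predict`. The interval is only as trustworthy as the nuisance learner's convergence rates, which is the kind of condition the abstract refers to.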

Related content

Machine Learning

Machine Learning is an international forum for research on computational approaches to learning. The journal publishes articles reporting substantive results on a wide range of learning methods applied to a variety of learning problems. Its featured papers describe research on problems and methods, applications research, and issues of research methodology. Papers making claims about learning problems or methods provide solid support through empirical studies, theoretical analysis, or comparison with psychological phenomena. Applications papers show how to apply learning methods to solve important application problems. Research methodology papers improve how machine learning research is conducted. All papers describe their supporting evidence in ways that other researchers can verify or replicate. Papers also detail the components of learning and discuss assumptions about knowledge representation and the performance task. Official website:

Performer · Online · Learning · Networking · Reducible

April 20, 2022

Online Caching with Optimistic Learning

Naram Mhaisen, George Iosifidis, Douglas Leith

From arXiv. To appear in IFIP Networking 2022

The design of effective online caching policies is an increasingly important problem for content distribution networks, online social networks, and edge computing services, among other areas. This paper proposes a new algorithmic toolbox for tackling this problem through the lens of optimistic online learning. We build upon the Follow-the-Regularized-Leader (FTRL) framework, which we develop further here to include predictions for the file requests, and we design online caching algorithms for bipartite networks with fixed-size caches or elastic leased caches subject to time-average budget constraints. The predictions are provided by a content recommendation system that influences the users' viewing activity and hence can naturally reduce the caching network's uncertainty about future requests. We prove that the proposed optimistic learning caching policies can achieve sub-zero performance loss (regret) for perfect predictions, and maintain the best achievable regret bound $O(\sqrt T)$ even for arbitrarily bad predictions. The performance of the proposed algorithms is evaluated with detailed trace-driven numerical tests.
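To make the idea of FTRL with request predictions concrete, here is a minimal sketch for a single fractional cache. It is a simplification under stated assumptions, not the paper's algorithm: the bipartite network, the elastic budgets, and the paper's regularizer schedule are not reproduced, the regularization weight `sigma` is fixed, and all names are hypothetical.

```python
import numpy as np

def project_capped_simplex(y, capacity, iters=60):
    """Euclidean projection onto {x in [0,1]^N : sum(x) = capacity} via bisection."""
    lo, hi = y.min() - 1.0, y.max()
    for _ in range(iters):
        tau = 0.5 * (lo + hi)
        if np.clip(y - tau, 0.0, 1.0).sum() > capacity:
            lo = tau
        else:
            hi = tau
    return np.clip(y - 0.5 * (lo + hi), 0.0, 1.0)

def optimistic_ftrl_caching(requests, predictions, n_files, capacity, sigma):
    """Return the total fractional hits and the last cache configuration."""
    grad_sum = np.zeros(n_files)  # accumulated one-hot request vectors
    hits = 0.0
    for t, file_id in enumerate(requests):
        hint = np.zeros(n_files)
        if predictions[t] is not None:  # predicted next request, if any
            hint[predictions[t]] = 1.0
        # Maximizer of <grad_sum + hint, x> - (sigma / 2) * ||x||^2 over the cache set.
        x = project_capped_simplex((grad_sum + hint) / sigma, capacity)
        hits += x[file_id]
        grad_sum[file_id] += 1.0
    return hits, x

# Hypothetical Zipf-like request trace.
rng = np.random.default_rng(0)
n_files, capacity, T = 20, 3, 500
popularity = 1.0 / np.arange(1, n_files + 1)
requests = rng.choice(n_files, size=T, p=popularity / popularity.sum())
sigma = np.sqrt(T)
hits_plain, _ = optimistic_ftrl_caching(requests, [None] * T, n_files, capacity, sigma)
hits_opt, x_last = optimistic_ftrl_caching(requests, list(requests), n_files, capacity, sigma)
print(f"hits without predictions: {hits_plain:.1f}, with perfect predictions: {hits_opt:.1f}")
```

With perfect predictions the hinted file's score can only rise relative to the others, so the per-step hit is never lower than in the prediction-free run. Robustness to bad predictions depends on how the regularizer is adapted, which this sketch leaves out.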

Learning · Partially observable Markov decision process · Overcomplete · Reinforcement learning · Identifiable

April 19, 2022

When Is Partially Observable Reinforcement Learning Not Scary?

Qinghua Liu, Alan Chung, Csaba Szepesvári, Chi Jin

Applications of reinforcement learning (RL) in which agents learn to make a sequence of decisions despite lacking complete information about the latent states of the controlled system, that is, in which they act under partial observability, are ubiquitous. Partially observable RL can be notoriously difficult: well-known information-theoretic results show that learning partially observable Markov decision processes (POMDPs) requires an exponential number of samples in the worst case. Yet this does not rule out the existence of large subclasses of POMDPs over which learning is tractable. In this paper we identify such a subclass, which we call weakly revealing POMDPs. This family rules out the pathological instances of POMDPs where observations are uninformative to a degree that makes learning hard. We prove that for weakly revealing POMDPs, a simple algorithm combining optimism and Maximum Likelihood Estimation (MLE) is sufficient to guarantee polynomial sample complexity. To the best of our knowledge, this is the first provably sample-efficient result for learning from interactions in overcomplete POMDPs, where the number of latent states can be larger than the number of observations.

MoDELS · Learning · Separate · Mutually independent · Black box

April 18, 2022

Separating Rule Discovery and Global Solution Composition in a Learning Classifier System

Michael Heider, Helena Stegherr, Jonathan Wurth, Roman Sraj, Jörg Hähner

From arXiv. Genetic and Evolutionary Computation Conference Companion (GECCO '22 Companion), July 9-13, 2022, Boston, MA, USA

While the use of digital agents to support crucial decision making is increasing, trust in the suggestions made by these agents is hard to achieve. Such trust is, however, essential to profit from their application, resulting in a need for explanations of both the decision-making process and the model. For many systems, such as common black-box models, achieving at least some explainability requires complex post-processing, while other systems profit from being, to a reasonable extent, inherently interpretable. We propose a rule-based learning system specifically conceptualised for, and thus especially suited to, these scenarios. Its models are inherently transparent and easily interpretable by design. One key innovation of our system is that the rules' conditions and the selection of rules that compose a problem's solution are evolved separately. We utilise independent rule fitnesses, which allows users to specifically tailor their model structure to fit the given requirements for explainability.

Estimation/Estimator · Kronecker product · Covariance matrix · Performer · Regularization term

April 18, 2022

Covariance Estimation for Matrix-valued Data

Yichi Zhang, Weining Shen, Dehan Kong

Covariance estimation for matrix-valued data has received increasing interest in applications. Unlike previous works that rely heavily on the matrix normal distribution assumption and the requirement of a fixed matrix size, we propose a class of distribution-free regularized covariance estimation methods for high-dimensional matrix data under a separability condition and a bandable covariance structure. Under these conditions, the original covariance matrix is decomposed into a Kronecker product of two small bandable covariance matrices representing the variability over the row and column directions. We formulate a unified framework for estimating bandable covariance, and introduce an efficient algorithm based on rank-one unconstrained Kronecker product approximation. The convergence rates of the proposed estimators are established, and the derived minimax lower bound shows that our proposed estimator is rate-optimal under certain divergence regimes of matrix size. We further introduce a class of robust covariance estimators and provide theoretical guarantees to deal with heavy-tailed data. We demonstrate the superior finite-sample performance of our methods using simulations and real applications from a gridded temperature anomalies dataset and an S&P 500 stock data analysis.
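The rank-one unconstrained Kronecker product approximation mentioned above can be sketched with a rearrangement followed by a truncated SVD (the classical nearest-Kronecker-product construction). The sketch below shows only that step, on a hypothetical exactly separable covariance. The banding, regularization, and robust estimators of the paper are not included, and the function name is an assumption.

```python
import numpy as np

def nearest_kronecker(S, p, q):
    """Best Frobenius-norm fit S ~ kron(A, B) with A of size p x p and B of size q x q."""
    # Rearrange S so that kron(A, B) maps to the rank-one matrix vec(A) vec(B)^T.
    R = S.reshape(p, q, p, q).transpose(0, 2, 1, 3).reshape(p * p, q * q)
    U, s, Vt = np.linalg.svd(R, full_matrices=False)
    A = np.sqrt(s[0]) * U[:, 0].reshape(p, p)
    B = np.sqrt(s[0]) * Vt[0].reshape(q, q)
    if np.trace(A) < 0:  # resolve the sign ambiguity of the SVD
        A, B = -A, -B
    return A, B

# Hypothetical separable covariance built from two decaying (bandable-type) factors.
p, q = 3, 4
A0 = 0.5 ** np.abs(np.subtract.outer(np.arange(p), np.arange(p)))
B0 = 0.3 ** np.abs(np.subtract.outer(np.arange(q), np.arange(q)))
S = np.kron(A0, B0)
A_hat, B_hat = nearest_kronecker(S, p, q)
print(np.allclose(np.kron(A_hat, B_hat), S))
```

The two factors are identified only up to a scalar (scaling A by c and B by 1/c leaves the product unchanged), so the check compares the Kronecker product rather than the factors themselves.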

Learning · Representation learning · Representation · Machine Learning · Predictor/Decision function

April 18, 2022

Empirical Evaluation and Theoretical Analysis for Representation Learning: A Survey

Kento Nozawa, Issei Sato

From arXiv. The extended version of "Kento Nozawa and Issei Sato. Evaluation Methods for Representation Learning: A Survey. In IJCAI-ECAI Survey Track, 2022."

Representation learning enables us to automatically extract generic feature representations from a dataset to solve another machine learning task. Recently, feature representations extracted by a representation learning algorithm, combined with a simple predictor, have exhibited state-of-the-art performance on several machine learning tasks. Despite this remarkable progress, there exist various ways to evaluate representation learning algorithms depending on the application, because of the flexibility of representation learning. To understand the current state of representation learning, we review evaluation methods of representation learning algorithms and theoretical analyses. On the basis of our evaluation survey, we also discuss the future direction of representation learning. Note that this survey is the extended version of Nozawa and Sato (2022).

Reducible · Server · Edge · Continuity · Performer

April 15, 2022

Server Free Wireless Federated Learning: Architecture, Algorithm, and Analysis

Howard H. Yang, Zihan Chen, Tony Q. S. Quek

We demonstrate that analog transmissions and matched filtering alone can realize the function of an edge server in federated learning (FL). Therefore, a network with massively distributed user equipments (UEs) can achieve large-scale FL without an edge server. We also develop a training algorithm that allows UEs to continuously perform local computing without being interrupted by the global parameter uploading, which exploits the full potential of the UEs' processing power. We derive convergence rates for the proposed schemes to quantify their training efficiency. The analyses reveal that when the interference obeys a Gaussian distribution, the proposed algorithm retrieves the convergence rate of a server-based FL. But if the interference distribution is heavy-tailed, then the heavier the tail, the slower the algorithm converges. Nonetheless, the system run time can be largely reduced by enabling computation in parallel with communication, and the gain is particularly pronounced when communication latency is high. These findings are corroborated via extensive simulations.

Discretization · Minimizer · Path · Performer · Computational cost

April 15, 2022

Convergence of the Discrete Minimum Energy Path

Xuanyu Liu, Huajie Chen, Christoph Ortner

From arXiv. arXiv admin note: text overlap with arXiv:2204.00984

The minimum energy path (MEP) describes the mechanism of a reaction, and the energy barrier along the path can be used to calculate the reaction rate in thermal systems. The nudged elastic band (NEB) method is one of the most commonly used schemes to compute MEPs numerically. It approximates an MEP by a discrete set of configuration images, where the discretization size determines both the computational cost and the accuracy of the simulations. In this paper, we consider a discrete MEP to be a stationary state of the NEB method and prove an optimal convergence rate of the discrete MEP with respect to the number of images. Numerical simulations for the transitions of several prototypical model systems are performed to support the theory.

Learning · Neural Networks · Reinforcement learning · Deep reinforcement learning · Knowledge

April 14, 2022

Methodical Advice Collection and Reuse in Deep Reinforcement Learning

Sahir, Ercüment İlhan, Srijita Das, Matthew E. Taylor

From arXiv. To be published in ALA2022: Adaptive and Learning Agents Workshop 2022 at AAMAS

Reinforcement learning (RL) has shown great success in solving many challenging tasks via use of deep neural networks. Although using deep learning for RL brings immense representational power, it also causes a well-known sample-inefficiency problem. This means that the algorithms are data-hungry and require millions of training samples to converge to an adequate policy. One way to combat this issue is to use action advising in a teacher-student framework, where a knowledgeable teacher provides action advice to help the student. This work considers how to better leverage uncertainties about when a student should ask for advice and if the student can model the teacher to ask for less advice. The student could decide to ask for advice when it is uncertain or when both it and its model of the teacher are uncertain. In addition to this investigation, this paper introduces a new method to compute uncertainty for a deep RL agent using a secondary neural network. Our empirical results show that using dual uncertainties to drive advice collection and reuse may improve learning performance across several Atari games.

INFORMS · Representation theorem · Exchangeable · Relative entropy · Recall

April 14, 2022

Information in probability: Another information-theoretic proof of a finite de Finetti theorem

Lampros Gavalakis, Ioannis Kontoyiannis

From arXiv. Small changes from the previous version, including a few more references and clarifications in the Introduction

We recall some of the history of the information-theoretic approach to deriving core results in probability theory and indicate parts of the recent resurgence of interest in this area with current progress along several interesting directions. Then we give a new information-theoretic proof of a finite version of de Finetti's classical representation theorem for finite-valued random variables. We derive an upper bound on the relative entropy between the distribution of the first $k$ in a sequence of $n$ exchangeable random variables, and an appropriate mixture over product distributions. The mixing measure is characterised as the law of the empirical measure of the original sequence, and de Finetti's result is recovered as a corollary. The proof is nicely motivated by the Gibbs conditioning principle in connection with statistical mechanics, and it follows along an appealing sequence of steps. The technical estimates required for these steps are obtained via the use of a collection of combinatorial tools known within information theory as "the method of types."

Learning · Surrogate loss · Online · Bandits · Slot machine (bandit)

December 31, 2019

A Modern Introduction to Online Learning

Francesco Orabona

In this monograph, I introduce the basic concepts of Online Learning through a modern view of Online Convex Optimization. Here, online learning refers to the framework of regret minimization under worst-case assumptions. I present first-order and second-order algorithms for online learning with convex losses, in Euclidean and non-Euclidean settings. All the algorithms are clearly presented as instantiations of Online Mirror Descent or Follow-The-Regularized-Leader and their variants. Particular attention is given to the issue of tuning the parameters of the algorithms and learning in unbounded domains, through adaptive and parameter-free online learning algorithms. Non-convex losses are dealt with through convex surrogate losses and through randomization. The bandit setting is also briefly discussed, touching on the problem of adversarial and stochastic multi-armed bandits. These notes do not require prior knowledge of convex analysis, and all the required mathematical tools are rigorously explained. Moreover, all the proofs have been carefully chosen to be as simple and as short as possible.
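As a small illustration of regret minimization, the sketch below runs projected online gradient descent, the Euclidean instance of Online Mirror Descent, on linear losses over a ball. It then compares the realized regret with the textbook worst-case bound $\frac{3}{2} G D \sqrt{T}$ for step sizes $D / (G \sqrt{t})$. The loss sequence, the names, and the choice of domain are assumptions for this example and are not taken from the monograph.

```python
import numpy as np

def project_ball(x, radius):
    """Euclidean projection onto the ball of the given radius."""
    norm = np.linalg.norm(x)
    return x if norm <= radius else x * (radius / norm)

def ogd_regret(gradients, radius=1.0):
    """Regret of projected OGD on linear losses f_t(x) = <g_t, x> over a ball."""
    T, d = gradients.shape
    G = np.linalg.norm(gradients, axis=1).max()  # bound on gradient norms
    diameter = 2.0 * radius
    x = np.zeros(d)
    total_loss = 0.0
    for t in range(T):
        g = gradients[t]
        total_loss += g @ x                    # suffer the loss at the current point
        step = diameter / (G * np.sqrt(t + 1))
        x = project_ball(x - step * g, radius)
    # Best fixed point in hindsight for linear losses on a ball.
    best_fixed_loss = -radius * np.linalg.norm(gradients.sum(axis=0))
    bound = 1.5 * G * diameter * np.sqrt(T)
    return total_loss - best_fixed_loss, bound

# Hypothetical loss sequence: i.i.d. Gaussian gradient vectors.
rng = np.random.default_rng(1)
regret, bound = ogd_regret(rng.normal(size=(1000, 5)))
print(f"regret={regret:.2f}, worst-case bound={bound:.2f}")
```

The bound holds for any gradient sequence, including adversarial ones. On random data like this the realized regret is typically far below it.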