Multivariate, heteroscedastic errors complicate statistical inference in many large-scale denoising problems. Empirical Bayes is attractive in such settings, but standard parametric approaches rest on assumptions about the form of the prior distribution that can be hard to justify and that introduce unnecessary tuning parameters. We extend the nonparametric maximum likelihood estimator (NPMLE) for Gaussian location mixture densities to allow for multivariate, heteroscedastic errors. NPMLEs estimate an arbitrary prior by solving an infinite-dimensional, convex optimization problem; we show that this convex optimization problem can be tractably approximated by a finite-dimensional version. We introduce a dual mixture density whose modes contain the atoms of every NPMLE, and we leverage the dual both to show non-uniqueness in multivariate settings and to construct explicit bounds on the support of the NPMLE. The empirical Bayes posterior means based on an NPMLE have low regret, meaning they closely target the oracle posterior means one would compute with the true prior in hand. We prove an oracle inequality implying that the empirical Bayes estimator performs at nearly the optimal level (up to logarithmic factors) for denoising without prior knowledge. We provide finite-sample bounds on the average Hellinger accuracy of an NPMLE for estimating the marginal densities of the observations. We also demonstrate the adaptive and nearly-optimal properties of NPMLEs for deconvolution. We apply the method to two astronomy datasets, constructing a fully data-driven color-magnitude diagram of 1.4 million stars in the Milky Way and investigating the distribution of chemical abundance ratios for 27 thousand stars in the red clump.
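As a concrete illustration of the finite-dimensional approximation mentioned above, the following is a minimal sketch, assuming univariate observations $x_i \sim N(\theta_i, s_i^2)$ with known heteroscedastic noise levels and a fixed grid of candidate atoms; the grid, the EM-style solver, and all names are illustrative, not the paper's multivariate procedure.

```python
# Minimal NPMLE sketch: fix a grid of atoms, then maximize the (convex)
# marginal log-likelihood over the prior weights with the classical EM
# fixed point. The Gaussian normalizing constant is omitted; it cancels
# in every ratio used below.
import numpy as np

def npmle_em(x, s, grid, n_iter=500):
    """Estimate prior weights w on `grid` maximizing
    sum_i log( sum_k w_k N(x_i; grid_k, s_i^2) )."""
    L = np.exp(-0.5 * ((x[:, None] - grid[None, :]) / s[:, None]) ** 2) / s[:, None]
    w = np.full(grid.size, 1.0 / grid.size)
    for _ in range(n_iter):
        marg = L @ w                           # marginal density at each x_i
        w *= (L / marg[:, None]).mean(axis=0)  # EM/multiplicative update
    return w

def posterior_mean(x, s, grid, w):
    """Empirical Bayes posterior means E[theta | x] under the fitted prior."""
    L = np.exp(-0.5 * ((x[:, None] - grid[None, :]) / s[:, None]) ** 2) / s[:, None]
    post = L * w[None, :]
    return (post @ grid) / post.sum(axis=1)
```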
We consider online sequential decision problems where an agent must balance exploration and exploitation. We derive a set of Bayesian `optimistic' policies which, in the stochastic multi-armed bandit case, includes the Thompson sampling policy. We provide a new analysis showing that any algorithm producing policies in the optimistic set enjoys $\tilde O(\sqrt{AT})$ Bayesian regret for a problem with $A$ actions after $T$ rounds. We extend the regret analysis for optimistic policies to bilinear saddle-point problems, which include zero-sum matrix games and constrained bandits as special cases. In this case we show that Thompson sampling can produce policies outside of the optimistic set and suffer linear regret in some instances. Finding a policy inside the optimistic set amounts to solving a convex optimization problem, and we call the resulting algorithm `variational Bayesian optimistic sampling' (VBOS). The procedure works for any posteriors, i.e., it does not require the posterior to have any special properties, such as log-concavity, unimodality, or smoothness. The variational view of the problem has many useful properties, including the ability to tune the exploration-exploitation tradeoff, add regularization, incorporate constraints, and linearly parameterize the policy.
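For reference, here is a minimal sketch of Thompson sampling for Bernoulli bandits, which the abstract identifies as one member of the optimistic set in the multi-armed bandit case; the Beta-Bernoulli conjugacy and the `pull` interface are assumptions of this sketch, and the VBOS convex-optimization step is not reproduced.

```python
# Thompson sampling for an A-armed Bernoulli bandit: sample a mean vector
# from the posterior, act greedily on the sample, update the posterior.
import numpy as np

def thompson_bernoulli(pull, A, T, rng=np.random.default_rng(0)):
    alpha, beta = np.ones(A), np.ones(A)           # Beta(1,1) priors per arm
    for _ in range(T):
        a = int(np.argmax(rng.beta(alpha, beta)))  # posterior sample, then argmax
        r = pull(a)                                # observe reward in {0, 1}
        alpha[a] += r
        beta[a] += 1 - r
    return alpha, beta
```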
This paper deals with a projection least squares estimator of the function $J_0$, computed from multiple independent observations on $[0,T]$ of the process $Z$ defined by $dZ_t = J_0(t)d\langle M\rangle_t + dM_t$, where $M$ is a centered, continuous, square-integrable martingale vanishing at $0$. Risk bounds are established on this estimator and on an associated adaptive estimator. An appropriate transformation allows one to rewrite the differential equation $dX_t = V(X_t)(b_0(t)dt +\sigma(t)dB_t)$, where $B$ is a fractional Brownian motion with Hurst parameter $H\in (1/2,1)$, as a model of the previous type. The second part of the paper therefore deals with risk bounds on a nonparametric estimator of $b_0$ derived from the results on the projection least squares estimator of $J_0$. In particular, our results apply to the estimation of the drift function in a non-autonomous extension of the fractional Black-Scholes model introduced in Hu et al. (2003).
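To make the projection idea concrete, the following toy sketch takes the simplest case $M$ a standard Brownian motion (so $d\langle M\rangle_t = dt$) and a trigonometric basis on $[0,T]$; it estimates the basis coefficients $\int_0^T \varphi_j(t) J_0(t)\,dt$ by averaging discretized stochastic integrals $\int \varphi_j\,dZ$ over the independent paths. The discretization and names are illustrative, not the paper's adaptive estimator.

```python
# Projection least squares sketch: unbiasedness follows from
# E[int phi_j dZ] = int phi_j(t) J_0(t) dt when <M>_t = t.
import numpy as np

def projection_lse(Z, t, m):
    """Z: (n_paths, n_times) discretized paths on the grid t;
    m: number of basis functions. Returns a callable estimate of J_0."""
    T = t[-1]
    dZ = np.diff(Z, axis=1)             # path increments
    tm = 0.5 * (t[:-1] + t[1:])         # midpoints of each time step
    def phi(j, s):
        return (np.sqrt(1.0 / T) * np.ones_like(s) if j == 0
                else np.sqrt(2.0 / T) * np.cos(j * np.pi * s / T))
    # coef_j: mean over paths of the discretized integral int phi_j dZ
    coef = np.array([(phi(j, tm)[None, :] * dZ).sum(axis=1).mean()
                     for j in range(m)])
    return lambda s: sum(c * phi(j, np.asarray(s, dtype=float))
                         for j, c in enumerate(coef))
```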
We consider \emph{Gibbs distributions}, which are families of probability distributions over a discrete space $\Omega$ with probability mass function of the form $\mu^\Omega_\beta(\omega) \propto e^{\beta H(\omega)}$ for $\beta$ in an interval $[\beta_{\min}, \beta_{\max}]$ and $H( \omega ) \in \{0 \} \cup [1, n]$. The \emph{partition function} is the normalization factor $Z(\beta)=\sum_{\omega \in\Omega}e^{\beta H(\omega)}$. Two important parameters of these distributions are the log partition ratio $q = \log \tfrac{Z(\beta_{\max})}{Z(\beta_{\min})}$ and the counts $c_x = |H^{-1}(x)|$. These are correlated with system parameters in a number of physical applications and sampling algorithms. Our first main result is an algorithm that estimates the counts $c_x$ using roughly $\tilde O( \frac{q}{\varepsilon^2})$ samples for general Gibbs distributions and $\tilde O( \frac{n^2}{\varepsilon^2} )$ samples for integer-valued distributions (ignoring some second-order terms and parameters), and we show this is optimal up to logarithmic factors. We illustrate with improved algorithms for counting connected subgraphs and perfect matchings in a graph. We develop a key subroutine to estimate the partition function $Z$. Specifically, it generates a data structure to estimate $Z(\beta)$ for \emph{all} values $\beta$, without further samples. Constructing the data structure requires $O(\frac{q \log n}{\varepsilon^2})$ samples for general Gibbs distributions and $O(\frac{n^2 \log n}{\varepsilon^2} + n \log q)$ samples for integer-valued distributions. This improves over a prior algorithm of Huber (2015), which computes a single point estimate $Z(\beta_{\max})$ using $O( q \log n( \log q + \log \log n + \varepsilon^{-2}))$ samples. We show matching lower bounds, demonstrating that this complexity is optimal as a function of $n$ and $q$ up to logarithmic terms.
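A minimal sketch of the classical telescoping identity underlying such partition-ratio estimators: since $Z(\beta')/Z(\beta) = \mathbb{E}_{\mu_\beta}[e^{(\beta'-\beta)H(\omega)}]$, the log ratio $q$ accumulates along a cooling schedule. The oracle `sampler` is an assumption of this sketch; the paper's all-$\beta$ data structure is not reproduced.

```python
# Telescoping estimator of q = log Z(beta_max)/Z(beta_min):
# each ratio along the schedule is a simple sample mean under mu_beta.
import numpy as np

def log_partition_ratio(sampler, schedule, k):
    """sampler(beta, k): assumed oracle returning k draws of H(w) under
    mu_beta; schedule: increasing betas from beta_min to beta_max."""
    logq = 0.0
    for b, b_next in zip(schedule[:-1], schedule[1:]):
        H = np.asarray(sampler(b, k))
        logq += np.log(np.mean(np.exp((b_next - b) * H)))
    return logq
```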
Quantile (and, more generally, KL) regret bounds, such as those achieved by NormalHedge (Chaudhuri, Freund, and Hsu 2009) and its variants, relax the goal of competing against the best individual expert to only competing against a majority of experts on adversarial data. More recently, the semi-adversarial paradigm (Bilodeau, Negrea, and Roy 2020) provides an alternative relaxation of adversarial online learning by considering data that may be neither fully adversarial nor stochastic (i.i.d.). We achieve the minimax optimal regret in both paradigms using FTRL with separate, novel, root-logarithmic regularizers, both of which can be interpreted as yielding variants of NormalHedge. We extend existing KL regret upper bounds, which hold uniformly over target distributions, to possibly uncountable expert classes with arbitrary priors; provide the first full-information lower bounds for quantile regret on finite expert classes (which are tight); and provide an adaptively minimax optimal algorithm for the semi-adversarial paradigm that adapts to the true, unknown constraint faster, leading to uniformly improved regret bounds over existing methods.
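For orientation, here is the generic FTRL template over the simplex with the standard negative-entropy regularizer (i.e., exponential weights / Hedge); this is not the paper's root-logarithmic regularizer, only the scaffold into which such regularizers are plugged.

```python
# FTRL with negative entropy on the simplex reduces to exponential
# weights: play p_t proportional to exp(-eta * cumulative loss).
import numpy as np

def ftrl_hedge(losses, eta):
    """losses: (T, K) loss matrix over K experts. Returns learner's total loss."""
    T, K = losses.shape
    cum = np.zeros(K)
    total = 0.0
    for t in range(T):
        w = np.exp(-eta * (cum - cum.min()))  # shift by min for stability
        w /= w.sum()
        total += w @ losses[t]
        cum += losses[t]
    return total
```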
In this paper, we investigate the question: Given a small number of datapoints, for example N = 30, how tight can PAC-Bayes and test set bounds be made? For such small datasets, test set bounds adversely affect generalisation performance by withholding data from the training procedure. In this setting, PAC-Bayes bounds are especially attractive, due to their ability to use all the data to simultaneously learn a posterior and bound its generalisation risk. We focus on the case of i.i.d. data with a bounded loss and consider the generic PAC-Bayes theorem of Germain et al. While their theorem is known to recover many existing PAC-Bayes bounds, it is unclear what the tightest bound derivable from their framework is. For a fixed learning algorithm and dataset, we show that the tightest possible bound coincides with a bound considered by Catoni; and, in the more natural case of distributions over datasets, we establish a lower bound on the best bound achievable in expectation. Interestingly, this lower bound recovers the Chernoff test set bound if the posterior is equal to the prior. Moreover, to illustrate how tight these bounds can be, we study synthetic one-dimensional classification tasks in which it is feasible to meta-learn both the prior and the form of the bound to numerically optimise for the tightest bounds possible. We find that in this simple, controlled scenario, PAC-Bayes bounds are competitive with comparable, commonly used Chernoff test set bounds. However, the sharpest test set bounds still lead to better guarantees on the generalisation error than the PAC-Bayes bounds we consider.
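As a concrete example of evaluating such a bound, the following sketch computes the standard Seeger/Maurer `kl' PAC-Bayes bound, one instance recoverable from the Germain et al. framework, by inverting the binary KL with bisection; it is illustrative and not the tightest bound the paper studies.

```python
# kl(emp || risk) <= (KL(Q||P) + log(2 sqrt(n)/delta)) / n, inverted by
# bisection: find the largest risk consistent with the inequality.
import math

def kl_bernoulli(q, p):
    q = min(max(q, 1e-12), 1 - 1e-12)
    p = min(max(p, 1e-12), 1 - 1e-12)
    return q * math.log(q / p) + (1 - q) * math.log((1 - q) / (1 - p))

def pac_bayes_kl_bound(emp_risk, kl_qp, n, delta):
    rhs = (kl_qp + math.log(2 * math.sqrt(n) / delta)) / n
    lo, hi = emp_risk, 1.0
    for _ in range(60):                  # kl(q, .) increases on [q, 1]
        mid = 0.5 * (lo + hi)
        if kl_bernoulli(emp_risk, mid) > rhs:
            hi = mid
        else:
            lo = mid
    return hi
```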
Dimension is an inherent bottleneck for some modern learning tasks, where optimization methods suffer from the size of the data. In this paper, we study non-isotropic distributions of data and develop tools that aim at reducing these dimensional costs through a dependency on an effective dimension rather than the ambient one. Based on non-asymptotic estimates of the metric entropy of ellipsoids -- which prove to generalize to infinite dimensions -- and on a chaining argument, our uniform concentration bounds involve an effective dimension instead of the global dimension, improving over existing results. We show the importance of taking advantage of non-isotropic properties in learning problems with the following applications: i) we improve state-of-the-art results in statistical preconditioning for communication-efficient distributed optimization, and ii) we introduce a non-isotropic randomized smoothing for non-smooth optimization. Both applications cover a class of functions that encompasses empirical risk minimization (ERM) for linear models.
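A minimal sketch of non-isotropic randomized smoothing, assuming Gaussian perturbations $u \sim N(0, \Sigma)$: by Stein's identity, $\nabla_x\,\mathbb{E}[f(x+u)] = \mathbb{E}[\Sigma^{-1}u\,(f(x+u)-f(x))]$, so a covariance aligned with the data's effective dimension shapes the smoothing. The choice of $\Sigma$ and all names are assumptions of this sketch; the paper's precise scheme is not reproduced.

```python
# Zeroth-order gradient of the Gaussian-smoothed surrogate E[f(x+u)],
# u ~ N(0, Sigma), via Stein's identity with a control variate f(x).
import numpy as np

def smoothed_grad(f, x, Sigma, k, rng=np.random.default_rng(0)):
    L = np.linalg.cholesky(Sigma)
    U = rng.standard_normal((k, x.size)) @ L.T      # k draws u ~ N(0, Sigma)
    diffs = np.array([f(x + u) - f(x) for u in U])
    return np.linalg.solve(Sigma, (diffs[:, None] * U).mean(axis=0))
```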
We consider the problem of nonparametric estimation of the drift and diffusion coefficients of a Stochastic Differential Equation (SDE), based on $n$ independent replicates $\left\{X_i(t)\::\: t\in [0,1]\right\}_{1 \leq i \leq n}$, observed sparsely and irregularly on the unit interval, and subject to additive noise corruption. By \textit{sparse} we mean that the number of measurements per path can be arbitrary (as small as two) and remains constant with respect to $n$. We focus on time-inhomogeneous SDEs of the form $dX_t = \mu(t)X_t^{\alpha}dt + \sigma(t)X_t^{\beta}dW_t$, where $\alpha \in \{0,1\}$ and $\beta \in \{0,1/2,1\}$, which includes prominent examples such as Brownian motion, the Ornstein-Uhlenbeck process, geometric Brownian motion, and the Brownian bridge. Our estimators are constructed by relating the local (drift/diffusion) parameters of the diffusion to their global parameters (mean/covariance, and their derivatives) by means of an apparently novel PDE. This allows us to use methods inspired by functional data analysis (FDA) and pool information across the sparsely measured paths. The methodology we develop is fully nonparametric and avoids any functional-form specification of the time dependency of either the drift function or the diffusion function. We establish almost sure uniform asymptotic convergence rates of the proposed estimators as the number of observed curves $n$ grows to infinity. Our rates are non-asymptotic in the number of measurements per path, explicitly reflecting how different sampling frequencies might affect the speed of convergence. Our framework suggests possible further fruitful interactions between FDA and SDE methods in problems with replication.
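The abstract does not state the PDE linking local and global parameters, but a simple hedged illustration is the geometric Brownian motion case $\alpha=\beta=1$: writing $m(t)=\mathbb{E}[X_t]$ and $s(t)=\mathbb{E}[X_t^2]$, taking expectations and applying Itô's formula give
\[
m'(t) = \mu(t)\,m(t), \qquad s'(t) = \bigl(2\mu(t)+\sigma^2(t)\bigr)\,s(t),
\]
so $\mu(t) = \tfrac{d}{dt}\log m(t)$ and $\sigma^2(t) = \tfrac{d}{dt}\log s(t) - 2\mu(t)$; estimating $m$, $s$, and their derivatives by pooling the sparse paths then recovers the local parameters.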
Consider the task of matrix estimation in which a dataset $X \in \mathbb{R}^{n\times m}$ is observed with sparsity $p$, and we would like to estimate $\mathbb{E}[X]$, where $\mathbb{E}[X_{ui}] = f(\alpha_u, \beta_i)$ for some Hölder-smooth function $f$. We consider the setting where the row covariates $\alpha$ are unobserved yet the column covariates $\beta$ are observed. We provide an algorithm and accompanying analysis showing that our algorithm improves upon naively estimating each row separately when the number of rows is not too small. Furthermore, when the matrix is moderately proportioned, our algorithm achieves the minimax optimal nonparametric rate of an oracle algorithm that knows the row covariates. In simulated experiments we show that our algorithm outperforms other baselines in low-data regimes.
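For context, here is a toy sketch of the naive baseline the abstract compares against: estimating each row separately by Nadaraya-Watson regression of its observed entries on the observed column covariates $\beta$. The Gaussian kernel, bandwidth, and names are assumptions of this sketch; the paper's pooled algorithm (which borrows strength across rows) is not shown.

```python
# Row-by-row Nadaraya-Watson estimate of E[X] from partially observed data.
import numpy as np

def rowwise_nw(X, mask, beta, h):
    """X: (n, m) data (zeros where unobserved); mask: (n, m) 0/1 pattern;
    beta: (m,) column covariates; h: bandwidth."""
    K = np.exp(-0.5 * ((beta[:, None] - beta[None, :]) / h) ** 2)  # (m, m)
    num = (X * mask) @ K          # kernel-weighted sums of observed entries
    den = mask @ K                # kernel mass of observed entries
    return num / np.maximum(den, 1e-12)
```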
Implicit probabilistic models are models defined naturally in terms of a sampling procedure; they often induce a likelihood function that cannot be expressed explicitly. We develop a simple method for estimating parameters in implicit models that does not require knowledge of the form of the likelihood function or any derived quantities, but can be shown to be equivalent to maximizing likelihood under some conditions. Our result holds in the non-asymptotic parametric setting, where both the capacity of the model and the number of data examples are finite. We also demonstrate encouraging experimental results.
Discrete random structures are important tools in Bayesian nonparametrics, and the resulting models have proven effective in density estimation, clustering, topic modeling, and prediction, among other tasks. In this paper, we consider nested processes and study the dependence structures they induce. Dependence ranges between homogeneity, corresponding to full exchangeability, and maximum heterogeneity, corresponding to (unconditional) independence across samples. The popular nested Dirichlet process is shown to degenerate to the fully exchangeable case when there are ties across samples at the observed or latent level. To overcome this drawback, inherent to nesting general discrete random measures, we introduce a novel class of latent nested processes. These are obtained by adding common and group-specific completely random measures and then normalising to yield dependent random probability measures. We provide results on the partition distributions induced by latent nested processes, and develop a Markov chain Monte Carlo sampler for Bayesian inference. A test for distributional homogeneity across groups is obtained as a by-product. The results and their inferential implications are showcased on synthetic and real data.
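A toy finite-dimensional sketch of the additive construction described above: truncated gamma completely random measures for the shared component $\nu_0$ and each group-specific component $\nu_i$, added and then normalized into dependent random probability measures. The truncation level, the gamma CRM choice, and all names are assumptions of this sketch, not the paper's sampler.

```python
# Truncated gamma-CRM approximation: K atoms with Gamma(a/K, 1) weights
# approximate a gamma CRM of total mass a; normalizing yields a random
# probability measure. Each group shares nu_0 and adds its own nu_i.
import numpy as np

def latent_nested_sample(n_groups, K, a_shared, a_group, base_sampler,
                         rng=np.random.default_rng(0)):
    atoms0 = base_sampler(K, rng)                  # shared atoms
    w0 = rng.gamma(a_shared / K, 1.0, size=K)      # weights of nu_0
    measures = []
    for _ in range(n_groups):
        atoms_i = base_sampler(K, rng)             # group-specific atoms
        w_i = rng.gamma(a_group / K, 1.0, size=K)  # weights of nu_i
        atoms = np.concatenate([atoms0, atoms_i])
        w = np.concatenate([w0, w_i])
        measures.append((atoms, w / w.sum()))      # normalize nu_0 + nu_i
    return measures
```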