We propose a data-driven way to reduce the noise in covariance matrices of nonstationary systems. For stationary systems, asymptotic approaches have been proved to converge to optimal solutions; such methods produce eigenvalues that are highly dependent on the inputs, as common sense would suggest. Our approach instead uses a set of eigenvalues that is entirely independent of the inputs and that encodes the long-term average influence of the future on present eigenvalues. Such influence can be the predominant factor in nonstationary systems. Using real and synthetic data, we show that our data-driven method outperforms optimal methods designed for stationary systems when filtering both the covariance matrix and its inverse, as illustrated by financial portfolio variance minimization, which makes our method generically relevant to many problems of multivariate inference.
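The portfolio application mentioned above can be made concrete with a minimal sketch. The snippet below is not the paper's estimator; it illustrates the generic pipeline with a standard baseline, eigenvalue "clipping" of the sample covariance, followed by minimum-variance portfolio weights $w = C^{-1}\mathbf{1} / (\mathbf{1}^\top C^{-1}\mathbf{1})$. The number of kept eigenvalues and the synthetic data are assumptions for illustration only.

```python
import numpy as np

def clip_eigenvalues(cov, n_keep):
    """Baseline eigenvalue 'clipping': keep the n_keep largest eigenvalues
    and replace the remaining (noisy bulk) ones by their average.
    This preserves the trace of the covariance matrix."""
    vals, vecs = np.linalg.eigh(cov)          # eigenvalues in ascending order
    vals_f = vals.copy()
    vals_f[:-n_keep] = vals[:-n_keep].mean()
    return (vecs * vals_f) @ vecs.T           # reassemble V diag(vals_f) V'

def min_variance_weights(cov):
    """Minimum-variance portfolio: w = C^{-1} 1 / (1' C^{-1} 1)."""
    ones = np.ones(cov.shape[0])
    w = np.linalg.solve(cov, ones)
    return w / w.sum()

rng = np.random.default_rng(0)
returns = rng.standard_normal((250, 50))      # toy data: T=250 days, N=50 assets
sample_cov = np.cov(returns, rowvar=False)
filtered = clip_eigenvalues(sample_cov, n_keep=5)
w = min_variance_weights(filtered)
print(w.sum())                                # weights sum to 1
```

Filtering before inversion matters because the smallest (noisiest) sample eigenvalues dominate $C^{-1}$ and hence the weights.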
This paper studies the impact of the bootstrap procedure on the eigenvalue distributions of the sample covariance matrix under a high-dimensional factor structure. We provide asymptotic distributions for the top eigenvalues of the bootstrapped sample covariance matrix under mild conditions. After bootstrapping, the spiked eigenvalues, which are driven by common factors, converge weakly to Gaussian limits after proper scaling and centering. In contrast, the largest non-spiked eigenvalue is mainly determined by the order statistics of the bootstrap resampling weights and follows an extreme value distribution. Based on the disparate behavior of the spiked and non-spiked eigenvalues, we propose innovative methods to test the number of common factors. According to our simulations and a real data example, the proposed methods are the only ones that perform reliably and convincingly in the presence of both weak factors and cross-sectionally correlated errors. Our technical details contribute to random matrix theory on the spiked covariance model with convexly decaying density and unbounded support, or with general elliptical distributions.
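The contrast between spiked and non-spiked eigenvalues under resampling can be seen in a small simulation. This is a hedged sketch, not the paper's test statistic: it draws data from a toy two-factor model, applies a plain nonparametric bootstrap (resampling rows), and records the top eigenvalues of each bootstrapped sample covariance. The factor strength, dimensions, and number of replicates are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
n, p, k = 200, 100, 2                       # samples, dimension, number of factors

# Toy factor model: X = F B + noise, producing k spiked eigenvalues.
F = rng.standard_normal((n, k))
B = rng.standard_normal((k, p)) * 3.0
X = F @ B + rng.standard_normal((n, p))

def top_eigs(data, m):
    """Top m eigenvalues of the sample covariance, in descending order."""
    vals = np.linalg.eigvalsh(np.cov(data, rowvar=False))
    return vals[::-1][:m]

# Nonparametric bootstrap: resample rows with replacement, re-estimate.
boot = np.array([top_eigs(X[rng.integers(0, n, n)], k + 1)
                 for _ in range(200)])

# Columns 0..k-1 (spiked) fluctuate around large Gaussian-like limits;
# column k (largest non-spiked) sits near the bulk edge and behaves differently.
print(boot.mean(axis=0))
```

The clear gap between the $k$-th and $(k+1)$-th bootstrapped eigenvalues is what a factor-number test can exploit.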
Mixtures of experts (MoE) models are a popular framework for modeling heterogeneity in data, for both regression and classification problems in statistics and machine learning, due to their flexibility and the abundance of available statistical estimation and model choice tools. Such flexibility comes from allowing the mixture weights (or gating functions) in the MoE model to depend on the explanatory variables, along with the experts (or component densities). This permits the modeling of data arising from more complex data generating processes, compared to classical finite mixtures and finite mixtures of regression models, whose mixing parameters are independent of the covariates. The use of MoE models in a high-dimensional setting, where the number of explanatory variables can be much larger than the sample size, is challenging from a computational point of view, and especially from a theoretical point of view, where the literature still lacks results for dealing with the curse of dimensionality in both the statistical estimation and feature selection problems. We consider the finite MoE model with soft-max gating functions and Gaussian experts for high-dimensional regression on heterogeneous data, and its $l_1$-regularized estimation via the Lasso. We focus on the Lasso estimation properties rather than its feature selection properties. We provide a lower bound on the regularization parameter of the Lasso that ensures an $l_1$-oracle inequality for the Lasso estimator with respect to the Kullback--Leibler loss.
In this paper we obtain quantitative Bernstein--von Mises type bounds on the normal approximation of the posterior distribution in exponential family models, centering either around the posterior mode or around the maximum likelihood estimator. Our bounds, obtained through a version of Stein's method, are non-asymptotic and data-dependent; they are of the correct order both in the total variation and Wasserstein distances, as well as for approximating expectations of smooth functions of the posterior. All our results are valid for univariate and multivariate posteriors alike and do not require a conjugate prior setting. We illustrate our findings on a variety of exponential family distributions, including the Poisson, the multinomial, and the normal distribution with unknown mean and variance. The resulting bounds depend explicitly on the prior distribution and on the sufficient statistics of the observed sample, and thus provide insight into how these factors may affect the quality of the normal approximation. The performance of the bounds is also assessed with simulations.
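The phenomenon being bounded can be visualized numerically. The sketch below is not the paper's Stein-method bound; it simply measures, for Poisson data with a Gamma prior (one of the simplest exponential family examples), the total variation distance between the exact posterior and a normal approximation centered at the posterior mode with the Laplace (curvature-based) variance. The prior hyperparameters and sample size are illustrative assumptions.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(2)
n, lam = 200, 3.0
x = rng.poisson(lam, n)

# Gamma(a, b) prior on the Poisson rate; posterior is Gamma(a + sum x, b + n).
a, b = 2.0, 1.0
shape, rate = a + x.sum(), b + n
post = stats.gamma(shape, scale=1.0 / rate)

# Normal approximation: center at the posterior mode (shape-1)/rate,
# variance from the negative inverse Hessian of the log posterior at the mode.
mode = (shape - 1) / rate
approx = stats.norm(mode, np.sqrt(shape - 1) / rate)

# Total variation distance via a Riemann sum on a fine grid.
grid = np.linspace(mode - 6 * approx.std(), mode + 6 * approx.std(), 4001)
dx = grid[1] - grid[0]
tv = 0.5 * np.abs(post.pdf(grid) - approx.pdf(grid)).sum() * dx
print(f"TV distance ~ {tv:.4f}")   # shrinks roughly like n**(-1/2)
```

Rerunning with larger $n$ shows the distance decaying at the familiar $n^{-1/2}$ rate that the quantitative bounds make explicit.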
For discretely observed functional data, estimating eigenfunctions with diverging index is essential in nearly all methods based on functional principal components analysis. In this paper, we propose a new approach to handle each term appearing in the perturbation series and overcome the summability issue caused by the estimation bias. We obtain moment bounds for the eigenfunctions and eigenvalues over a wide range of sampling rates. We show that, under mild assumptions, the moment bound for eigenfunctions with diverging indices is optimal in the minimax sense. This is the first attempt at obtaining an optimal rate for eigenfunctions with diverging index for discretely observed functional data. Our results bridge the gap in theory between the ideal estimation from fully observed functional data and the reality that observations are taken at discrete time points with noise; this has its own merits in models involving inverse problems and deserves further investigation.
Investigators are increasingly using novel methods for extending (generalizing or transporting) causal inferences from a trial to a target population. In many generalizability and transportability analyses, the trial and the observational data from the target population are separately sampled, following a non-nested trial design. In practical implementations of this design, non-randomized individuals from the target population are often identified by conditioning on the use of a particular treatment, while individuals who used other candidate treatments for the same indication or individuals who did not use any treatment are excluded. In this paper, we argue that conditioning on treatment in the target population changes the estimand of generalizability and transportability analyses and potentially introduces serious bias in the estimation of causal estimands in the target population or the subset of the target population using a specific treatment. Furthermore, we argue that the naive application of marginalization-based or weighting-based standardization methods does not produce estimates of any reasonable causal estimand. We use causal graphs and counterfactual arguments to characterize the identification problems induced by conditioning on treatment in the target population and illustrate the problems using simulated data. We conclude by considering the implications of our findings for applied work.
A two-dimensional eigenvalue problem (2DEVP) for a Hermitian matrix pair $(A, C)$ is introduced in this paper. The 2DEVP can be viewed as a linear algebraic formulation of the well-known eigenvalue optimization problem for the parameter-dependent matrix $H(\mu) = A - \mu C$. We present fundamental properties of the 2DEVP, such as existence, a necessary and sufficient condition for the number of 2D-eigenvalues to be finite, and variational characterizations. We use eigenvalue optimization problems arising from the minimax of two Rayleigh quotients and from the computation of the distance to instability to show their connections with the 2DEVP and the new insights into these problems that its properties yield.
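The eigenvalue optimization problem underlying the 2DEVP is easy to sketch numerically. The snippet below is illustrative only (a crude grid search, not an algorithm from the paper): for a random real symmetric pair $(A, C)$ it maximizes $\lambda_{\min}(A - \mu C)$ over $\mu$, exploiting the fact that the smallest eigenvalue of an affine Hermitian family is a concave function of the parameter.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 6
A = rng.standard_normal((n, n)); A = (A + A.T) / 2   # real symmetric (Hermitian)
C = rng.standard_normal((n, n)); C = (C + C.T) / 2

def lam_min(mu):
    """Smallest eigenvalue of H(mu) = A - mu*C."""
    return np.linalg.eigvalsh(A - mu * C)[0]

# lam_min(mu) = min_{|x|=1} x'(A - mu*C)x is a minimum of affine functions
# of mu, hence concave; a grid search locates its maximizer mu*.
mus = np.linspace(-5.0, 5.0, 2001)
vals = np.array([lam_min(m) for m in mus])
i = vals.argmax()
print(f"mu* ~ {mus[i]:.3f}, max lam_min ~ {vals[i]:.4f}")
```

At a smooth maximizer, a minimizing unit eigenvector $x$ of $H(\mu^*)$ satisfies the stationarity condition $x^* C x = 0$, which is the kind of coupled condition the 2DEVP encodes algebraically.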
For the first time, a nonlinear interface problem on an unbounded domain with nonmonotone set-valued transmission conditions is analyzed. The investigated problem involves a nonlinear monotone partial differential equation in the interior domain and the Laplacian in the exterior domain. Such a scalar interface problem models nonmonotone frictional contact of infinite elastic media. The variational formulation of the interface problem leads to a hemivariational inequality (HVI) that lives on the unbounded domain and therefore cannot be treated numerically in a direct way. By boundary integral methods, the problem is transformed into a novel HVI that lives only on the interior domain and the coupling boundary. Thus, the coupling of finite elements and boundary elements is the method of choice for discretization. In addition, smoothing techniques from nondifferentiable optimization are adapted, and the nonsmooth part of the HVI is regularized. In this way we reduce the original variational problem to a finite-dimensional problem that can be solved by standard optimization tools. We establish not only convergence results for the total approximation procedure but also an asymptotic error estimate for the regularized HVI.
Traditional nonparametric estimation methods often suffer from slow convergence rates in large dimensions and require unrealistically large datasets for reliable conclusions. We develop an approach based on mixed gradients, either observed or estimated, that estimates the function at near-parametric convergence rates. The novel approach and computational algorithm could lead to methods useful to practitioners in many areas of science and engineering. Our theoretical results reveal a behavior universal to this class of nonparametric estimation problems. We explore a general setting involving tensor product spaces and build upon the smoothing spline analysis of variance (SS-ANOVA) framework. For $d$-dimensional models under full interaction, the optimal rates with gradient information on $p$ covariates are identical to those for $(d-p)$-interaction models without gradients; the models are therefore immune to the "curse of interaction". For additive models, the optimal rates using gradient information are root-$n$, thus achieving the "parametric rate". We demonstrate aspects of the theoretical results through synthetic and real data applications.
Consider a set of points sampled independently near a smooth compact submanifold of Euclidean space. We provide mathematically rigorous bounds on the number of sample points required to estimate both the dimension and the tangent spaces of that manifold with high confidence. The algorithm for this estimation is Local PCA, a local version of principal component analysis. Our results accommodate noisy, non-uniform data distributions, with noise that may vary across the manifold, and allow simultaneous estimation at multiple points. Crucially, all of the constants appearing in our bounds are explicitly described. The proof uses a matrix concentration inequality to estimate covariance matrices and a Wasserstein distance bound to quantify the nonlinearity of the underlying manifold and the non-uniformity of the probability measure.
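Local PCA itself is simple to sketch. The snippet below is a minimal illustration, not the paper's analyzed procedure: noisy samples from a circle (a one-dimensional manifold) embedded in $\mathbb{R}^3$, a PCA of the points in a small ball around a query point, and a dimension estimate from the largest spectral gap. The neighborhood radius and noise level are assumptions chosen for the toy example.

```python
import numpy as np

rng = np.random.default_rng(4)
# Noisy samples from a unit circle embedded in R^3 (intrinsic dimension 1).
t = rng.uniform(0, 2 * np.pi, 2000)
X = np.c_[np.cos(t), np.sin(t), np.zeros_like(t)]
X += 0.01 * rng.standard_normal(X.shape)

def local_pca(points, x0, radius):
    """PCA of the points inside a ball around x0. The leading eigenvectors
    estimate the tangent space; a spectral gap reveals the dimension."""
    nb = points[np.linalg.norm(points - x0, axis=1) < radius]
    nb = nb - nb.mean(axis=0)
    vals, vecs = np.linalg.eigh(nb.T @ nb / len(nb))
    return vals[::-1], vecs[:, ::-1]          # descending order

vals, vecs = local_pca(X, X[0], radius=0.2)
# Estimate the dimension by the largest relative eigenvalue gap.
d_hat = 1 + int(np.argmax(vals[:-1] / np.maximum(vals[1:], 1e-12)))
print(d_hat)
```

The sample-complexity bounds in the abstract answer the quantitative question this sketch leaves open: how many points, and how small a radius, guarantee that `d_hat` and the leading eigenvectors are correct with high probability.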
Variational Bayesian posterior inference often requires simplifying approximations such as mean-field parametrisation to ensure tractability. However, prior work has associated the variational mean-field approximation for Bayesian neural networks with underfitting in the case of small datasets or large model sizes. In this work, we show that invariances in the likelihood function of over-parametrised models contribute to this phenomenon because these invariances complicate the structure of the posterior by introducing discrete and/or continuous modes which cannot be well approximated by Gaussian mean-field distributions. In particular, we show that the mean-field approximation has an additional gap in the evidence lower bound compared to a purpose-built posterior that takes into account the known invariances. Importantly, this invariance gap is not constant; it vanishes as the approximation reverts to the prior. We proceed by first considering translation invariances in a linear model with a single data point in detail. We show that, while the true posterior can be constructed from a mean-field parametrisation, this is achieved only if the objective function takes into account the invariance gap. Then, we transfer our analysis of the linear model to neural networks. Our analysis provides a framework for future work to explore solutions to the invariance problem.