This paper uses techniques from Random Matrix Theory to find the ideal training-testing data split for a simple linear regression with m data points, each an independent n-dimensional multivariate Gaussian. It defines "ideal" as satisfying the integrity metric, i.e. the empirical model error equals the actual measurement noise and thus fairly reflects the value, or lack thereof, of the model. This paper is the first to solve for training and test set sizes that are truly optimal in this sense for any model. The number of data points in the training set is a root of a quartic polynomial, derived in Theorem 1, that depends only on m and n; the covariance matrix of the multivariate Gaussian, the true model parameters, and the true measurement noise all drop out of the calculations. The critical mathematical steps were recognizing that the problems herein fall within the framework of the Jacobi Ensemble, a probability distribution describing the eigenvalues of a known random matrix model, and evaluating a new integral in the style of Selberg and Aomoto. The mathematical results are supported with thorough computational evidence. This paper is a step towards automatic choices of training/test set sizes in machine learning.
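As a rough numerical illustration of the quantity the integrity metric compares, the sketch below fits ordinary least squares on a training split of synthetic Gaussian data and tabulates the empirical test error against the true noise variance $\sigma^2$. The dimensions, noise level, and identity covariance are illustrative assumptions, and the quartic polynomial of Theorem 1 is not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(0)
m, n, sigma = 2000, 20, 0.5            # sample size, dimension, noise level (illustrative only)
X = rng.standard_normal((m, n))        # i.i.d. gaussian features (identity covariance assumed)
beta = rng.normal(size=n)              # "true" model parameters
y = X @ beta + sigma * rng.standard_normal(m)

def empirical_test_error(m_train):
    """Fit OLS on the first m_train points; return mean squared error on the rest."""
    beta_hat, *_ = np.linalg.lstsq(X[:m_train], y[:m_train], rcond=None)
    return np.mean((y[m_train:] - X[m_train:] @ beta_hat) ** 2)

# The integrity criterion compares the empirical model error with the true
# measurement noise sigma^2; here we simply tabulate the two for a few splits.
for m_train in (50, 200, 500, 1000, 1500):
    print(f"m_train = {m_train:4d}: test MSE = {empirical_test_error(m_train):.4f} "
          f"(noise variance = {sigma**2:.4f})")
```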
Control systems often need to satisfy strict safety requirements. A safety index provides a handy way to evaluate the safety level of a system and to derive the resulting safe control policies. However, designing safety index functions under control limits is difficult and requires a great amount of expert knowledge. This paper proposes a framework for synthesizing the safety index for general control systems using sum-of-squares programming. Our approach is to show that ensuring the non-emptiness of safe control on the safe set boundary is equivalent to a local manifold positiveness problem. We then prove that this problem is equivalent to sum-of-squares programming via the Positivstellensatz of algebraic geometry. We validate the proposed method on robot arms with different degrees of freedom and on ground vehicles. The results show that the synthesized safety index guarantees safety and that our method is effective even in high-dimensional robot systems.
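The synthesis rests on off-the-shelf sum-of-squares machinery. The toy sketch below (not the paper's algorithm) shows the basic ingredient: a semidefinite feasibility problem certifying that a fixed univariate polynomial is a sum of squares, written with the cvxpy modeling library.

```python
import cvxpy as cp

# Certify that p(x) = x^4 + 2x^3 + 3x^2 + 2x + 1 is a sum of squares by
# finding a positive semidefinite Gram matrix Q with p(x) = z^T Q z,
# where z = [1, x, x^2] is the monomial basis.
Q = cp.Variable((3, 3), symmetric=True)
constraints = [
    Q >> 0,                       # Q must be positive semidefinite
    Q[0, 0] == 1,                 # constant term
    2 * Q[0, 1] == 2,             # coefficient of x
    2 * Q[0, 2] + Q[1, 1] == 3,   # coefficient of x^2
    2 * Q[1, 2] == 2,             # coefficient of x^3
    Q[2, 2] == 1,                 # coefficient of x^4
]
prob = cp.Problem(cp.Minimize(0), constraints)
prob.solve()
print(prob.status)  # "optimal" means an SOS certificate exists (here p = (x^2 + x + 1)^2)
```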
This paper studies the impact of the bootstrap procedure on the eigenvalue distributions of the sample covariance matrix under a high-dimensional factor structure. We provide asymptotic distributions for the top eigenvalues of the bootstrapped sample covariance matrix under mild conditions. After bootstrapping, the spiked eigenvalues, which are driven by common factors, converge weakly to Gaussian limits after proper scaling and centralization. However, the largest non-spiked eigenvalue is mainly determined by the order statistics of the bootstrap resampling weights and follows an extreme value distribution. Based on the disparate behavior of the spiked and non-spiked eigenvalues, we propose innovative methods to test the number of common factors. According to the simulations and a real data example, the proposed methods are the only ones that perform reliably and convincingly in the presence of both weak factors and cross-sectionally correlated errors. Our technical details contribute to random matrix theory on spiked covariance models with convexly decaying density and unbounded support, or with general elliptical distributions.
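A minimal simulation of the phenomenon described above, under an assumed toy factor model with three strong factors: the leading (spiked) eigenvalues of the bootstrapped sample covariance matrix fluctuate around their sample values, while the first non-spiked one behaves differently. The dimensions and loading strengths are illustrative choices, not the paper's settings.

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy factor model: n observations of a p-dimensional vector driven by k factors.
n, p, k = 500, 200, 3
loadings = rng.normal(scale=3.0, size=(p, k))   # strong loadings -> spiked eigenvalues
factors = rng.normal(size=(n, k))
errors = rng.normal(size=(n, p))
X = factors @ loadings.T + errors

def top_eigs(data, num=5):
    """Leading eigenvalues of the sample covariance matrix (descending)."""
    S = np.cov(data, rowvar=False)
    return np.sort(np.linalg.eigvalsh(S))[::-1][:num]

print("original top eigenvalues:", top_eigs(X))

# Nonparametric bootstrap: resample rows with replacement, recompute the
# leading eigenvalues, and compare the behavior of the first k (spiked)
# eigenvalues with that of the (k+1)-th (non-spiked) one.
B = 200
boot = np.array([top_eigs(X[rng.integers(0, n, size=n)]) for _ in range(B)])
print("bootstrap mean:", boot.mean(axis=0))
print("bootstrap std :", boot.std(axis=0))
```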
Mixtures of experts (MoE) models are a popular framework for modeling heterogeneity in data, for both regression and classification problems in statistics and machine learning, due to their flexibility and the abundance of available statistical estimation and model choice tools. Such flexibility comes from allowing the mixture weights (or gating functions) in the MoE model to depend on the explanatory variables, along with the experts (or component densities). This permits the modeling of data arising from more complex data generating processes when compared to the classical finite mixtures and finite mixtures of regression models, whose mixing parameters are independent of the covariates. The use of MoE models in a high-dimensional setting, when the number of explanatory variables can be much larger than the sample size, is challenging from a computational point of view, and in particular from a theoretical point of view, where the literature is still lacking results for dealing with the curse of dimensionality, for both the statistical estimation and feature selection problems. We consider the finite MoE model with soft-max gating functions and Gaussian experts for high-dimensional regression on heterogeneous data, and its $l_1$-regularized estimation via the Lasso. We focus on the Lasso estimation properties rather than its feature selection properties. We provide a lower bound on the regularization parameter of the Lasso function that ensures an $l_1$-oracle inequality satisfied by the Lasso estimator according to the Kullback--Leibler loss.
A two-dimensional eigenvalue problem (2DEVP) of a Hermitian matrix pair $(A, C)$ is introduced in this paper. The 2DEVP can be viewed as a linear algebraic formulation of the well-known eigenvalue optimization problem for the parameter matrix $H(\mu) = A - \mu C$. We present fundamental properties of the 2DEVP, such as existence, a necessary and sufficient condition for the number of 2D-eigenvalues to be finite, and variational characterizations. We use eigenvalue optimization problems arising from the minimax of two Rayleigh quotients and from the computation of the distance to instability to show their connections with the 2DEVP and the new insights into these problems derived from the properties of the 2DEVP.
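To make the underlying eigenvalue optimization concrete, the following sketch sweeps the parameter $\mu$ and tracks the eigenvalue curves of $H(\mu) = A - \mu C$ for a random real symmetric pair, locating the grid minimizer of the largest eigenvalue. This is only a brute-force illustration of the objects involved, not the 2DEVP algorithms developed in the paper.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 6
A = rng.normal(size=(n, n)); A = (A + A.T) / 2   # Hermitian (real symmetric) A
C = rng.normal(size=(n, n)); C = (C + C.T) / 2   # Hermitian C

# Sweep mu and record the full spectrum of H(mu) = A - mu*C on a grid.
mus = np.linspace(-5, 5, 2001)
curves = np.array([np.linalg.eigvalsh(A - mu * C) for mu in mus])

# One classical eigenvalue optimization problem: minimize the largest
# eigenvalue of H(mu) over mu (here solved crudely by grid search).
largest = curves[:, -1]
i = np.argmin(largest)
print(f"grid minimizer of lambda_max(A - mu*C): mu ~ {mus[i]:.3f}, value ~ {largest[i]:.4f}")
```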
Interval-censored multi-state data arise in many studies of chronic diseases, where the health status of a subject can be characterized by a finite number of disease states and the transition between any two states is only known to occur over a broad time interval. We formulate the effects of potentially time-dependent covariates on multi-state processes through semiparametric proportional intensity models with random effects. We adopt nonparametric maximum likelihood estimation (NPMLE) under general interval censoring and develop a stable expectation-maximization (EM) algorithm. We show that the resulting parameter estimators are consistent and that the finite-dimensional components are asymptotically normal with a covariance matrix that attains the semiparametric efficiency bound and can be consistently estimated through profile likelihood. In addition, we demonstrate through extensive simulation studies that the proposed numerical and inferential procedures perform well in realistic settings. Finally, we provide an application to a major epidemiologic cohort study.
Stochastic kriging has been widely employed for simulation metamodeling to predict the response surface of complex simulation models. However, its use is limited to cases where the design space is low-dimensional because, in general, the sample complexity (i.e., the number of design points required for stochastic kriging to produce an accurate prediction) grows exponentially in the dimensionality of the design space. The large sample size results in both a prohibitive sample cost for running the simulation model and a severe computational challenge due to the need to invert large covariance matrices. Based on tensor Markov kernels and sparse grid experimental designs, we develop a novel methodology that dramatically alleviates the curse of dimensionality. We show that the sample complexity of the proposed methodology grows only slightly in the dimensionality, even under model misspecification. We also develop fast algorithms that compute stochastic kriging in its exact form without any approximation schemes. We demonstrate via extensive numerical experiments that our methodology can handle problems with a design space of more than 10,000 dimensions, improving both prediction accuracy and computational efficiency by orders of magnitude relative to typical alternative methods in practice.
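As a small illustration of the kind of kernel structure involved, the sketch below builds a separable (tensor-product) Gram matrix from a one-dimensional exponential kernel, which corresponds to a Markov (Ornstein-Uhlenbeck) process; the specific kernel family, lengthscale scaling, and sparse-grid designs of the paper are not reproduced.

```python
import numpy as np

def tensor_product_kernel(X, Y, lengthscale):
    """Separable kernel K(x, y) = prod_j exp(-|x_j - y_j| / lengthscale).

    The 1-D exponential kernel is the covariance of an Ornstein-Uhlenbeck
    (Markov) process, so the product over coordinates is one simple instance
    of a tensor Markov kernel.
    """
    diff = np.abs(X[:, None, :] - Y[None, :, :])       # pairwise |x_j - y_j|, shape (n, m, d)
    return np.exp(-diff / lengthscale).prod(axis=-1)   # product over the d dimensions

# Tiny usage example in d = 1000 dimensions; the lengthscale is scaled with
# the dimension (an illustrative choice) to keep the product non-degenerate.
rng = np.random.default_rng(3)
X = rng.uniform(size=(50, 1000))
K = tensor_product_kernel(X, X, lengthscale=500.0)
print(K.shape, np.allclose(K, K.T), K.min())
```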
We introduce two new tools to assess the validity of statistical distributions. These tools are based on components derived from a new statistical quantity, the \textit{comparison curve}. The first tool is a graphical representation of these components on a \textit{bar plot} (B plot), which can provide a detailed appraisal of the validity of the statistical model, in particular when supplemented by acceptance regions related to the model. The knowledge gained from this representation can sometimes suggest an existing \textit{goodness-of-fit} test to supplement this visual assessment with a control of the type I error. Otherwise, an adaptive test may be preferable, and the second tool is the combination of these components into a powerful $\chi^2$-type goodness-of-fit test. Because the number of these components can be large, we introduce a new selection rule to decide, in a data-driven fashion, how many of them to take into consideration. In a simulation study, our goodness-of-fit tests are competitive in power with the best solutions that have been recommended in the context of a fully specified model as well as when some parameters must be estimated. Practical examples show how to use these tools to derive principled information about where the model departs from the data.
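For orientation, here is a classical data-driven smooth test in the same spirit as the proposed $\chi^2$-type test: squared Legendre-polynomial components of the probability-integral transform are accumulated, and a BIC-style penalty selects their number. The comparison-curve components and selection rule used by the authors may differ; this is only an assumed stand-in.

```python
import numpy as np
from scipy.stats import norm

def data_driven_smooth_test(x, cdf, max_components=10):
    """Neyman-style smooth statistic with a data-driven number of components."""
    u = cdf(np.asarray(x))                       # probability-integral transform
    n = len(u)
    t = 2.0 * u - 1.0                            # map to [-1, 1] for Legendre polynomials
    stats = []
    for j in range(1, max_components + 1):
        coeffs = np.zeros(j + 1); coeffs[j] = 1.0
        bj = np.sqrt(2 * j + 1) * np.polynomial.legendre.legval(t, coeffs)  # orthonormal component
        stats.append(np.sum(bj) / np.sqrt(n))
    comps = np.cumsum(np.square(stats))          # cumulative chi^2-type statistics N_1 <= N_2 <= ...
    penalty = np.arange(1, max_components + 1) * np.log(n)
    k = int(np.argmax(comps - penalty)) + 1      # BIC-style choice of the number of components
    return comps[k - 1], k

# Example: test a sample against the standard normal model.
rng = np.random.default_rng(4)
sample = rng.standard_normal(300)
stat, k = data_driven_smooth_test(sample, norm.cdf)
print(f"selected {k} component(s), statistic = {stat:.3f}")
```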
We show that under minimal assumptions on a random vector $X\in\mathbb{R}^d$, and with high probability, given $m$ independent copies of $X$, the coordinate distribution of each vector $(\langle X_i,\theta \rangle)_{i=1}^m$ is dictated by the distribution of the true marginal $\langle X,\theta \rangle$. Formally, we show that with high probability, \[\sup_{\theta \in S^{d-1}} \left( \frac{1}{m}\sum_{i=1}^m \left|\langle X_i,\theta \rangle^\sharp - \lambda^\theta_i \right|^2 \right)^{1/2} \leq c \left( \frac{d}{m} \right)^{1/4},\] where $\lambda^{\theta}_i = m\int_{(\frac{i-1}{m}, \frac{i}{m}]} F_{ \langle X,\theta \rangle }^{-1}(u)^2 \,du$ and $a^\sharp$ denotes the monotone non-decreasing rearrangement of $a$. The proof follows from the optimal estimate on the worst Wasserstein distance between a marginal of $X$ and its empirical counterpart, $\frac{1}{m} \sum_{i=1}^m \delta_{\langle X_i, \theta \rangle}$. We then use the accurate information on the structures of the vectors $(\langle X_i,\theta \rangle)_{i=1}^m$ to construct the first non-gaussian ensemble that yields the optimal estimate in the Dvoretzky-Milman Theorem: the ensemble exhibits almost Euclidean sections in arbitrary normed spaces of the same dimension as the gaussian embedding -- despite being very far from gaussian (in fact, it happens to be heavy-tailed).
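A quick numerical illustration of the Wasserstein estimate behind the proof, for an isotropic gaussian $X$ (an assumption made only for this sketch): the quantile-based $W_2$ distance between the empirical measure of $\langle X_i,\theta\rangle$ and the true marginal is maximized over a sample of random directions and compared with $(d/m)^{1/4}$.

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(5)
d, m = 50, 5000
X = rng.standard_normal((m, d))       # isotropic gaussian, so <X, theta> ~ N(0, 1) for unit theta

def w2_to_true_marginal(theta):
    """Quantile-based W_2 distance between the empirical measure of <X_i, theta>
    and the true marginal N(0, 1)."""
    proj = np.sort(X @ theta)                      # monotone non-decreasing rearrangement
    u = (np.arange(1, m + 1) - 0.5) / m            # midpoint grid on (0, 1)
    return np.sqrt(np.mean((proj - norm.ppf(u)) ** 2))

# Crude proxy for the supremum over the sphere: maximize over random directions.
thetas = rng.standard_normal((200, d))
thetas /= np.linalg.norm(thetas, axis=1, keepdims=True)
worst = max(w2_to_true_marginal(th) for th in thetas)
print(f"worst W_2 over sampled directions: {worst:.4f}  vs  (d/m)^(1/4) = {(d/m)**0.25:.4f}")
```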
Many combinatorial problems in the well-known complexity class NP have optimization counterparts that are important in practical settings. These problems usually assume full knowledge of the input and optimize over that specific input. In practice, however, uncertainty in the input data is common, yet it is normally not covered in the optimization versions of NP problems. One concept for modeling uncertainty in the input data is \textit{recoverable robustness}. In this setting, a solution on the input is calculated such that a recovery to a good solution is guaranteed whenever the uncertainty manifests itself. That is, a solution $\texttt{s}_0$ for the base scenario $\textsf{S}_0$ as well as a solution \texttt{s} for every possible scenario in the scenario set \textsf{S} has to be calculated. In other words, not only is a solution $\texttt{s}_0$ for the instance $\textsf{S}_0$ calculated, but solutions \texttt{s} for all scenarios from \textsf{S} are prepared in order to correct possible errors caused by the uncertainty. This paper introduces a specific class of recoverable robust problems: Hamming Distance Recoverable Robust Problems. In this setting, solutions $\texttt{s}_0$ and \texttt{s} have to be calculated such that $\texttt{s}_0$ and \texttt{s} differ in at most $\kappa$ elements. That is, one can recover from a harmful scenario by choosing a different solution that is not too far away from the first one. This paper surveys the complexity of Hamming distance recoverable robust versions of optimization problems typically found in NP, for different types of scenarios. The complexity is primarily situated in the lower levels of the polynomial hierarchy. The main contribution of the paper is that recoverable robust problems with compression-encoded scenarios and $m \in \mathbb{N}$ recoveries are $\Sigma^P_{2m+1}$-complete.
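The recovery constraint itself is easy to state in code; the snippet below checks, for solutions represented as sets of chosen elements (a simplification not tied to any particular base problem), whether a scenario solution stays within Hamming distance $\kappa$ of the base solution.

```python
def hamming_recoverable(s0, s, kappa):
    """Return True if the scenario solution s differs from the base solution s0
    in at most kappa elements (Hamming distance = size of the symmetric difference)."""
    return len(set(s0) ^ set(s)) <= kappa

# Example: the two solutions differ exactly in the elements {3, 4}.
print(hamming_recoverable({1, 2, 3}, {1, 2, 4}, kappa=2))   # True
```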
Substantial progress has been made recently on developing provably accurate and efficient algorithms for low-rank matrix factorization via nonconvex optimization. While conventional wisdom often takes a dim view of nonconvex optimization algorithms due to their susceptibility to spurious local minima, simple iterative methods such as gradient descent have been remarkably successful in practice. The theoretical footings, however, had been largely lacking until recently. In this tutorial-style overview, we highlight the important role of statistical models in enabling efficient nonconvex optimization with performance guarantees. We review two contrasting approaches: (1) two-stage algorithms, which consist of a tailored initialization step followed by successive refinement; and (2) global landscape analysis and initialization-free algorithms. Several canonical matrix factorization problems are discussed, including but not limited to matrix sensing, phase retrieval, matrix completion, blind deconvolution, robust principal component analysis, phase synchronization, and joint alignment. Special care is taken to illustrate the key technical insights underlying their analyses. This article serves as a testament that the integrated consideration of optimization and statistics leads to fruitful research findings.
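To fix ideas about the two-stage paradigm, here is a textbook-style sketch (not taken from the article) for a rank-one matrix sensing instance: a spectral initialization from the measurement-weighted average of the sensing matrices, followed by plain gradient descent on the squared residuals.

```python
import numpy as np

rng = np.random.default_rng(6)
n, m = 50, 1000                                       # signal dimension, number of measurements
x_true = rng.standard_normal(n)
A = rng.standard_normal((m, n, n))                    # gaussian sensing matrices
Asym = (A + A.transpose(0, 2, 1)) / 2                 # symmetrized (x^T A x = x^T Asym x)
y = np.einsum('kij,i,j->k', Asym, x_true, x_true)     # y_k = <A_k, x x^T>

# Stage 1: spectral initialization from the leading eigenpair of (1/m) sum_k y_k A_k,
# whose expectation is x x^T for gaussian sensing matrices.
Y = np.einsum('k,kij->ij', y, Asym) / m
w, V = np.linalg.eigh(Y)
x = V[:, -1] * np.sqrt(abs(w[-1]))

# Stage 2: gradient descent on f(x) = (1/(4m)) sum_k (<A_k, x x^T> - y_k)^2.
eta = 0.2 / np.linalg.norm(x) ** 2
for _ in range(200):
    r = np.einsum('kij,i,j->k', Asym, x, x) - y       # residuals
    grad = np.einsum('k,kij,j->i', r, Asym, x) / m    # gradient of f at x
    x -= eta * grad

err = min(np.linalg.norm(x - x_true), np.linalg.norm(x + x_true)) / np.linalg.norm(x_true)
print(f"relative recovery error (up to sign): {err:.2e}")
```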