亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<dir id='LeEEG'><del id='hwQGl'><del id='jvZeA'></del><pre id='IKJ5v'><pre id='IsUmi'><option id='SOxbw'><address id='wnoh4'></address><bdo id='dZGP8'><tr id='wBJ9o'><acronym id='yg1r7'><pre id='TLEnq'></pre></acronym><div id='turvY'></div></tr></bdo></option></pre><small id='C9OkZ'><address id='6hSlx'><u id='Sd470'><legend id='gSW9T'><option id='LYgfM'><abbr id='VTa1J'></abbr><li id='THk2F'><pre id='xXyjs'></pre></li></option></legend><select id='ryP4n'></select></u></address></small></pre></del><sup id='QRIok'></sup><blockquote id='KItLm'><dt id='O2JIe'></dt></blockquote><blockquote id='mg2LO'></blockquote></dir><tt id='k3cAL'></tt><u id='tuMPC'><tt id='njm26'><form id='dkWQ9'></form></tt><td id='loIP2'><dt id='FnZeg'></dt></td></u>

<code id='rXyYa'><i id='0Jp3X'><q id='rgNdl'><legend id='gJdC2'><pre id='BGhC6'><style id='nMB9Z'><acronym id='V0Dfm'><i id='uxWYK'><form id='e3qjb'><option id='SU6S7'><center id='CxKXl'></center></option></form></i></acronym></style><tt id='VQ64X'></tt></pre></legend></q></i></code><center id='xircb'></center>

<dd id='D3daC'></dd>

<style id='HPfrt'></style><sub id='tjy2r'><dfn id='BCWFe'><abbr id='dsnHs'><big id='9or9T'><bdo id='uqAId'></bdo></big></abbr></dfn></sub>_{<dir id='cT1Yu'></dir>}

·

高斯過程回歸 · 核化 · Processing（編程語言） · 塊 · Extensibility ·

2021 年 12 月 17 日

GP-HMAT: Scalable, ${O}(n\log(n))$ Gaussian Process Regression with Hierarchical Low-Rank Matrices

Vahid Keshavarzzadeh,Shandian Zhe,Robert M. Kirby,Akil Narayan

A Gaussian process (GP) is a powerful and widely used regression technique. The main building block of a GP regression is the covariance kernel, which characterizes the relationship between pairs in the random field. The optimization to find the optimal kernel, however, requires several large-scale and often unstructured matrix inversions. We tackle this challenge by introducing a hierarchical matrix approach, named HMAT, which effectively decomposes the matrix structure, in a recursive manner, into significantly smaller matrices where a direct approach could be used for inversion. Our matrix partitioning uses a particular aggregation strategy for data points, which promotes the low-rank structure of off-diagonal blocks in the hierarchical kernel matrix. We employ a randomized linear algebra method for matrix reduction on the low-rank off-diagonal blocks without factorizing a large matrix. We provide analytical error and cost estimates for the inversion of the matrix, investigate them empirically with numerical computations, and demonstrate the application of our approach on three numerical examples involving GP regression for engineering problems and a large-scale real dataset. We provide the computer implementation of GP-HMAT, HMAT adapted for GP likelihood and derivative computations, and the implementation of the last numerical example on a real dataset. We demonstrate superior scalability of the HMAT approach compared to built-in $\backslash$ operator in MATLAB for large-scale linear solves $\bf{A}\bf{x} = \bf{y}$ via a repeatable and verifiable empirical study. An extension to hierarchical semiseparable (HSS) matrices is discussed as future research.

相關內容

高斯過程回歸

高斯過程回歸

核化 · Processing（編程語言） · 樣本 · 拒絕采樣 · 核矩陣 ·

2022 年 4 月 19 日

Scalable Sampling for Nonsymmetric Determinantal Point Processes

Insu Han,Mike Gartrell,Jennifer Gillenwater,Elvis Dohmatob,Amin Karbasi

from arxiv, ICLR 2022

A determinantal point process (DPP) on a collection of $M$ items is a model, parameterized by a symmetric kernel matrix, that assigns a probability to every subset of those items. Recent work shows that removing the kernel symmetry constraint, yielding nonsymmetric DPPs (NDPPs), can lead to significant predictive performance gains for machine learning applications. However, existing work leaves open the question of scalable NDPP sampling. There is only one known DPP sampling algorithm, based on Cholesky decomposition, that can directly apply to NDPPs as well. Unfortunately, its runtime is cubic in $M$, and thus does not scale to large item collections. In this work, we first note that this algorithm can be transformed into a linear-time one for kernels with low-rank structure. Furthermore, we develop a scalable sublinear-time rejection sampling algorithm by constructing a novel proposal distribution. Additionally, we show that imposing certain structural constraints on the NDPP kernel enables us to bound the rejection rate in a way that depends only on the kernel rank. In our experiments we compare the speed of all of these samplers for a variety of real-world tasks.

因子分析 · INFORMS · 分解的 · 罰項 · 準則 ·

2022 年 4 月 19 日

Choosing the number of factors in factor analysis with incomplete data via a hierarchical Bayesian information criterion

Jianhua Zhao,Changchun Shang,Shulan Li,Ling Xin,Philip L. H. Yu

from arxiv, 16 pages, 4 figures

The Bayesian information criterion (BIC), defined as the observed data log likelihood minus a penalty term based on the sample size $N$, is a popular model selection criterion for factor analysis with complete data. This definition has also been suggested for incomplete data. However, the penalty term based on the `complete' sample size $N$ is the same no matter whether in a complete or incomplete data case. For incomplete data, there are often only $N_i<N$ observations for variable $i$, which means that using the `complete' sample size $N$ implausibly ignores the amounts of missing information inherent in incomplete data. Given this observation, a novel criterion called hierarchical BIC (HBIC) for factor analysis with incomplete data is proposed. The novelty is that it only uses the actual amounts of observed information, namely $N_i$'s, in the penalty term. Theoretically, it is shown that HBIC is a large sample approximation of variational Bayesian (VB) lower bound, and BIC is a further approximation of HBIC, which means that HBIC shares the theoretical consistency of BIC. Experiments on synthetic and real data sets are conducted to access the finite sample performance of HBIC, BIC, and related criteria with various missing rates. The results show that HBIC and BIC perform similarly when the missing rate is small, but HBIC is more accurate when the missing rate is not small.

Continuity · 優化器 · 代價函數 · 控制器 · 泛函 ·

2022 年 4 月 19 日

Energy-Based Continuous Inverse Optimal Control

Yifei Xu,Jianwen Xie,Tianyang Zhao,Chris Baker,Yibiao Zhao,Ying Nian Wu

The problem of continuous inverse optimal control (over finite time horizon) is to learn the unknown cost function over the sequence of continuous control variables from expert demonstrations. In this article, we study this fundamental problem in the framework of energy-based model, where the observed expert trajectories are assumed to be random samples from a probability density function defined as the exponential of the negative cost function up to a normalizing constant. The parameters of the cost function are learned by maximum likelihood via an "analysis by synthesis" scheme, which iterates (1) synthesis step: sample the synthesized trajectories from the current probability density using the Langevin dynamics via back-propagation through time, and (2) analysis step: update the model parameters based on the statistical difference between the synthesized trajectories and the observed trajectories. Given the fact that an efficient optimization algorithm is usually available for an optimal control problem, we also consider a convenient approximation of the above learning method, where we replace the sampling in the synthesis step by optimization. Moreover, to make the sampling or optimization more efficient, we propose to train the energy-based model simultaneously with a top-down trajectory generator via cooperative learning, where the trajectory generator is used to fast initialize the synthesis step of the energy-based model. We demonstrate the proposed methods on autonomous driving tasks, and show that they can learn suitable cost functions for optimal control.

相互獨立的 · 隨機變量 · 均值 · 聯合分布 · 蒙特卡羅 ·

2022 年 4 月 19 日

Asymptotic Independence of the Quadratic form and Maximum of Independent Random Variables with Applications to High-Dimensional Tests

Dachuan Chen,Long Feng

This paper establishes the asymptotic independence between the quadratic form and maximum of a sequence of independent random variables. Based on this theoretical result, we find the asymptotic joint distribution for the quadratic form and maximum, which can be applied into the high-dimensional testing problems. By combining the sum-type test and the max-type test, we propose the Fisher's combination tests for the one-sample mean test and two-sample mean test. Under this novel general framework, several strong assumptions in existing literature have been relaxed. Monte Carlo simulation has been done which shows that our proposed tests are strongly robust to both sparse and dense data.

估計/估計量 · Kronecker積 · 協方差矩陣 · Performer · 正則化項 ·

2022 年 4 月 18 日

Covariance Estimation for Matrix-valued Data

Yichi Zhang,Weining Shen,Dehan Kong

Covariance estimation for matrix-valued data has received an increasing interest in applications. Unlike previous works that rely heavily on matrix normal distribution assumption and the requirement of fixed matrix size, we propose a class of distribution-free regularized covariance estimation methods for high-dimensional matrix data under a separability condition and a bandable covariance structure. Under these conditions, the original covariance matrix is decomposed into a Kronecker product of two bandable small covariance matrices representing the variability over row and column directions. We formulate a unified framework for estimating bandable covariance, and introduce an efficient algorithm based on rank one unconstrained Kronecker product approximation. The convergence rates of the proposed estimators are established, and the derived minimax lower bound shows our proposed estimator is rate-optimal under certain divergence regimes of matrix size. We further introduce a class of robust covariance estimators and provide theoretical guarantees to deal with heavy-tailed data. We demonstrate the superior finite-sample performance of our methods using simulations and real applications from a gridded temperature anomalies dataset and a S&P 500 stock data analysis.

子采樣 · 嶺回歸 · 優化器 · 隨機采樣 · Performer ·

2022 年 4 月 18 日

Optimal Subsampling for High-dimensional Ridge Regression

Hanyu Li,Chengmei Niu

We investigate the feature compression of high-dimensional ridge regression using the optimal subsampling technique. Specifically, based on the basic framework of random sampling algorithm on feature for ridge regression and the A-optimal design criterion, we first obtain a set of optimal subsampling probabilities. Considering that the obtained probabilities are uneconomical, we then propose the nearly optimal ones. With these probabilities, a two step iterative algorithm is established which has lower computational cost and higher accuracy. We provide theoretical analysis and numerical experiments to support the proposed methods. Numerical results demonstrate the decent performance of our methods.

因子分析 · 分解的 · Performer · 學成 · 潛在 ·

2022 年 4 月 16 日

Graph-incorporated Latent Factor Analysis for High-dimensional and Sparse Matrices

Di Wu,Yi He,Xin Luo

A High-dimensional and sparse (HiDS) matrix is frequently encountered in a big data-related application like an e-commerce system or a social network services system. To perform highly accurate representation learning on it is of great significance owing to the great desire of extracting latent knowledge and patterns from it. Latent factor analysis (LFA), which represents an HiDS matrix by learning the low-rank embeddings based on its observed entries only, is one of the most effective and efficient approaches to this issue. However, most existing LFA-based models perform such embeddings on a HiDS matrix directly without exploiting its hidden graph structures, thereby resulting in accuracy loss. To address this issue, this paper proposes a graph-incorporated latent factor analysis (GLFA) model. It adopts two-fold ideas: 1) a graph is constructed for identifying the hidden high-order interaction (HOI) among nodes described by an HiDS matrix, and 2) a recurrent LFA structure is carefully designed with the incorporation of HOI, thereby improving the representa-tion learning ability of a resultant model. Experimental results on three real-world datasets demonstrate that GLFA outperforms six state-of-the-art models in predicting the missing data of an HiDS matrix, which evidently supports its strong representation learning ability to HiDS data.

估計/估計量 · 統計量 · 優化器 · 穩健性 · 噪聲 ·

2022 年 4 月 16 日

Computationally Efficient and Statistically Optimal Robust Low-rank Matrix Estimation

Yinan Shen,Jingyang Li,Jian-Feng Cai,Dong Xia

Low-rank matrix estimation under heavy-tailed noise is challenging, both computationally and statistically. Convex approaches have been proven statistically optimal but suffer from high computational costs, especially since robust loss functions are usually non-smooth. More recently, computationally fast non-convex approaches via sub-gradient descent are proposed, which, unfortunately, fail to deliver a statistically consistent estimator even under sub-Gaussian noise. In this paper, we introduce a novel Riemannian sub-gradient (RsGrad) algorithm which is not only computationally efficient with linear convergence but also is statistically optimal, be the noise Gaussian or heavy-tailed. Convergence theory is established for a general framework and specific applications to absolute loss, Huber loss, and quantile loss are investigated. Compared with existing non-convex methods, ours reveals a surprising phenomenon of dual-phase convergence. In phase one, RsGrad behaves as in a typical non-smooth optimization that requires gradually decaying stepsizes. However, phase one only delivers a statistically sub-optimal estimator which is already observed in the existing literature. Interestingly, during phase two, RsGrad converges linearly as if minimizing a smooth and strongly convex objective function and thus a constant stepsize suffices. Underlying the phase-two convergence is the smoothing effect of random noise to the non-smooth robust losses in an area close but not too close to the truth. Lastly, RsGrad is applicable for low-rank tensor estimation under heavy-tailed noise where a statistically optimal rate is attainable with the same phenomenon of dual-phase convergence, and a novel shrinkage-based second-order moment method is guaranteed to deliver a warm initialization. Numerical simulations confirm our theoretical discovery and showcase the superiority of RsGrad over prior methods.

模型選擇 · MoDELS · 蒙特卡羅 · 似然 · 邊緣似然函數 ·

2022 年 4 月 15 日

Proximal nested sampling for high-dimensional Bayesian model selection

Xiaohao Cai,Jason D. McEwen,Marcelo Pereyra

Bayesian model selection provides a powerful framework for objectively comparing models directly from observed data, without reference to ground truth data. However, Bayesian model selection requires the computation of the marginal likelihood (model evidence), which is computationally challenging, prohibiting its use in many high-dimensional Bayesian inverse problems. With Bayesian imaging applications in mind, in this work we present the proximal nested sampling methodology to objectively compare alternative Bayesian imaging models for applications that use images to inform decisions under uncertainty. The methodology is based on nested sampling, a Monte Carlo approach specialised for model comparison, and exploits proximal Markov chain Monte Carlo techniques to scale efficiently to large problems and to tackle models that are log-concave and not necessarily smooth (e.g., involving l_1 or total-variation priors). The proposed approach can be applied computationally to problems of dimension O(10^6) and beyond, making it suitable for high-dimensional inverse imaging problems. It is validated on large Gaussian models, for which the likelihood is available analytically, and subsequently illustrated on a range of imaging problems where it is used to analyse different choices of dictionary and measurement model.

穩健性 · FAST · 異常點 · 線性的 · 原點 ·

2022 年 4 月 15 日

Structured Gradient Descent for Fast Robust Low-Rank Hankel Matrix Completion

HanQin Cai,Jian-Feng Cai,Juntao You

We study the robust matrix completion problem for the low-rank Hankel matrix, which detects the sparse corruptions caused by extreme outliers while we try to recover the original Hankel matrix from the partial observation. In this paper, we explore the convenient Hankel structure and propose a novel non-convex algorithm, coined Hankel Structured Gradient Descent (HSGD), for large-scale robust Hankel matrix completion problems. HSGD is highly computing- and sample-efficient compared to the state-of-the-arts. The recovery guarantee with a linear convergence rate has been established for HSGD under some mild assumptions. The empirical advantages of HSGD are verified on both synthetic datasets and real-world nuclear magnetic resonance signals.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

高斯過程回歸

Processing（編程語言）

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<dir id='sfY4l'><del id='VgYtt'><del id='KRTIJ'></del><pre id='MuWaC'><pre id='8alhC'><option id='xuARc'><address id='Ni48z'></address><bdo id='lcyup'><tr id='V7oNc'><acronym id='ELwlu'><pre id='j2oG1'></pre></acronym><div id='s4RAX'></div></tr></bdo></option></pre><small id='BPhrP'><address id='BXWjy'><u id='J6CVR'><legend id='aVIf8'><option id='W8pw3'><abbr id='W5whg'></abbr><li id='rv8GY'><pre id='fZ88K'></pre></li></option></legend><select id='RYnXU'></select></u></address></small></pre></del><sup id='nyCZZ'></sup><blockquote id='HvLCb'><dt id='DAUy4'></dt></blockquote><blockquote id='GAPcg'></blockquote></dir><tt id='o868L'></tt><u id='lYKEq'><tt id='8UCLO'><form id='xCHrJ'></form></tt><td id='7TEFu'><dt id='PeIde'></dt></td></u>

<code id='5pk9Q'><i id='Cdtpt'><q id='TtfSQ'><legend id='GTiWO'><pre id='ViODt'><style id='0k49E'><acronym id='onbrt'><i id='vP1cm'><form id='vAX7y'><option id='fEhKq'><center id='ooaVS'></center></option></form></i></acronym></style><tt id='WKRPQ'></tt></pre></legend></q></i></code><center id='rjvUf'></center>

<dd id='cad35'></dd>

<style id='OehA4'></style><sub id='ieBL7'><dfn id='fc1D5'><abbr id='2GqFm'><big id='ieMeB'><bdo id='99uj0'></bdo></big></abbr></dfn></sub>_{<dir id='kUsMz'></dir>}