We obtain a canonical representation for block matrices. The representation facilitates simple computation of the determinant, the matrix inverse, and other powers of a block matrix, as well as the matrix logarithm and the matrix exponential. These results are particularly useful for block covariance and block correlation matrices, where evaluation of the Gaussian log-likelihood and estimation are greatly simplified. We illustrate this with an empirical application using a large panel of daily asset returns. Moreover, the representation paves new ways to regularize large covariance/correlation matrices, to test block structures in matrices, and to estimate regressions with many variables.
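
The canonical representation itself is developed in the paper, but the computational payoff it targets is easy to illustrate with the classical Schur-complement identities for a 2x2 block matrix, which already express the determinant and inverse in terms of the blocks. A minimal numpy sketch of these standard identities (not the paper's representation):

```python
import numpy as np

def block_det_inv(A, B, C, D):
    """Determinant and inverse of M = [[A, B], [C, D]] via the Schur
    complement S = D - C A^{-1} B (A assumed invertible)."""
    A_inv = np.linalg.inv(A)
    S = D - C @ A_inv @ B                # Schur complement of A
    S_inv = np.linalg.inv(S)
    det = np.linalg.det(A) * np.linalg.det(S)
    # Standard 2x2 block inverse formula
    top_left = A_inv + A_inv @ B @ S_inv @ C @ A_inv
    M_inv = np.block([[top_left,          -A_inv @ B @ S_inv],
                      [-S_inv @ C @ A_inv, S_inv]])
    return det, M_inv

rng = np.random.default_rng(0)
M = rng.normal(size=(6, 6)); M = M @ M.T + 6 * np.eye(6)  # SPD "covariance"
A, B, C, D = M[:3, :3], M[:3, 3:], M[3:, :3], M[3:, 3:]
det, M_inv = block_det_inv(A, B, C, D)
assert np.isclose(det, np.linalg.det(M))
assert np.allclose(M_inv, np.linalg.inv(M))
```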

Related Content

In this paper, we consider a distributed lossy compression network with $L$ encoders and a decoder. Each encoder observes a source, compresses it, and sends the compressed description to the decoder. Moreover, each observed source can be written as the sum of a target signal and a noise component, generated independently from two symmetric multivariate Gaussian distributions. The decoder jointly reconstructs the target signals subject to a threshold on the mean squared error distortion. We are interested in the minimum compression rate of this network as a function of the distortion threshold, known as the \emph{rate-distortion function}. We derive a lower bound on the rate-distortion function by explicitly solving a convex program. The proposed lower bound matches the well-known Berger-Tung upper bound for some values of the distortion threshold. The asymptotic expressions of the upper and lower bounds are derived in the large $L$ limit. Under specific constraints, the bounds match in the asymptotic regime, yielding a characterization of the rate-distortion function.
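
For context, the classical rate-distortion function of a single scalar Gaussian source under mean squared error, $R(D) = \max(0, \tfrac{1}{2}\log_2(\sigma^2/D))$, is the natural baseline against which such multiterminal bounds are measured. A minimal sketch of this baseline only (not the paper's $L$-encoder bounds):

```python
import numpy as np

def gaussian_rd(sigma2, D):
    """Rate-distortion function of a scalar Gaussian source N(0, sigma2)
    under mean squared error: R(D) = max(0, 0.5 * log2(sigma2 / D))."""
    return max(0.0, 0.5 * np.log2(sigma2 / D))

for D in (0.1, 0.5, 1.0, 2.0):
    print(f"D = {D:.1f}: R(D) = {gaussian_rd(1.0, D):.3f} bits")
```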

We establish that the limiting spectral distribution of a block-rescaled empirical covariance matrix is an arcsine law when the ratio between the dimension and the underlying sample size converges to 1 and when the samples corresponding to each block are independent. We further propose a conjecture for the cases where the latter ratio converges to a constant in the unit interval.
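
A quick way to probe such limiting-spectrum statements numerically is to simulate the empirical spectral distribution at a dimension-to-sample-size ratio of 1. In the sketch below, the rescaling (passing to the sample correlation matrix) is our own illustrative stand-in for the paper's block rescaling, not the exact construction analyzed there:

```python
import numpy as np

rng = np.random.default_rng(0)
n = p = 1000                           # ratio p/n = 1
X = rng.normal(size=(n, p))
S = X.T @ X / n                        # sample covariance matrix
d = 1.0 / np.sqrt(np.diag(S))
R = d[:, None] * S * d[None, :]        # illustrative per-coordinate rescaling
eigs = np.linalg.eigvalsh(R)
hist, edges = np.histogram(eigs, bins=40, density=True)
print("eigenvalue range:", eigs.min(), eigs.max())
```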

We are interested in the problem of learning the directed acyclic graph (DAG) when data are generated from a linear structural equation model (SEM) and the causal structure can be characterized by a polytree. Under Gaussian polytree models, we study sufficient conditions on the sample size for the well-known Chow-Liu algorithm to exactly recover both the skeleton and the equivalence class of the polytree, which is uniquely represented by a CPDAG. We also derive necessary conditions on the required sample sizes for both skeleton and CPDAG recovery in terms of information-theoretic lower bounds; these match the respective sufficient conditions and thereby give a sharp characterization of the difficulty of these tasks. We further consider extensions to the sub-Gaussian case and study the estimation of the inverse correlation matrix under such models. Our theoretical findings are illustrated by comprehensive numerical simulations, and experiments on benchmark data demonstrate the robustness of polytree learning when the true graphical structures can only be approximated by polytrees.
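
The Chow-Liu step itself is classical: estimate pairwise mutual information, which for Gaussian data is $\mathrm{MI}_{ij} = -\tfrac{1}{2}\log(1-\rho_{ij}^2)$, and take a maximum-weight spanning tree. A minimal sketch of skeleton recovery on a simulated chain polytree (an illustration of the algorithm being analyzed, not the paper's sample-complexity machinery):

```python
import numpy as np
from scipy.sparse.csgraph import minimum_spanning_tree

def chow_liu_skeleton(X):
    """Chow-Liu skeleton: maximum-weight spanning tree on pairwise
    Gaussian mutual information MI_ij = -0.5 * log(1 - rho_ij^2)."""
    rho = np.corrcoef(X, rowvar=False)
    np.fill_diagonal(rho, 0.0)
    mi = -0.5 * np.log(1.0 - rho ** 2)
    # scipy computes a *minimum* spanning tree, so negate the weights.
    tree = minimum_spanning_tree(-mi)
    rows, cols = tree.nonzero()
    return list(zip(rows.tolist(), cols.tolist()))

rng = np.random.default_rng(1)
# Chain polytree: X0 -> X1 -> X2 -> X3
X = np.zeros((5000, 4))
X[:, 0] = rng.normal(size=5000)
for j in range(1, 4):
    X[:, j] = 0.8 * X[:, j - 1] + rng.normal(size=5000)
print(chow_liu_skeleton(X))  # expected skeleton edges: (0,1), (1,2), (2,3)
```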

Clustering time series into similar groups can improve models by combining information across similar time series. While there is a well-developed body of literature on time series clustering, existing approaches tend to generate clusters independently of model training, which can lead to poor model fit. We propose a novel distributed approach that simultaneously clusters and fits autoregressive models for groups of similar individuals. We apply a Wishart mixture model to cluster individuals while modeling the corresponding autocovariance matrices at the same time. The fitted Wishart scale matrices map to cluster-level autoregressive coefficients through the Yule-Walker equations, yielding robust, parsimonious autoregressive mixture models. This approach is able to discern differences in the underlying autocorrelation structure of time series in settings with large heterogeneous datasets. We prove consistency of our cluster membership estimator, derive asymptotic distributions of the coefficients, and compare our approach against competing methods through simulation as well as by fitting a COVID-19 forecast model.
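
The Yule-Walker mapping used above is standard: the AR(p) coefficients solve a Toeplitz linear system in the autocovariances. A minimal sketch of that mapping, independent of the Wishart mixture machinery itself:

```python
import numpy as np
from scipy.linalg import solve_toeplitz

def yule_walker(gamma):
    """Map autocovariances gamma[0..p] to AR(p) coefficients phi by
    solving the Toeplitz system Gamma @ phi = gamma[1:]."""
    return solve_toeplitz(gamma[:-1], gamma[1:])

# Simulate an AR(2) process and recover its coefficients.
rng = np.random.default_rng(0)
x = np.zeros(50_000)
for t in range(2, len(x)):
    x[t] = 0.6 * x[t - 1] + 0.3 * x[t - 2] + rng.normal()
gamma = np.array([np.mean(x[: len(x) - k] * x[k:]) for k in range(3)])
print(yule_walker(gamma))  # approximately [0.6, 0.3]
```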

Many causal discovery methods rely on the faithfulness assumption to guarantee asymptotic correctness. However, the assumption can be approximately violated in many ways, leading to sub-optimal solutions. Although there is a line of research in Bayesian network structure learning that focuses on weakening the assumption, such as exact search methods with well-defined score functions, these methods do not scale well to large graphs. In this work, we introduce several strategies to improve the scalability of exact score-based methods in the linear Gaussian setting. In particular, we develop a super-structure estimation method based on the support of the inverse covariance matrix, which requires assumptions strictly weaker than faithfulness, and apply it to restrict the search space of exact search. We also propose a local search strategy that performs exact search on the local clusters formed by each variable and its neighbors within two hops in the super-structure. Numerical experiments validate the efficacy of the proposed procedure and demonstrate that it scales up to hundreds of nodes with high accuracy.
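
One simple way to realize such a super-structure is to threshold the support of a sparse precision-matrix estimate. The sketch below uses scikit-learn's GraphicalLasso as an illustrative stand-in for the paper's estimator; the regularization alpha and threshold tol are illustrative choices:

```python
import numpy as np
from sklearn.covariance import GraphicalLasso

def superstructure(X, alpha=0.05, tol=1e-4):
    """Estimate a super-structure as the support of a sparse inverse
    covariance (precision) matrix estimate."""
    prec = GraphicalLasso(alpha=alpha).fit(X).precision_
    G = np.abs(prec) > tol
    np.fill_diagonal(G, False)
    return G  # boolean adjacency matrix restricting the search space

rng = np.random.default_rng(0)
# Simple chain X0 -> X1 -> X2, so the true precision matrix is tridiagonal.
X = np.zeros((2000, 3))
X[:, 0] = rng.normal(size=2000)
X[:, 1] = 0.7 * X[:, 0] + rng.normal(size=2000)
X[:, 2] = 0.7 * X[:, 1] + rng.normal(size=2000)
print(superstructure(X).astype(int))
```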

In 1954, Alston S. Householder published Principles of Numerical Analysis, one of the first modern treatments of matrix decomposition, favoring the (block) LU decomposition: the factorization of a matrix into the product of lower and upper triangular matrices. Matrix decomposition has since become a core technology in machine learning, largely due to the development of the backpropagation algorithm for fitting neural networks. The sole aim of this survey is to give a self-contained introduction to the concepts and mathematical tools of numerical linear algebra and matrix analysis, in order to seamlessly introduce matrix decomposition techniques and their applications in subsequent sections. We recognize, however, that we cannot cover all the useful and interesting results concerning matrix decomposition within the scope of this discussion, e.g., a separate treatment of Euclidean spaces, Hermitian spaces, Hilbert spaces, and the complex domain. We refer the reader to the linear algebra literature for a more detailed introduction to these related fields.
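
The block LU factorization that opens the survey is compact enough to state in code: with M partitioned into 2x2 blocks and the leading block A invertible, M = LU with the Schur complement D - C A^{-1} B in the trailing corner of U. A minimal sketch:

```python
import numpy as np

def block_lu(M, k):
    """2x2 block LU: M = [[A, B], [C, D]] = L @ U with
    L = [[I, 0], [C A^{-1}, I]] and U = [[A, B], [0, D - C A^{-1} B]]."""
    A, B = M[:k, :k], M[:k, k:]
    C, D = M[k:, :k], M[k:, k:]
    A_inv = np.linalg.inv(A)
    n = M.shape[0]
    L = np.eye(n)
    L[k:, :k] = C @ A_inv
    U = np.zeros_like(M)
    U[:k, :k], U[:k, k:] = A, B
    U[k:, k:] = D - C @ A_inv @ B      # Schur complement
    return L, U

rng = np.random.default_rng(0)
M = rng.normal(size=(5, 5)) + 5 * np.eye(5)
L, U = block_lu(M, 2)
assert np.allclose(L @ U, M)
```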

We employ a toolset -- dubbed Dr. Frankenstein -- to analyse the similarity of representations in deep neural networks. With this toolset, we aim to match the activations on given layers of two trained neural networks by joining them with a stitching layer. We demonstrate that the inner representations emerging in deep convolutional neural networks with the same architecture but different initializations can be matched with a surprisingly high degree of accuracy even with a single, affine stitching layer. We choose the stitching layer from several possible classes of linear transformations and investigate their performance and properties. The task of matching representations is closely related to notions of similarity. Using this toolset, we also provide a novel viewpoint on the current line of research regarding similarity indices of neural network representations: the perspective of performance on a task.
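
As an illustration of what an affine stitching layer does, the closed-form least-squares fit below maps one network's activations onto another's. The paper considers several classes of linear transformations; this closed-form affine fit is only one illustrative way to obtain such a map:

```python
import numpy as np

def fit_affine_stitching(H1, H2):
    """Fit an affine map H1 @ W + b ~ H2 between activation matrices
    (n_samples x n_features) of two networks via least squares."""
    n = H1.shape[0]
    H1_aug = np.hstack([H1, np.ones((n, 1))])        # absorb bias term
    coef, *_ = np.linalg.lstsq(H1_aug, H2, rcond=None)
    return coef[:-1], coef[-1]                       # W, b

rng = np.random.default_rng(0)
H1 = rng.normal(size=(512, 64))                      # activations, network A
W_true = rng.normal(size=(64, 64))
H2 = H1 @ W_true + 0.01 * rng.normal(size=(512, 64)) # activations, network B
W, b = fit_affine_stitching(H1, H2)
print(np.linalg.norm(H1 @ W + b - H2) / np.linalg.norm(H2))  # small residual
```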

The focus of disentanglement approaches has been on identifying independent factors of variation in data. However, the causal variables underlying real-world observations are often not statistically independent. In this work, we bridge the gap to real-world scenarios by analyzing the behavior of the most prominent disentanglement approaches on correlated data in a large-scale empirical study (including 4260 models). We show and quantify that systematically induced correlations in the dataset are being learned and reflected in the latent representations, which has implications for downstream applications of disentanglement such as fairness. We also demonstrate how to resolve these latent correlations, either using weak supervision during training or by post-hoc correcting a pre-trained model with a small number of labels.
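
As a toy illustration of post-hoc correction with a small number of labels, one can fit a linear map from the learned latents to the ground-truth factors on a tiny labeled subset. This linear variant is our own simplification for illustration, not the paper's exact procedure:

```python
import numpy as np

def posthoc_correction(Z_labeled, factors, Z):
    """Fit a linear map from latents to ground-truth factors on a small
    labeled subset, then apply it to all latents (illustrative only)."""
    n = Z_labeled.shape[0]
    Z_aug = np.hstack([Z_labeled, np.ones((n, 1))])
    coef, *_ = np.linalg.lstsq(Z_aug, factors, rcond=None)
    return np.hstack([Z, np.ones((Z.shape[0], 1))]) @ coef

rng = np.random.default_rng(0)
factors_all = rng.normal(size=(10_000, 4))           # true generative factors
Z = factors_all @ rng.normal(size=(4, 4))            # entangled latents
idx = rng.choice(10_000, size=100, replace=False)    # only 100 labels
Z_corrected = posthoc_correction(Z[idx], factors_all[idx], Z)
print(np.corrcoef(Z_corrected[:, 0], factors_all[:, 0])[0, 1])  # close to 1
```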

In this paper, we focus on the self-supervised learning of visual correspondence using unlabeled videos in the wild. Our method simultaneously considers intra- and inter-video representation associations for reliable correspondence estimation. The intra-video learning transforms the image contents across frames within a single video via the frame pair-wise affinity. To obtain a discriminative representation for instance-level separation, we go beyond the intra-video analysis and construct an inter-video affinity to facilitate the contrastive transformation across different videos. By enforcing transformation consistency between the intra- and inter-video levels, the fine-grained correspondence associations are well preserved and the instance-level feature discrimination is effectively reinforced. Our simple framework outperforms recent self-supervised correspondence methods on a range of visual tasks, including video object tracking (VOT), video object segmentation (VOS), and pose keypoint tracking. It is worth mentioning that our method also surpasses the fully-supervised affinity representation (e.g., ResNet) and performs competitively against recent fully-supervised algorithms designed for the specific tasks (e.g., VOT and VOS).
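
The frame pair-wise affinity underlying the intra-video transformation is typically a row-stochastic softmax over feature similarities. A minimal sketch, where the temperature value and feature shapes are illustrative choices:

```python
import numpy as np

def frame_affinity(F1, F2, temperature=0.07):
    """Row-stochastic affinity between two frames' features
    (n_pixels x dim): a softmax over cosine similarities."""
    F1 = F1 / np.linalg.norm(F1, axis=1, keepdims=True)
    F2 = F2 / np.linalg.norm(F2, axis=1, keepdims=True)
    logits = F1 @ F2.T / temperature
    logits -= logits.max(axis=1, keepdims=True)      # numerical stability
    A = np.exp(logits)
    return A / A.sum(axis=1, keepdims=True)

rng = np.random.default_rng(0)
F1 = rng.normal(size=(196, 128))                     # 14x14 feature map, frame t
F2 = rng.normal(size=(196, 128))                     # frame t+1
A = frame_affinity(F1, F2)
mask_t1 = rng.integers(0, 2, size=(196, 1)).astype(float)  # e.g., a segmentation mask
propagated = A @ mask_t1                             # transform content across frames
```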

Proximal Policy Optimization (PPO) is a highly popular model-free reinforcement learning (RL) approach. However, with continuous state and action spaces and a Gaussian policy -- common in computer animation and robotics -- PPO is prone to getting stuck in local optima. In this paper, we observe a tendency of PPO to prematurely shrink the exploration variance, which naturally leads to slow progress. Motivated by this, we borrow ideas from CMA-ES, a black-box optimization method designed for intelligent adaptive Gaussian exploration, to derive PPO-CMA, a novel proximal policy optimization approach that can expand the exploration variance on objective function slopes and shrink the variance when close to the optimum. This is implemented by using separate neural networks for the policy mean and variance and training the mean and variance in separate passes. Our experiments demonstrate a clear improvement over vanilla PPO in many difficult OpenAI Gym MuJoCo tasks.
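
The architectural idea, separate networks for the Gaussian policy's mean and variance so that the two can be trained in separate passes, is straightforward to sketch. The PyTorch snippet below shows only the policy parameterization; the PPO-CMA training procedure and its CMA-ES-style variance adaptation are omitted, and the hidden size and observation/action dimensions are illustrative:

```python
import torch
import torch.nn as nn

class GaussianPolicy(nn.Module):
    """Gaussian policy with *separate* networks for mean and variance,
    so each can be trained in its own pass (as in PPO-CMA)."""
    def __init__(self, obs_dim, act_dim, hidden=64):
        super().__init__()
        self.mean_net = nn.Sequential(
            nn.Linear(obs_dim, hidden), nn.Tanh(), nn.Linear(hidden, act_dim))
        self.logvar_net = nn.Sequential(
            nn.Linear(obs_dim, hidden), nn.Tanh(), nn.Linear(hidden, act_dim))

    def forward(self, obs):
        mean = self.mean_net(obs)
        std = torch.exp(0.5 * self.logvar_net(obs))  # log-variance -> std
        return torch.distributions.Normal(mean, std)

policy = GaussianPolicy(obs_dim=17, act_dim=6)       # MuJoCo-like sizes
obs = torch.randn(32, 17)
dist = policy(obs)
action = dist.sample()
log_prob = dist.log_prob(action).sum(-1)             # used in the PPO ratio
```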
