Motivated by several examples, we consider a general framework of learning with linear loss functions. In this context, we provide excess risk and estimation bounds that hold with high probability for four estimators: ERM, minmax MOM, and their regularized versions. These general bounds are applied to the problem of robustness in sparse PCA. In particular, we improve the state-of-the-art result for this problem and obtain results under weak moment assumptions as well as for adversarially contaminated data.
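For context, a common form of the minmax MOM estimator over a class F (a sketch of the usual definition, not necessarily the exact formulation used in this work) is
\[
\hat f_{\mathrm{MOM}} \in \operatorname*{arg\,min}_{f \in F}\; \sup_{g \in F}\; \mathrm{MOM}_K\big(\ell_f - \ell_g\big),
\]
where $\mathrm{MOM}_K$ denotes the median of the empirical means of $\ell_f - \ell_g$ computed over $K$ disjoint blocks of the data.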
This paper develops a theory of the matrix Dyson equation (MDE) for correlated linearizations and uses it to derive an asymptotic deterministic equivalent for the test error in random features regression. The theory developed for the correlated MDE includes existence and uniqueness, spectral support bounds, and stability properties of the MDE. This theory is new in that it constructs deterministic equivalents for pseudoresolvents of a class of correlated linear pencils. In the application, this theory is used to give a deterministic equivalent of the test error in random features ridge regression, in a proportional scaling regime, conditional on both the training and test datasets.
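As a reference point, the standard (uncorrelated) matrix Dyson equation for a deterministic equivalent $M(z)$ of a resolvent is usually written as (a sketch of the classical form, which the correlated theory here generalizes)
\[
-M(z)^{-1} \;=\; z I - A + \mathcal{S}[M(z)], \qquad \operatorname{Im} M(z) \succeq 0 \ \text{for}\ \operatorname{Im} z > 0,
\]
where $A$ is a self-adjoint expectation matrix and $\mathcal{S}$ is the self-energy (covariance) operator.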
In this paper, we propose a learning-based image fragment pair-searching and pair-matching approach to solve the challenging fragment restoration problem. Existing works use rule-based methods to match similar contour shapes or textures; their hyperparameters are difficult to tune across extensive data, and they are computationally time-consuming. Therefore, we propose a neural network that can effectively combine neighboring textures with contour shape information to fundamentally improve performance. First, we employ a graph-based network to extract the local contour and texture features of fragments. Then, for the pair-searching task, we adopt a linear transformer-based module to integrate these local features and use a contrastive loss to encode the global features of each fragment. For the pair-matching task, we design a weighted fusion module to dynamically fuse the extracted local contour and texture features, and formulate a similarity matrix for each pair of fragments to calculate the matching score and infer the adjacent contour segments. To faithfully evaluate our proposed network, we created a new image fragment dataset through an algorithm we designed that tears complete images into irregular fragments. The experimental results show that our proposed network achieves excellent pair-searching accuracy, reduces matching errors, and significantly reduces computational time. Details, source code, and data are available in our supplementary material.
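As a rough illustration of the pair-matching step described above (a schematic of the general idea with hypothetical names such as contour_a and texture_a, not the authors' implementation), fused per-point features of two fragments can be compared through a similarity matrix from which a matching score is read off:

```python
import torch
import torch.nn.functional as F

def matching_score(contour_a, texture_a, contour_b, texture_b, w):
    # contour_*/texture_*: (N, d) per-contour-point features of fragments A and B
    # w: scalar in (0, 1) standing in for the output of a weighted fusion module
    fused_a = F.normalize(w * contour_a + (1 - w) * texture_a, dim=-1)
    fused_b = F.normalize(w * contour_b + (1 - w) * texture_b, dim=-1)
    sim = fused_a @ fused_b.T            # (N_a, N_b) cosine similarity matrix
    # best-matching counterpart for each contour point, averaged into one score
    return sim.max(dim=1).values.mean(), sim
```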
While question answering over knowledge bases (KBQA) has shown progress in addressing factoid questions, KBQA with numerical reasoning remains relatively unexplored. In this paper, we focus on complex numerical reasoning in KBQA and propose a new task, NR-KBQA, which necessitates the ability to perform both multi-hop reasoning and numerical reasoning. We design a Python-style logical form, called PyQL, to represent the reasoning process for numerical reasoning questions. To facilitate the development of NR-KBQA, we present a large dataset called MarkQA, which is automatically constructed from a small set of seeds. Each question in MarkQA is equipped with its corresponding SPARQL query, alongside a step-by-step reasoning process in the QDMR format and a PyQL program. Experimental results of several state-of-the-art QA methods on MarkQA show that complex numerical reasoning in KBQA faces great challenges.
Modeling excesses remains an important topic in insurance data modeling. Among the alternatives for modeling excesses, the Peaks Over Threshold (POT) framework with the Generalized Pareto Distribution (GPD) is regarded as an efficient approach due to its flexibility. However, the selection of an appropriate threshold for this framework is a major difficulty. To address this difficulty, we applied several accumulation tests along with the Anderson-Darling test to determine an optimal threshold. Based on the selected thresholds, the GPD was fitted and the corresponding quantiles were estimated. We applied the procedure to the well-known Norwegian Fire Insurance data and constructed confidence intervals for the Value-at-Risk (VaR). The accumulation test approach provides satisfactory performance in modeling the high quantiles of the Norwegian Fire Insurance data compared to previous graphical methods.
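To make the POT/GPD step concrete, here is a minimal sketch (my own illustration, not the paper's code) of fitting a GPD to exceedances above an already-chosen threshold u and reading off a high quantile via the standard POT formula:

```python
import numpy as np
from scipy.stats import genpareto

def pot_var(losses, u, p=0.99):
    """Estimate VaR at level p from exceedances of `losses` above threshold u."""
    losses = np.asarray(losses, dtype=float)
    exceed = losses[losses > u] - u                   # excesses over the threshold
    xi, _, sigma = genpareto.fit(exceed, floc=0.0)    # GPD shape and scale (location fixed at 0)
    n, n_u = losses.size, exceed.size
    # Standard POT quantile formula: VaR_p = u + (sigma/xi) * ((n/n_u * (1 - p))**(-xi) - 1)
    return u + sigma / xi * ((n / n_u * (1.0 - p)) ** (-xi) - 1.0)
```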
We present a novel deep learning method for estimating time-dependent parameters of Markov processes from discrete samples. Departing from conventional machine learning, our approach reframes parameter approximation as an optimization problem via maximum likelihood. Experimental validation focuses on parameter estimation in multivariate regression and stochastic differential equations (SDEs). Our theoretical results show that, under specific conditions, the true solution is close to the solution of the SDE whose parameters are approximated by our neural network. Our work contributes to SDE-based model parameter estimation, offering a versatile tool for diverse fields.
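The following is a minimal sketch of the general idea (my own illustration under simplifying assumptions, not the authors' method): a time-dependent drift parameter theta(t) of an SDE dX_t = theta(t) X_t dt + sigma dW_t is parameterized by a small network and fitted to discrete observations by maximizing an Euler-Maruyama pseudo-likelihood.

```python
import torch

class ThetaNet(torch.nn.Module):
    """Small MLP standing in for the time-dependent parameter theta(t)."""
    def __init__(self):
        super().__init__()
        self.net = torch.nn.Sequential(
            torch.nn.Linear(1, 32), torch.nn.Tanh(), torch.nn.Linear(32, 1))
    def forward(self, t):
        return self.net(t)

def neg_log_likelihood(theta_net, t, x, sigma):
    # t, x: 1-D tensors of observation times and observed states
    dt = t[1:] - t[:-1]
    drift = theta_net(t[:-1].unsqueeze(-1)).squeeze(-1) * x[:-1]
    mean = x[:-1] + drift * dt                # Euler-Maruyama transition mean
    var = sigma ** 2 * dt                     # transition variance
    # Gaussian transition negative log-likelihood (constant terms dropped)
    return 0.5 * torch.sum((x[1:] - mean) ** 2 / var + torch.log(var))

# Usage: minimize with a standard optimizer, e.g.
# theta_net = ThetaNet(); opt = torch.optim.Adam(theta_net.parameters(), lr=1e-3)
```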
We study computational aspects of repulsive Gibbs point processes, which are probabilistic models of interacting particles in a finite-volume region of space. We introduce an approach for reducing a Gibbs point process to the hard-core model, a well-studied discrete spin system. Given an instance of such a point process, our reduction generates a random graph drawn from a natural geometric model. We show that the partition function of a hard-core model on graphs generated by the geometric model concentrates around the partition function of the Gibbs point process. Our reduction allows us to use a broad range of algorithms developed for the hard-core model to sample from the Gibbs point process and approximate its partition function. This is, to the best of our knowledge, the first approach that deals with pair potentials of unbounded range. We compare the resulting algorithms with recently established results and study further properties of the random geometric graphs with respect to the hard-core model.
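For orientation, the two partition functions being related are the following standard objects (stated here only as a sketch): the grand-canonical partition function of a repulsive Gibbs point process with fugacity $\lambda$ and pair potential $\phi \ge 0$ on a region $\mathbb{V}$,
\[
Z_{\mathbb{V}}(\lambda) \;=\; \sum_{k \ge 0} \frac{\lambda^k}{k!} \int_{\mathbb{V}^k} \exp\Big(-\sum_{1 \le i < j \le k} \phi(x_i, x_j)\Big)\, \mathrm{d}x_1 \cdots \mathrm{d}x_k ,
\]
and the hard-core partition function of a graph $G = (V, E)$ at fugacity $\lambda'$,
\[
Z_G(\lambda') \;=\; \sum_{\substack{I \subseteq V \\ I \text{ independent}}} (\lambda')^{|I|} .
\]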
In this work, we use the integral definition of the fractional Laplace operator and study a sparse optimal control problem involving a fractional, semilinear, elliptic partial differential equation as the state equation; control constraints are also considered. We establish the existence of optimal solutions and derive first- and second-order optimality conditions. We also analyze regularity properties of the optimal variables. We propose and analyze two finite element discretization strategies: a fully discrete scheme, in which the control variable is discretized with piecewise constant functions, and a semidiscrete scheme, in which the control variable is not discretized. For both discretization schemes, we analyze convergence properties and derive a priori error bounds.
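For reference, the integral (nonlocal) definition of the fractional Laplacian reads, for $s \in (0,1)$,
\[
(-\Delta)^s u(x) \;=\; C(n,s)\; \mathrm{p.v.}\!\int_{\mathbb{R}^n} \frac{u(x) - u(y)}{|x - y|^{n + 2s}}\, \mathrm{d}y ,
\]
with a normalization constant $C(n,s)$; this is the standard formula, stated here only for context.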
Integrated computational materials engineering (ICME) has significantly enhanced the systematic analysis of the relationship between microstructure and material properties, paving the way for the development of high-performance materials. However, analyzing microstructure-sensitive material behavior remains challenging due to the scarcity of three-dimensional (3D) microstructure datasets. Moreover, this challenge is amplified if the microstructure is anisotropic, as this results in anisotropic material properties as well. In this paper, we present a framework for the reconstruction of anisotropic microstructures solely based on two-dimensional (2D) micrographs, using conditional diffusion-based generative models (DGMs). The proposed framework spatially connects multiple 2D conditional DGMs, each trained to generate 2D microstructure samples for one of three orthogonal planes. The connected reverse diffusion processes then enable effective modeling of a Markov chain for transforming noise into a 3D microstructure sample. Furthermore, a modified harmonized sampling is employed to enhance sample quality while preserving the spatial connection between the slices of anisotropic microstructure samples in 3D space. To validate the proposed framework, the 2D-to-3D reconstructed anisotropic microstructure samples are evaluated in terms of both spatial correlation functions and physical material behavior. The results demonstrate that the framework is capable of reproducing not only the statistical distribution of material phases but also the material properties in 3D space. This highlights the potential of the proposed 2D-to-3D reconstruction framework for establishing microstructure-property linkages, which could aid high-throughput material design in future studies.
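One way such a spatial connection of planar models can be realized is sketched below (my reading of the abstract, with hypothetical denoiser names eps_xy, eps_xz, eps_yz; it is not the authors' code): at each reverse diffusion step, each plane's 2D denoiser is applied slice-wise along its axis and the three noise predictions are averaged per voxel.

```python
import torch

def combined_eps(volume, t, eps_xy, eps_xz, eps_yz):
    # volume: (D, H, W) noisy 3D sample; each eps_* maps 2D slices (B, 1, h, w) -> noise
    e1 = eps_xy(volume.unsqueeze(1), t).squeeze(1)                      # slices along depth
    e2 = eps_xz(volume.permute(1, 0, 2).unsqueeze(1), t).squeeze(1).permute(1, 0, 2)
    e3 = eps_yz(volume.permute(2, 0, 1).unsqueeze(1), t).squeeze(1).permute(1, 2, 0)
    return (e1 + e2 + e3) / 3.0     # harmonize the three planar predictions per voxel
```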
In this paper we present a novel approach for the design of high-order general boundary conditions when approximating solutions of the Euler equations on domains with curved boundaries, using meshes which may not be boundary-conforming. When dealing with curved boundaries and/or unfitted discretizations, the consistency of boundary conditions is a well-known challenge, especially in the context of high-order schemes. In order to tackle such consistency problems, the so-called Reconstruction for Off-site Data (ROD) method has recently been introduced in the finite volume framework: it is based on a boundary polynomial reconstruction that embeds the considered boundary treatment through a constrained minimization problem. This work is devoted to the development of the ROD approach in the context of discontinuous finite elements. We use the genuinely space-time nature of the local ADER predictors to reformulate the ROD as a single space-time reconstruction procedure. This allows us to avoid a new reconstruction (linear system inversion) at each sub-time node and to retrieve a single space-time polynomial that embeds the considered boundary conditions for the entire space-time element. Several numerical experiments are presented, demonstrating the consistency of the new approach for all kinds of boundary conditions. Computations involving the interaction of shocks with embedded curved boundaries are made possible through an a posteriori limiting technique.
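Schematically, a ROD-type constrained reconstruction can be written as follows (a simplified sketch, not the exact formulation of this work): find a polynomial $p \in \mathbb{P}_k$ such that
\[
p \;=\; \operatorname*{arg\,min}_{q \in \mathbb{P}_k} \sum_{c \in \mathcal{S}} \omega_c \big(\bar q_c - \bar u_c\big)^2
\qquad \text{subject to} \qquad \mathcal{B}(q)(x_b) = g(x_b) \ \ \text{at boundary collocation points } x_b ,
\]
where $\bar q_c$ and $\bar u_c$ are averages over a stencil $\mathcal{S}$ with weights $\omega_c$, and $\mathcal{B}(q) = g$ encodes the prescribed boundary condition embedded in the reconstruction.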
In survival analysis, complex machine learning algorithms have been increasingly used for predictive modeling. Given a collection of features available for inclusion in a predictive model, it may be of interest to quantify the relative importance of a subset of features for the prediction task at hand. In particular, in HIV vaccine trials, participant baseline characteristics are used to predict the probability of infection over the intended follow-up period, and investigators may wish to understand how much certain types of predictors, such as behavioral factors, contribute toward overall predictiveness. Time-to-event outcomes such as time to infection are often subject to right censoring, and existing methods for assessing variable importance are typically not intended to be used in this setting. We describe a broad class of algorithm-agnostic variable importance measures for prediction in the context of survival data. We propose a nonparametric efficient estimation procedure that incorporates flexible learning of nuisance parameters, yields asymptotically valid inference, and enjoys double-robustness. We assess the performance of our proposed procedure via numerical simulations and analyze data from the HVTN 702 study to inform enrollment strategies for future HIV vaccine trials.
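For context, algorithm-agnostic variable importance measures of this type are commonly defined (a sketch of the general form, not necessarily the exact estimand used here) as a difference in oracle predictiveness,
\[
\psi_{0,s} \;=\; V\big(f_0, P_0\big) \;-\; V\big(f_{0,-s}, P_0\big),
\]
where $V$ is a measure of predictiveness, $f_0$ is the oracle prediction function using all features, and $f_{0,-s}$ is the oracle prediction function excluding the feature subset $s$.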