
We study the effect of approximation errors in assessing the extreme behaviour of univariate functionals of random objects. We develop our framework in a general setting where estimation of the extreme value index and extreme quantiles of the functional is based on an approximated value instead of the true one. As an example, we consider the effect of discretisation errors in the computation of norms of paths of stochastic processes. In particular, we quantify the connections between the sample size $n$ (the number of observed paths), the number of discretisation points $m$, and the modulus of continuity $\phi$ describing the path continuity of the underlying stochastic process. As an interesting example fitting into our framework, we consider processes of the form $Y(t) = \mathcal{R}Z(t)$, where $\mathcal{R}$ is a heavy-tailed random variable and the increments of the process $Z$ have lighter tails than $\mathcal{R}$.
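
To make the interplay between the sample size $n$, the number of grid points $m$, and the tail of $\mathcal{R}$ concrete, the following sketch simulates paths of the form $Y(t) = \mathcal{R}Z(t)$ with a Pareto-distributed $\mathcal{R}$ and Brownian $Z$ (both illustrative assumptions, not the paper's setting), computes discretised sup-norms on an $m$-point grid, and applies the Hill estimator to the resulting sample of norms.

```python
import numpy as np

def simulate_sup_norms(n=2000, m=512, alpha=2.0, seed=None):
    """Simulate n paths Y(t) = R * Z(t) on [0, 1] observed at m grid points and
    return their discretised sup-norms. R is Pareto(alpha) and Z is a standard
    Brownian motion; both distributional choices are illustrative assumptions."""
    rng = np.random.default_rng(seed)
    dt = 1.0 / m
    # Brownian paths built from Gaussian increments, one path per row.
    Z = np.cumsum(rng.normal(scale=np.sqrt(dt), size=(n, m)), axis=1)
    R = rng.pareto(alpha, size=n) + 1.0       # heavy-tailed scaling factor
    return R * np.max(np.abs(Z), axis=1)      # sup-norm on the discretisation grid

def hill_estimator(x, k):
    """Hill estimator of the extreme value index from the k largest order statistics."""
    x = np.sort(x)[::-1]
    return np.mean(np.log(x[:k])) - np.log(x[k])

norms = simulate_sup_norms(seed=1)
print("Hill estimate of the extreme value index:", hill_estimator(norms, k=100))
```

In this toy setting the extreme value index of $\mathcal{R}$ is $1/\alpha = 0.5$; increasing $m$ reduces the discretisation error in the norms, while increasing $n$ reduces the statistical error of the Hill estimator.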

Related content

In scientific studies involving analyses of multivariate data, basic but important questions often arise for the researcher: Is the sample exchangeable, meaning that the joint distribution of the sample is invariant to the ordering of the units? Are the features independent of one another, or perhaps the features can be grouped so that the groups are mutually independent? In statistical genomics, these considerations are fundamental to downstream tasks such as demographic inference and the construction of polygenic risk scores. We propose a non-parametric approach, which we call the V test, to address these two questions, namely, a test of sample exchangeability given dependency structure of features, and a test of feature independence given sample exchangeability. Our test is conceptually simple, yet fast and flexible. It controls the Type I error across realistic scenarios, and handles data of arbitrary dimensions by leveraging large-sample asymptotics. Through extensive simulations and a comparison against unsupervised tests of stratification based on random matrix theory, we find that our test compares favorably in various scenarios of interest. We apply the test to data from the 1000 Genomes Project, demonstrating how it can be employed to assess exchangeability of the genetic sample, or find optimal linkage disequilibrium (LD) splits for downstream analysis. For exchangeability assessment, we find that removing rare variants can substantially increase the p-value of the test statistic. For optimal LD splitting, the V test reports different optimal splits than previous approaches not relying on hypothesis testing. Software for our methods is available in R (CRAN: flintyR) and Python (PyPI: flintyPy).
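
The following sketch illustrates the exchangeability null in code: a permutation test whose statistic is the variance of pairwise distances between samples and which permutes each feature independently across samples, a valid resampling scheme when the features are independent. It is an illustration in the spirit of the V test, not the flintyR/flintyPy implementation; the choice of statistic and resampling scheme here are assumptions.

```python
import numpy as np
from scipy.spatial.distance import pdist

def perm_exchangeability_test(X, n_perm=999, seed=None):
    """Illustrative permutation test of sample exchangeability for a matrix X
    (rows = samples, columns = independent features). Statistic: variance of
    pairwise Euclidean distances between samples. Not the package implementation;
    see flintyR (CRAN) / flintyPy (PyPI) for the actual V test."""
    rng = np.random.default_rng(seed)
    stat = np.var(pdist(X))
    null = np.empty(n_perm)
    for b in range(n_perm):
        # Permuting each feature independently across samples preserves the joint
        # null of exchangeable samples with mutually independent features.
        Xp = np.column_stack([rng.permutation(X[:, j]) for j in range(X.shape[1])])
        null[b] = np.var(pdist(Xp))
    return (1 + np.sum(null >= stat)) / (n_perm + 1)

# Toy binary data with i.i.d. rows: the null holds, so p-values are roughly uniform.
X = np.random.default_rng(0).binomial(1, 0.3, size=(50, 200)).astype(float)
print("p-value:", perm_exchangeability_test(X, seed=1))
```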

In epidemiology and social sciences, propensity score methods are popular for estimating treatment effects using observational data, and multiple imputation is popular for handling covariate missingness. However, how to appropriately use multiple imputation for propensity score analysis is not completely clear. This paper aims to bring clarity to the consistency (or lack thereof) of methods that have been proposed, focusing on the within approach (where the effect is estimated separately in each imputed dataset and then the multiple estimates are combined) and the across approach (where typically propensity scores are averaged across imputed datasets before being used for effect estimation). We show that the within method is valid and can be used with any causal effect estimator that is consistent in the full-data setting. Existing across methods are inconsistent, but a different across method that averages the inverse probability weights across imputed datasets is consistent for propensity score weighting. We also comment on methods that rely on imputing a function of the missing covariate rather than the covariate itself, including imputation of the propensity score and of the probability weight. Based on consistency results and practical flexibility, we recommend generally using the standard within method. Throughout, we provide intuition to make the results meaningful to the broad audience of applied researchers.
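
A minimal sketch of the recommended within approach, assuming inverse probability weighting as the full-data effect estimator and pandas DataFrames with hypothetical columns y, treat, x1, and x2 as the completed datasets (none of this is the paper's code):

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm

def ipw_ate(data):
    """IPW estimate of the average treatment effect in one completed dataset.
    Assumes columns y (outcome), treat (0/1), and covariates x1, x2 (x2 imputed)."""
    X = sm.add_constant(data[["x1", "x2"]])
    ps = sm.Logit(data["treat"], X).fit(disp=0).predict(X)      # propensity scores
    w = data["treat"] / ps + (1 - data["treat"]) / (1 - ps)     # inverse probability weights
    treated = data["treat"] == 1
    return (np.average(data["y"][treated], weights=w[treated])
            - np.average(data["y"][~treated], weights=w[~treated]))

def within_estimate(imputed_datasets):
    """Within approach: estimate the effect separately in each imputed dataset and
    average the point estimates (the variance-combination step is omitted for brevity)."""
    return float(np.mean([ipw_ate(d) for d in imputed_datasets]))

# Toy usage with two hypothetical completed datasets (in practice produced by MICE).
rng = np.random.default_rng(0)
def toy_completed(n=200):
    x1, x2 = rng.normal(size=n), rng.normal(size=n)
    p = 1 / (1 + np.exp(-(0.5 * x1 + 0.5 * x2)))
    treat = rng.binomial(1, p)
    y = 1.0 * treat + x1 + x2 + rng.normal(size=n)
    return pd.DataFrame({"y": y, "treat": treat, "x1": x1, "x2": x2})

print("Within-method ATE estimate:", within_estimate([toy_completed(), toy_completed()]))
```

The across alternative would instead average the propensity scores (or, for a consistent weighting estimator, the inverse probability weights) over imputed datasets before a single weighted effect estimation.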

We study a class of Gaussian processes for which the posterior mean, for a particular choice of data, replicates a truncated Taylor expansion of any order. The data consist of derivative evaluations at the expansion point and the prior covariance kernel belongs to the class of Taylor kernels, which can be written in a certain power series form. We discuss and prove some results on maximum likelihood estimation of parameters of Taylor kernels. The proposed framework is a special case of Gaussian process regression based on data that is orthogonal in the reproducing kernel Hilbert space of the covariance kernel.
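
For concreteness, one standard member of the power-series class and the corresponding posterior mean for derivative data can be written as follows; the specific coefficient choice and the expansion point $x_0 = 0$ are illustrative assumptions.

```latex
% A kernel in power-series (Taylor) form with non-negative coefficients a_n:
\[
  k(x, x') \;=\; \sum_{n=0}^{\infty} a_n\, x^{n} (x')^{n},
  \qquad \text{e.g. } a_n = \tfrac{1}{n!} \;\Longrightarrow\; k(x, x') = e^{x x'}.
\]
% Given derivative evaluations f(0), f'(0), ..., f^{(N)}(0) at the expansion
% point, the posterior mean replicates the truncated Taylor expansion:
\[
  m_N(x) \;=\; \sum_{n=0}^{N} \frac{f^{(n)}(0)}{n!}\, x^{n}.
\]
```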

We study the continuous multi-reference alignment model of estimating a periodic function on the circle from noisy and circularly-rotated observations. Motivated by analogous high-dimensional problems that arise in cryo-electron microscopy, we establish minimax rates for estimating generic signals that are explicit in the dimension $K$. In a high-noise regime with noise variance $\sigma^2 \gtrsim K$, for signals with Fourier coefficients of roughly uniform magnitude, the rate scales as $\sigma^6$ and has no further dependence on the dimension. This rate is achieved by a bispectrum inversion procedure, and our analyses provide new stability bounds for bispectrum inversion that may be of independent interest. In a low-noise regime where $\sigma^2 \lesssim K/\log K$, the rate scales instead as $K\sigma^2$, and we establish this rate by a sharp analysis of the maximum likelihood estimator that marginalizes over latent rotations. A complementary lower bound that interpolates between these two regimes is obtained using Assouad's hypercube lemma. We extend these analyses also to signals whose Fourier coefficients have a slow power law decay.
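
In standard notation for continuous multi-reference alignment (details may differ from the paper's formulation), the observation model and the rotation-invariant bispectrum underlying the inversion procedure are:

```latex
% Noisy, circularly rotated copies of a signal f on the circle, with latent
% rotations \theta_j and additive white noise of variance \sigma^2:
\[
  Y_j(t) \;=\; f(t - \theta_j) + \sigma\, \xi_j(t), \qquad j = 1, \dots, n.
\]
% The bispectrum of the Fourier coefficients \hat{f}(k) is invariant to rotations
% of f, since a rotation multiplies \hat{f}(k) by e^{-ik\theta}:
\[
  B_f(k, \ell) \;=\; \hat{f}(k)\, \hat{f}(\ell)\, \overline{\hat{f}(k + \ell)}.
\]
```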

We study the problem of computing the preimage of a set under a neural network with piecewise-affine activation functions. We recall an old result that the preimage of a polyhedral set is again a union of polyhedral sets and can be effectively computed. We show several applications of computing the preimage for analysis and interpretability of neural networks.
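
As an illustration of why the preimage of a polyhedron is a union of polyhedra, the following sketch (a hypothetical helper for a single ReLU layer and a half-space representation $\{y : A y \le b\}$) enumerates activation patterns and returns one polyhedron per pattern:

```python
import itertools
import numpy as np

def relu_layer_preimage(W, c, A, b):
    """Preimage of the polyhedron {y : A y <= b} under x -> relu(W x + c).
    On each activation pattern s in {0,1}^d the layer is affine, so the preimage
    is a finite union of polyhedra {x : A_s x <= b_s}, one per pattern.
    (Brute-force enumeration; practical tools prune infeasible patterns.)"""
    d = W.shape[0]
    pieces = []
    for s in itertools.product([0, 1], repeat=d):
        D = np.diag(s)                                       # active rows of the ReLU
        # Region where this pattern holds: (W x + c)_i >= 0 if s_i = 1, else <= 0.
        sign = np.where(np.array(s) == 1, -1.0, 1.0)[:, None]
        A_pat = sign * W
        b_pat = -sign.ravel() * c
        # Output constraint pulled back through the affine map x -> D (W x + c).
        A_out = A @ D @ W
        b_out = b - A @ D @ c
        pieces.append((np.vstack([A_pat, A_out]), np.concatenate([b_pat, b_out])))
    return pieces   # the union over these pieces (some may be empty) is the preimage
```

For deeper networks the construction composes layer by layer, propagating each polyhedral piece backwards; the number of pieces can grow quickly, which is why implementations discard empty regions as they go.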

Modeling seeks to tame complexity during software development by supporting design, analysis, and stakeholder communication. Paradoxically, experiences reported by educators indicate that students often perceive modeling as adding complexity instead of reducing it. In this position paper, I analyse modeling education through the lens of complexity science, a theoretical framework for the study of complex systems. I revisit pedagogical literature where complexity science has been used as a framework for general education and for subject-specific education in disciplines such as medicine, project management, and sustainability. I then revisit complexity-related challenges from the modeling education literature, discuss them in the light of complexity science, and present recommendations for taming complexity when teaching modeling.

The numerical solution of continuum damage mechanics (CDM) problems suffers from critical points during the material softening stage, and consequently existing iterative solvers are subject to a trade-off between computational expense and solution accuracy. Displacement-controlled arc-length methods were developed to address these challenges, but are currently applicable only to geometrically non-linear problems. In this work, we present a novel displacement-controlled arc-length (DAL) method for CDM problems, in both local damage and non-local gradient damage versions. The analytical tangent matrix is derived for the DAL solver for both the local and the non-local models. In addition, several consistent and non-consistent implementation algorithms are proposed, implemented, and evaluated. Unlike existing force-controlled arc-length solvers that monolithically scale the external force vector, the proposed method treats the external force vector as an independent variable and determines the position of the system on the equilibrium path based on all the nodal variations of the external force vector (see the generic formulation sketched below for contrast). This flexible approach renders the proposed solver substantially more efficient and versatile than existing solvers used in CDM problems. The considerable advantages of the proposed DAL algorithm are demonstrated on several benchmark 1D problems with sharp snap-backs and on 2D examples with various boundary conditions and loading scenarios, where the proposed method drastically outperforms existing conventional approaches in terms of accuracy, computational efficiency, and the ability to predict the complete equilibrium path, including all critical points.
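
For contrast with the force-controlled scheme mentioned above, the conventional arc-length method can be written in generic notation (this is the standard formulation, not the paper's DAL system): a single load factor $\lambda$ scales a fixed external force vector, and a scalar constraint closes the augmented system.

```latex
% Conventional force-controlled arc-length method: equilibrium residual plus a
% scalar constraint on the combined displacement/load increment (Crisfield-type):
\[
  R(u, \lambda) \;=\; F_{\mathrm{int}}(u) - \lambda\, \hat{F} \;=\; 0,
  \qquad
  g(\Delta u, \Delta\lambda)
  \;=\; \Delta u^{\top} \Delta u
      + \psi^{2}\, \Delta\lambda^{2}\, \hat{F}^{\top}\hat{F}
      - \Delta l^{2} \;=\; 0.
\]
% The DAL method described above instead treats the external force vector itself
% as an unknown, constraining the step through all of its nodal variations.
```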

The Koopman operator provides a linear perspective on non-linear dynamics by focusing on the evolution of observables in an invariant subspace. Observables of interest are typically linearly reconstructed from the Koopman eigenfunctions. Despite the broad use of Koopman operators over the past few years, there exist some misconceptions about the applicability of Koopman operators to dynamical systems with more than one fixed point. In this work, an explanation is provided for the mechanism of lifting for the Koopman operator of nonlinear systems with multiple attractors. Considering the example of the Duffing oscillator, we show that by exploiting the inherent symmetry between the basins of attraction, a linear reconstruction with three degrees of freedom in the Koopman observable space is sufficient to globally linearize the system.
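
As a rough illustration of lifting to an observable space, the sketch below fits a finite-dimensional Koopman approximation by generic EDMD with a monomial dictionary on trajectories of the unforced Duffing oscillator started in both basins of attraction. This is not the paper's symmetry-exploiting three-observable reconstruction; the parameter values and dictionary are assumptions made for the demo.

```python
import numpy as np
from scipy.integrate import solve_ivp

def duffing(t, x, delta=0.5, alpha=-1.0, beta=1.0):
    """Unforced Duffing oscillator x'' + delta x' + alpha x + beta x^3 = 0.
    With alpha < 0 < beta this is a double well with two stable fixed points."""
    return [x[1], -delta * x[1] - alpha * x[0] - beta * x[0] ** 3]

def dictionary(X, degree=4):
    """Monomial observables up to a given total degree (generic EDMD dictionary)."""
    feats = [np.ones(X.shape[1])]
    for i in range(degree + 1):
        for j in range(degree + 1 - i):
            if i + j > 0:
                feats.append(X[0] ** i * X[1] ** j)
    return np.vstack(feats)

# Collect snapshot pairs from trajectories started in both basins of attraction.
dt, pairs = 0.05, []
for x0 in [(1.5, 0.0), (-1.5, 0.0), (0.2, 1.0), (-0.2, -1.0)]:
    sol = solve_ivp(duffing, (0, 20), x0, t_eval=np.arange(0, 20, dt), rtol=1e-8)
    pairs.append(sol.y)
X = np.hstack([p[:, :-1] for p in pairs])
Y = np.hstack([p[:, 1:] for p in pairs])

# EDMD: least-squares fit of the Koopman operator on the dictionary space.
PX, PY = dictionary(X), dictionary(Y)
K = PY @ np.linalg.pinv(PX)
eigvals = np.linalg.eigvals(K)
print("Leading Koopman eigenvalue magnitudes (discrete time):",
      np.sort(np.abs(eigvals))[::-1][:5])
```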

Introduction. There is currently no guidance on how to assess the calibration of multistate models used for risk prediction. We introduce several techniques that can be used to produce calibration plots for the transition probabilities of a multistate model, and assess their performance in the presence of non-informative and informative censoring through simulation. Methods. We studied pseudo-values based on the Aalen-Johansen estimator, binary logistic regression with inverse probability of censoring weights (BLR-IPCW), and multinomial logistic regression with inverse probability of censoring weights (MLR-IPCW). The MLR-IPCW approach results in a calibration scatter plot, providing additional insight into calibration. We simulated data with varying levels of censoring and evaluated the ability of each method to estimate the calibration curve for a set of predicted transition probabilities. We also evaluated the calibration of a model predicting the incidence of cardiovascular disease, type 2 diabetes, and chronic kidney disease among a cohort of patients derived from linked primary and secondary healthcare records. Results. The pseudo-value, BLR-IPCW, and MLR-IPCW approaches give unbiased estimates of the calibration curves under non-informative censoring. These methods remain unbiased in the presence of informative censoring unless the censoring mechanism is strongly informative, with bias concentrated in regions where the predicted transition probabilities have low density. Conclusions. We recommend implementing either the pseudo-value or BLR-IPCW approach to produce a calibration curve, combined with the MLR-IPCW approach to produce a calibration scatter plot, which provides information beyond either of the other methods.
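
A minimal sketch of a BLR-IPCW-style calibration curve for a single predicted transition probability, assuming the inverse probability of censoring weights have already been estimated for the uncensored subjects. Function and variable names are hypothetical, and the paper's implementation (e.g. its use of flexible calibration curves) may differ.

```python
import numpy as np
import statsmodels.api as sm

def blr_ipcw_calibration(pred_prob, observed, ipcw, grid=None):
    """Sketch of a BLR-IPCW calibration curve: regress the observed state
    indicator on the logit of the predicted transition probability, weighting
    uncensored subjects by inverse probability of censoring weights.
    pred_prob : predicted transition probabilities in (0, 1)
    observed  : 0/1 indicator of occupying the state at the evaluation time
    ipcw      : inverse probability of censoring weights for these subjects"""
    logit_p = np.log(pred_prob / (1 - pred_prob))
    X = sm.add_constant(logit_p)
    fit = sm.GLM(observed, X, family=sm.families.Binomial(),
                 var_weights=ipcw).fit()
    grid = np.linspace(pred_prob.min(), pred_prob.max(), 100) if grid is None else grid
    Xg = sm.add_constant(np.log(grid / (1 - grid)))
    return grid, fit.predict(Xg)   # (predicted, observed) pairs along the curve
```

Plotting the returned observed probabilities against the grid of predicted probabilities, with the diagonal as reference, gives the calibration plot for that transition.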

This paper aims to reconstruct the initial condition of a hyperbolic equation with an unknown damping coefficient. Our approach involves approximating the solution of the hyperbolic equation by its truncated Fourier expansion in the time domain with respect to a polynomial-exponential basis. This truncation eliminates the time variable and yields a system of quasi-linear elliptic equations. To solve the system globally without needing an accurate initial guess, we employ the Carleman contraction principle. We provide several numerical examples to illustrate the efficacy of the method, which delivers accurate solutions with remarkable computational efficiency.
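
Schematically, and in generic notation assumed here rather than taken from the paper, the truncation step can be written as follows for a damped wave equation with unknown coefficient $a(x)$:

```latex
% Truncated expansion of the solution in a polynomial-exponential basis {\Psi_n}
% of the time variable, and the corresponding reconstruction of the initial data:
\[
  u(x, t) \;\approx\; \sum_{n=1}^{N} u_n(x)\, \Psi_n(t),
  \qquad
  u(x, 0) \;\approx\; \sum_{n=1}^{N} u_n(x)\, \Psi_n(0).
\]
% Substituting this ansatz into, e.g., u_{tt} = \Delta u + a(x) u_t and projecting
% onto the basis eliminates t, coupling the spatial coefficients u_1, ..., u_N in a
% quasi-linear elliptic system that is solved globally via the Carleman contraction
% principle; the initial condition is then read off from the second display above.
```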
