
We introduce a novel approach to waveform inversion based on a data-driven reduced order model (ROM) of the wave operator. The presentation is for the acoustic wave equation, but the approach can be extended to elastic or electromagnetic waves. The data are time-resolved measurements of the pressure wave at the sensors in an active array, which probe the unknown medium with pulses and measure the generated waves. The ROM depends nonlinearly on the data, but it can be constructed from them using numerical linear algebra methods. We show that the ROM can be used for the inverse problem of velocity estimation. While the full-waveform inversion approach of nonlinear least-squares data fitting is challenging without low-frequency information, due to multiple minima of the objective function, the minimization of the ROM misfit function behaves better, even for a poor initial guess. In fact, the ROM misfit function is demonstrably convex for low-dimensional parametrizations of the unknown velocity. We describe the construction of the ROM, introduce the inversion approach based on the ROM misfit, and assess its performance with numerical simulations.
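
As a rough illustration of the ROM-misfit idea (a minimal sketch, not the authors' exact construction), the snippet below assembles a symmetric Gram matrix from array data, orthogonalizes it with a Cholesky factorization, and compares the resulting blocks for observed and modeled data. The forward solver `simulate_data`, the parametrization `theta`, and the Frobenius misfit are illustrative assumptions.

```python
import numpy as np
from scipy.optimize import minimize

def data_gram(D):
    """Symmetric Gram (mass-like) matrix assembled from array snapshots D (nt x m),
    a stand-in for the data-driven inner products used to build the ROM."""
    return D.T @ D / D.shape[0]

def rom_block(D):
    """Orthogonalized ROM block: Cholesky factor of the data Gram matrix."""
    M = data_gram(D)
    # small diagonal shift for numerical stability of the factorization
    return np.linalg.cholesky(M + 1e-10 * np.eye(M.shape[0]))

def rom_misfit(theta, D_obs, simulate_data):
    """Frobenius misfit between ROM blocks of observed and modeled data.
    simulate_data(theta) is a hypothetical forward solver returning nt x m snapshots."""
    R_obs = rom_block(D_obs)
    R_mod = rom_block(simulate_data(theta))
    return np.linalg.norm(R_obs - R_mod, "fro") ** 2

# velocity estimation: minimize the ROM misfit over a low-dimensional parametrization
# theta0 = np.array([...]); res = minimize(rom_misfit, theta0, args=(D_obs, simulate_data))
```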

Related content

Tensors, i.e., multi-linear functions, are a fundamental building block of machine learning algorithms. To train on large datasets, it is common practice to distribute the computation among workers. However, stragglers and other faults can severely impact performance and overall training time. A novel strategy to mitigate these failures is coded computation. We introduce a new metric for analysis, the typical recovery threshold, which focuses on the most likely event, and we provide a novel construction of distributed coded tensor operations that is optimal with respect to this measure. We show that our general framework encompasses many other computational schemes and metrics as special cases. In particular, we prove that the recovery threshold and the tensor rank are recovered as special cases of the typical recovery threshold when the probability of noise, i.e., of a fault, is zero, thereby providing a noisy generalization of noiseless computation as a serendipitous result. Far from being purely theoretical constructions, these definitions lead us to practical random code constructions, namely locally random p-adic alloy codes, which are optimal with respect to these measures. We analyze experiments conducted on Amazon EC2 and establish that, as predicted by the theory, our codes are faster and more numerically stable in practice than many other benchmark computation schemes.
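
The following toy sketch illustrates the general coded-computation idea with a plain random linear code over the reals (not the paper's locally random p-adic alloy codes): matrix blocks are encoded into more tasks than blocks, and the product can be decoded from any sufficiently large subset of worker responses.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy coded matrix multiplication A @ B with random linear encoding.
# k blocks of A are encoded into n coded tasks; any k responses suffice
# with high probability (the recovery threshold in the noiseless case).
m, d, p = 60, 40, 20
k, n = 4, 7                      # k data blocks, n workers
A = rng.standard_normal((m, d))
B = rng.standard_normal((d, p))
A_blocks = np.split(A, k, axis=0)

G = rng.standard_normal((n, k))  # random generator matrix (any k rows full rank w.h.p.)
coded_tasks = [sum(G[i, j] * A_blocks[j] for j in range(k)) for i in range(n)]

# each worker i computes coded_tasks[i] @ B; suppose only workers in `returned` respond
worker_out = {i: coded_tasks[i] @ B for i in range(n)}
returned = [0, 2, 3, 6]          # any k non-straggling workers

# decode: apply the inverse of the k x k submatrix of G blockwise to the responses
G_sub = G[returned, :]
Y = np.stack([worker_out[i] for i in returned])            # shape (k, m // k, p)
decoded = np.einsum("jk,kmp->jmp", np.linalg.inv(G_sub), Y)
A_times_B = np.vstack(list(decoded))

assert np.allclose(A_times_B, A @ B)
```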

In this paper, we propose hybrid data-driven ROM closures for fluid flows. These new ROM closures combine two fundamentally different strategies: (i) purely data-driven ROM closures, both for the velocity and for the pressure; and (ii) physically based, data-driven eddy viscosity closures, which model the energy transfer in the system. The first strategy adds closure/correction terms, built from the available data, to the governing equations. The second strategy includes turbulence modeling by adding eddy viscosity terms, which are determined using machine learning techniques. The two strategies are combined for the first time in this paper to investigate a two-dimensional flow past a circular cylinder at Re=50000. Our numerical results show that the hybrid data-driven ROM is more accurate than both the purely data-driven ROM and the eddy viscosity ROM.
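
A minimal sketch of how the two closure strategies can be combined in a Galerkin-type ROM is given below; the operators, the least-squares fit of the correction term, and the norm-based eddy-viscosity surrogate are illustrative assumptions rather than the closures used in the paper.

```python
import numpy as np

# Toy hybrid ROM closure: da/dt = A a + a^T B a + C_corr a + nu_e(a) * L a
# A, B: Galerkin operators (assumed precomputed from POD modes);
# C_corr: data-driven linear correction fit to projected snapshot residuals;
# nu_e(a): eddy-viscosity coefficient, here a simple norm-based surrogate for an ML model.
r = 8                                   # ROM dimension
rng = np.random.default_rng(1)
A = -np.eye(r) + 0.05 * rng.standard_normal((r, r))
B = 0.01 * rng.standard_normal((r, r, r))
L = -np.eye(r)                          # viscous-like ROM operator

def fit_linear_correction(a_snapshots, residuals):
    """Least-squares fit of a linear closure term: residual ~ C_corr @ a."""
    C, *_ = np.linalg.lstsq(a_snapshots, residuals, rcond=None)
    return C.T

def eddy_viscosity(a, c_s=0.1):
    """Illustrative stand-in for a machine-learned eddy-viscosity coefficient."""
    return c_s * np.linalg.norm(a)

def rhs(a, C_corr):
    galerkin = A @ a + np.einsum("ijk,j,k->i", B, a, a)
    return galerkin + C_corr @ a + eddy_viscosity(a) * (L @ a)

# forward Euler time stepping of the hybrid closure model
a = rng.standard_normal(r)
C_corr = np.zeros((r, r))               # would be fit from projected DNS/LES data
for _ in range(100):
    a = a + 1e-3 * rhs(a, C_corr)
```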

Cluster-level inference procedures are widely used for brain mapping. These methods compare the size of clusters obtained by thresholding brain maps to an upper bound under the global null hypothesis, computed using Random Field Theory or permutations. However, the guarantees obtained by this type of inference - i.e. that at least one voxel in the cluster is truly activated - are not informative with regard to the strength of the signal therein. There is thus a need for methods to assess the amount of signal within clusters; yet such methods have to take into account that clusters are defined based on the data, which creates circularity in the inference scheme. This has motivated the use of post hoc estimates that allow statistically valid estimation of the proportion of activated voxels in clusters. In the context of fMRI data, the All-Resolutions Inference framework introduced in [25] provides post hoc estimates of the proportion of activated voxels. However, this method relies on parametric threshold families, which results in conservative inference. In this paper, we leverage randomization methods to adapt to data characteristics and obtain tighter false discovery control. We obtain Notip, for Non-parametric True Discovery Proportion control: a powerful, non-parametric method that yields statistically valid guarantees on the proportion of activated voxels in data-derived clusters. Numerical experiments demonstrate substantial gains in the number of detections compared with state-of-the-art methods on 36 fMRI datasets. The conditions under which the proposed method brings benefits are also discussed.
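
The sketch below gives a simplified, hedged illustration of permutation-calibrated true-discovery-proportion bounds; the per-k calibration shown here is a crude stand-in for Notip's learned templates, which are calibrated jointly over k, and the variable names are hypothetical.

```python
import numpy as np

def calibrate_thresholds(null_pvals, alpha=0.05, k_max=1000):
    """Permutation-calibrated threshold template (simplified stand-in for Notip).
    null_pvals: (n_perm, n_voxels) p-values under permuted/sign-flipped data.
    Here each t_k is the alpha-quantile of the k-th smallest null p-value across
    permutations; Notip's actual calibration is joint over k."""
    sorted_null = np.sort(null_pvals, axis=1)[:, :k_max]
    return np.quantile(sorted_null, alpha, axis=0)

def tdp_lower_bound(cluster_pvals, thresholds):
    """Lower bound on the number of true discoveries in a data-derived cluster,
    assuming the template holds simultaneously over k: for each k, at most k-1
    null p-values fall below t_k, so |{p <= t_k}| - (k - 1) are true discoveries."""
    cluster_pvals = np.sort(cluster_pvals)
    k = np.arange(1, thresholds.size + 1)
    counts = np.searchsorted(cluster_pvals, thresholds, side="right")
    return int(np.max(np.maximum(counts - k + 1, 0)))

# usage (hypothetical data): null_pvals from sign-flip permutations of fMRI contrasts
# thr = calibrate_thresholds(null_pvals)
# n_true = tdp_lower_bound(pvals[cluster], thr); proportion = n_true / cluster.size
```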

Multiple Tensor-Times-Matrix (Multi-TTM) is a key computation in algorithms for computing and operating with the Tucker tensor decomposition, which is frequently used in multidimensional data analysis. We establish communication lower bounds that determine how much data movement is required to perform the Multi-TTM computation in parallel. The crux of the proof relies on analytically solving a constrained, nonlinear optimization problem. We also present a parallel algorithm to perform this computation that organizes the processors into a logical grid with twice as many modes as the input tensor. We show that with correct choices of grid dimensions, the communication cost of the algorithm attains the lower bounds and is therefore communication optimal. Finally, we show that our algorithm can significantly reduce communication compared to the straightforward approach of expressing the computation as a sequence of tensor-times-matrix operations.
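
As a concrete illustration of the Multi-TTM computation itself (the parallel algorithm and communication analysis are not reproduced here), the snippet below contracts a 3-way tensor with a matrix in every mode, both as a sequence of single-mode TTMs and as one combined contraction; all sizes are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

# Multi-TTM: contract a 3-way tensor X (n1 x n2 x n3) with matrices U_k (r_k x n_k)
# in every mode, as arises when forming or applying a Tucker decomposition.
n1, n2, n3 = 30, 40, 50
r1, r2, r3 = 5, 6, 7
X = rng.standard_normal((n1, n2, n3))
U1 = rng.standard_normal((r1, n1))
U2 = rng.standard_normal((r2, n2))
U3 = rng.standard_normal((r3, n3))

# straightforward approach: a sequence of single-mode tensor-times-matrix products
Y_seq = np.einsum("ai,ijk->ajk", U1, X)
Y_seq = np.einsum("bj,ajk->abk", U2, Y_seq)
Y_seq = np.einsum("ck,abk->abc", U3, Y_seq)

# the same Multi-TTM expressed as one contraction (order chosen internally by einsum)
Y_all = np.einsum("ai,bj,ck,ijk->abc", U1, U2, U3, X, optimize=True)

assert np.allclose(Y_seq, Y_all)
```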

A comprehensive analysis of linear power control policies, which include the well-known greedy policy and fixed fraction policy as special cases, is provided. The notions of maximin optimal linear policy for given battery capacity $c$ and mean-to-capacity ratio $p$, as well as its $c$-universal versions, are introduced. It is shown, among other things, that the fixed fraction policy is $c$-universal additive-gap optimal but not $c$-universal multiplicative-factor optimal. Tight semi-universal bounds on the battery capacity threshold for the optimality of the greedy policy are established for certain families of energy arrival distributions.
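
A hedged simulation sketch comparing the greedy and fixed fraction policies is given below; the logarithmic reward, the exponential arrival distribution, and the numerical parameters are illustrative assumptions, not the analytical setting of the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def simulate(policy, arrivals, c):
    """Average reward of a power control policy g_t = policy(b_t) with battery
    capacity c; the per-slot reward log(1 + g_t) is an illustrative choice."""
    b, total = 0.0, 0.0
    for e in arrivals:
        b = min(b + e, c)          # harvest energy, clipped at the battery capacity
        g = min(policy(b), b)      # spend g_t, never more than what is stored
        total += np.log1p(g)
        b -= g
    return total / arrivals.size

c = 1.0
arrivals = rng.exponential(scale=0.2, size=100_000)   # mean-to-capacity ratio p ~ 0.2
p = arrivals.mean() / c

greedy = lambda b: b               # greedy policy: use everything in the battery
fixed_fraction = lambda b: p * b   # fixed fraction policy: spend fraction p of the battery

print("greedy        :", simulate(greedy, arrivals, c))
print("fixed fraction:", simulate(fixed_fraction, arrivals, c))
```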

Among many solutions to the high-dimensional approximate nearest neighbor (ANN) search problem, locality sensitive hashing (LSH) is known for its sub-linear query time and robust theoretical guarantees on query accuracy. Traditional LSH methods can generate a small number of candidates quickly from hash tables but suffer from large index sizes and hash boundary problems. Recent studies addressing these issues often incur extra overhead to identify eligible candidates or remove false positives, making query time no longer sub-linear. To address this dilemma, in this paper we propose a novel LSH scheme called DB-LSH which supports efficient ANN search for large high-dimensional datasets. It organizes the projected spaces with multi-dimensional indexes rather than fixed-width hash buckets. Our approach significantly reduces the space cost by avoiding the need to maintain many hash tables for different bucket sizes. During the query phase of DB-LSH, a small number of high-quality candidates can be generated efficiently by dynamically constructing query-based hypercubic buckets with the required widths through index-based window queries. For a dataset of $n$ $d$-dimensional points with approximation ratio $c$, our rigorous theoretical analysis shows that DB-LSH achieves a smaller query cost ${O(n^{\rho^*} d\log n)}$, where ${\rho^*}$ is bounded by ${1/c^{\alpha}}$ while the bound is ${1/c}$ in the existing work. An extensive range of experiments on real-world data demonstrates the superiority of DB-LSH over state-of-the-art methods in both efficiency and accuracy.
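
The sketch below illustrates the indexing idea described in the abstract in a much-simplified form: project the data with LSH-style random projections, index the projected points with a KD-tree, and answer queries with dynamically sized hypercubic window queries instead of fixed-width buckets. All parameters are illustrative, and this is not the DB-LSH implementation.

```python
import numpy as np
from scipy.spatial import cKDTree

rng = np.random.default_rng(0)

d, n, K = 128, 100_000, 8          # data dimension, number of points, projected dimension
X = rng.standard_normal((n, d)).astype(np.float32)

# LSH-style random projections into a low-dimensional space, indexed by a KD-tree
A = rng.standard_normal((d, K)).astype(np.float32)
P = X @ A
tree = cKDTree(P)

def query(q, w, min_cands=200):
    """Dynamically sized hypercubic bucket around the projected query:
    an L_inf ball of half-width w, widened until enough candidates are found."""
    pq = q @ A
    while True:
        cands = tree.query_ball_point(pq, r=w, p=np.inf)
        if len(cands) >= min_cands:
            break
        w *= 2.0                   # enlarge the window query if too few candidates
    cands = np.asarray(cands)
    # exact re-ranking of the small candidate set in the original space
    dists = np.linalg.norm(X[cands] - q, axis=1)
    return cands[np.argsort(dists)[:10]]

# nearest-neighbor candidates for a random query
print(query(rng.standard_normal(d).astype(np.float32), w=0.5))
```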

Time-series data, such as unsteady pressure-sensitive paint (PSP) measurement data, may contain a significant amount of random noise. In this study, we therefore investigate a noise-reduction method that combines multivariate singular spectrum analysis (MSSA) with low-dimensional data representation. MSSA is a state-space reconstruction technique that utilizes time-delay embedding, and the low-dimensional representation is achieved by projecting the data onto the singular value decomposition (SVD) basis. The noise-reduction performance of the proposed method for unsteady PSP data, i.e., the projected MSSA, is compared with that of the truncated SVD method, one of the most widely used noise-reduction methods. The results show that the projected MSSA reduces random noise better than the truncated SVD method. Additionally, in contrast to the truncated SVD method, the performance of the projected MSSA is less sensitive to the truncation rank. Furthermore, the projected MSSA achieves denoising effectively by extracting smooth trajectories in a state space from the noisy input data. We expect the projected MSSA to be effective for reducing random noise not only in PSP measurement data but also in other high-dimensional time-series data.
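
A minimal sketch of the projected-MSSA idea is given below, with SSA applied separately to each retained SVD coefficient series as a simplification of the joint multivariate embedding; the window length, ranks, and synthetic data are illustrative assumptions.

```python
import numpy as np

def hankelize(Y, L):
    """Diagonal averaging: map a trajectory-style matrix back to a time series."""
    K = Y.shape[1]
    out, cnt = np.zeros(L + K - 1), np.zeros(L + K - 1)
    for i in range(L):
        out[i:i + K] += Y[i]
        cnt[i:i + K] += 1
    return out / cnt

def projected_mssa(X, rank_proj, L, rank_ssa):
    """Sketch of projected MSSA: (1) project the multichannel series X (n_time x
    n_channels) onto a truncated SVD basis, (2) denoise each retained coefficient
    series via time-delay embedding and truncated SVD, (3) lift back to channels."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    coeffs = U[:, :rank_proj] * s[:rank_proj]            # low-dimensional representation
    denoised = np.empty_like(coeffs)
    n = X.shape[0]
    for j in range(rank_proj):
        c = coeffs[:, j]
        H = np.stack([c[i:n - L + 1 + i] for i in range(L)])   # trajectory matrix
        Uh, sh, Vh = np.linalg.svd(H, full_matrices=False)
        H_lr = (Uh[:, :rank_ssa] * sh[:rank_ssa]) @ Vh[:rank_ssa]
        denoised[:, j] = hankelize(H_lr, L)
    return denoised @ Vt[:rank_proj]                      # back to the original channels

# usage (synthetic stand-in for PSP frames flattened to n_time x n_pixels)
rng = np.random.default_rng(0)
t = np.linspace(0, 10, 500)[:, None]
clean = np.sin(2 * np.pi * t) @ rng.standard_normal((1, 50))
noisy = clean + 0.5 * rng.standard_normal(clean.shape)
rec = projected_mssa(noisy, rank_proj=3, L=60, rank_ssa=2)
```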

In repeated measures designs with multiple groups, the primary purpose is to compare the groups in various respects. For several reasons, the number of measurements, and therefore the dimension of the observation vectors, can depend on the group, making the use of existing approaches impossible. We develop an approach that can be used not only for a possibly increasing number of groups $a$, but also for group-dependent dimensions $d_i$, which are allowed to go to infinity. This is a unique high-dimensional asymptotic framework, notable for its generality, that dispenses with the usual conditions on the relation between sample size and dimension. In particular, it includes settings with fixed dimensions in some groups and increasing dimensions in others, which can be seen as semi-high-dimensional. To construct an appropriate test statistic, new estimators are developed that can be used under these diverse settings on $a$, $d_i$ and $n_i$ without any adjustments. We investigate the asymptotic distribution of a quadratic-form-based test statistic and develop an asymptotically correct test. Finally, an extensive simulation study is conducted to investigate the role of the individual groups' dimensions.

Enhancing existing transmission lines is a useful tool to combat transmission congestion and guarantee transmission security under increasing demand and growing penetration of renewable energy sources. This study concerns the selection of the lines whose capacity should be expanded, and by how much, from the perspective of the independent system operator (ISO), so as to minimize the system cost subject to transmission line constraints and the balance between electricity generation and demand, while incorporating ramp-up and startup ramp rates, shutdown ramp rates, ramp-down rate limits, and minimum up and down times. For that purpose, we develop the ISO unit commitment and economic dispatch model and cast it as a mixed integer linear programming (MILP) problem subject to multi-parametric analysis with right-hand-side uncertainty. We first relax the binary variables to continuous variables and employ the Lagrange method and the Karush-Kuhn-Tucker conditions to obtain optimal solutions (optimal decision variables and objective function values) and the critical regions associated with active and inactive constraints. We then extend the traditional branch and bound method to the large-scale MILP problem by determining an upper bound of the problem at each node, comparing the difference between the upper and lower bounds, and stopping at an approximate optimal solution within the decision makers' tolerated error range. In addition, the first derivative of the objective function with respect to the capacity parameter of each line is used to inform the selection of lines to relieve congestion and maximize social welfare. Finally, the amount of capacity upgrade is chosen by balancing the rate of cost reduction of the objective function with respect to the parameters against the cost of the line upgrade. Our findings are supported by numerical simulations and provide transmission line planners with decision-making guidance.
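
The core sensitivity argument can be illustrated on a toy dispatch LP: the duals (marginals) of the line-capacity constraints give the derivative of the system cost with respect to each capacity, and the most negative dual identifies the most valuable upgrade. The sketch below omits the unit-commitment integer variables and uses illustrative network data.

```python
import numpy as np
from scipy.optimize import linprog

# Toy dispatch LP: two generators serving 100 MW of demand over two radial lines.
# Cost coefficients ($/MWh) and line capacities (MW) are illustrative numbers.
cost = np.array([10.0, 30.0])              # cheap generator 1, expensive generator 2
line_cap = np.array([50.0, 150.0])         # capacity of line 1 (g1) and line 2 (g2)

res = linprog(
    c=cost,
    A_ub=np.eye(2), b_ub=line_cap,         # each generator's output limited by its line
    A_eq=np.array([[1.0, 1.0]]), b_eq=[100.0],   # generation must meet demand
    bounds=[(0, None), (0, None)],
    method="highs",
)

# Duals (marginals) of the capacity constraints: d(objective)/d(capacity).
# The most negative marginal identifies the congested line whose upgrade
# yields the largest cost reduction per MW of added capacity.
marginals = res.ineqlin.marginals
best_line = int(np.argmin(marginals))
print("dispatch:", res.x, "system cost:", res.fun)
print("capacity duals:", marginals, "-> upgrade line", best_line)
```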

Asymmetry, along with heteroscedasticity or contamination, often occurs as data dimensionality grows. In ultra-high dimensional data analysis, such irregular settings are usually overlooked for both theoretical and computational convenience. In this paper, we establish a framework for estimation in high-dimensional regression models using Penalized Robust Approximated quadratic M-estimators (PRAM). This framework allows general settings in which the random errors lack symmetry and homogeneity, or the covariates are not sub-Gaussian. To reduce the possible bias caused by the data's irregularity in mean regression, PRAM adopts a loss function with a flexible robustness parameter that grows with the sample size. Theoretically, we first show that, in the ultra-high dimensional setting, PRAM estimators have local estimation consistency at the minimax rate enjoyed by the LS-Lasso. We then show that PRAM with an appropriate non-convex penalty in fact agrees with the local oracle solution, and thus obtains the oracle property. Computationally, we demonstrate the performance of six PRAM estimators, obtained from three loss functions for approximation (Huber, Tukey's biweight, and Cauchy loss) combined with two penalty functions (Lasso and MCP). Our simulation studies and real data analysis demonstrate satisfactory finite sample performance of the PRAM estimators under general irregular settings.
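
A hedged sketch of one PRAM instance, the Huber-approximated loss with a Lasso penalty solved by proximal gradient descent, is given below; the step size rule, the choice of robustness parameter, and the synthetic data are illustrative assumptions (the Tukey, Cauchy, and MCP variants studied in the paper are omitted).

```python
import numpy as np

def huber_grad(r, tau):
    """Gradient of the Huber loss (robustness parameter tau) w.r.t. the residuals."""
    return np.where(np.abs(r) <= tau, r, tau * np.sign(r))

def soft_threshold(z, t):
    """Proximal operator of the L1 (Lasso) penalty."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def pram_huber_lasso(X, y, lam, tau, step=None, n_iter=500):
    """Illustrative PRAM-style estimator: Huber-approximated loss + Lasso penalty,
    solved with proximal gradient descent (ISTA)."""
    n, p = X.shape
    if step is None:
        step = 1.0 / (np.linalg.norm(X, 2) ** 2 / n)   # 1 / Lipschitz constant of the smooth part
    beta = np.zeros(p)
    for _ in range(n_iter):
        r = X @ beta - y
        grad = X.T @ huber_grad(r, tau) / n
        beta = soft_threshold(beta - step * grad, step * lam)
    return beta

# synthetic high-dimensional example with heavy-tailed errors
rng = np.random.default_rng(0)
n, p = 200, 1000
X = rng.standard_normal((n, p))
beta_true = np.zeros(p); beta_true[:5] = 2.0
y = X @ beta_true + rng.standard_t(df=2, size=n)
tau = 1.345 * np.sqrt(np.log(n))         # robustness parameter growing with n (illustrative choice)
beta_hat = pram_huber_lasso(X, y, lam=0.2, tau=tau)
print("nonzeros recovered:", np.flatnonzero(np.abs(beta_hat) > 0.1))
```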
