青青国产成人久久激情91,欧美体内SHE精高潮,亚洲人成无码WWW久久久久久久,有码无码中文字幕自慰系列,婷婷激情五月天在线观看

Elastic geophysical properties (such as P- and S-wave velocities) are of great importance to various subsurface applications like CO$_2$ sequestration and energy exploration (e.g., hydrogen and geothermal). Elastic full waveform inversion (FWI) is widely applied for characterizing reservoir properties. In this paper, we introduce $\mathbf{\mathbb{E}^{FWI}}$, a comprehensive benchmark dataset that is specifically designed for elastic FWI. $\mathbf{\mathbb{E}^{FWI}}$ encompasses 8 distinct datasets that cover diverse subsurface geologic structures (flat, curve, faults, etc). The benchmark results produced by three different deep learning methods are provided. In contrast to our previously presented dataset (pressure recordings) for acoustic FWI (referred to as OpenFWI), the seismic dataset in $\mathbf{\mathbb{E}^{FWI}}$ has both vertical and horizontal components. Moreover, the velocity maps in $\mathbf{\mathbb{E}^{FWI}}$ incorporate both P- and S-wave velocities. While the multicomponent data and the added S-wave velocity make the data more realistic, more challenges are introduced regarding the convergence and computational cost of the inversion. We conduct comprehensive numerical experiments to explore the relationship between P-wave and S-wave velocities in seismic data. The relation between P- and S-wave velocities provides crucial insights into the subsurface properties such as lithology, porosity, fluid content, etc. We anticipate that $\mathbf{\mathbb{E}^{FWI}}$ will facilitate future research on multiparameter inversions and stimulate endeavors in several critical research topics of carbon-zero and new energy exploration. All datasets, codes and relevant information can be accessed through our website at //efwi-lanl.github.io/

相關內容

全

關注 0

離散化 · Principle · UniFormer · 論文 · 數值分析 ·

2023 年 10 月 23 日

On the monotonicity of $Q^2$ spectral element method for Laplacian on quasi-uniform rectangular meshes

Logan J. Cross,Xiangxiong Zhang

from arxiv, arXiv admin note: substantial text overlap with arXiv:2010.07282

The monotonicity of discrete Laplacian implies discrete maximum principle, which in general does not hold for high order schemes. The $Q^2$ spectral element method has been proven monotone on a uniform rectangular mesh. In this paper we prove the monotonicity of the $Q^2$ spectral element method on quasi-uniform rectangular meshes under certain mesh constraints. In particular, we propose a relaxed Lorenz's condition for proving monotonicity.

優化器 · Learning · 標準正交 · 深度學習 · 歐氏空間 ·

2023 年 10 月 23 日

Simplifying Momentum-based Positive-definite Submanifold Optimization with Applications to Deep Learning

Wu Lin,Valentin Duruisseaux,Melvin Leok,Frank Nielsen,Mohammad Emtiyaz Khan,Mark Schmidt

from arxiv, An updated version of the ICML 2023 paper. Updated the main text to emphasize challenges of using existing Riemannian methods to estimate sparse and structured SPD matrices

Riemannian submanifold optimization with momentum is computationally challenging because, to ensure that the iterates remain on the submanifold, we often need to solve difficult differential equations. Here, we simplify such difficulties for a class of sparse or structured symmetric positive-definite matrices with the affine-invariant metric. We do so by proposing a generalized version of the Riemannian normal coordinates that dynamically orthonormalizes the metric and locally converts the problem into an unconstrained problem in the Euclidean space. We use our approach to simplify existing approaches for structured covariances and develop matrix-inverse-free $2^\text{nd}$-order optimizers for deep learning with low precision by using only matrix multiplications. Code: //github.com/yorkerlin/StructuredNGD-DL

Facebook AI Research · GROUP · 正則的 · 數據轉換 · 特征向量 ·

2023 年 10 月 23 日

A Canonical Data Transformation for Achieving Inter- and Within-group Fairness

Zachary McBride Lazri,Ivan Brugere,Xin Tian,Dana Dachman-Soled,Antigoni Polychroniadou,Danial Dervovic,Min Wu

Increases in the deployment of machine learning algorithms for applications that deal with sensitive data have brought attention to the issue of fairness in machine learning. Many works have been devoted to applications that require different demographic groups to be treated fairly. However, algorithms that aim to satisfy inter-group fairness (also called group fairness) may inadvertently treat individuals within the same demographic group unfairly. To address this issue, we introduce a formal definition of within-group fairness that maintains fairness among individuals from within the same group. We propose a pre-processing framework to meet both inter- and within-group fairness criteria with little compromise in accuracy. The framework maps the feature vectors of members from different groups to an inter-group-fair canonical domain before feeding them into a scoring function. The mapping is constructed to preserve the relative relationship between the scores obtained from the unprocessed feature vectors of individuals from the same demographic group, guaranteeing within-group fairness. We apply this framework to the COMPAS risk assessment and Law School datasets and compare its performance in achieving inter-group and within-group fairness to two regularization-based methods.

MoDELS · motivation · state-of-the-art · Performer · Machine Translation ·

2023 年 10 月 23 日

Adaptive Policy with Wait-$k$ Model for Simultaneous Translation

Libo Zhao,Kai Fan,Wei Luo,Jing Wu,Shushu Wang,Ziqian Zeng,Zhongqiang Huang

from arxiv, Accept to EMNLP 2023 main conference. 17 pages, 12 figures, 5 tables

Simultaneous machine translation (SiMT) requires a robust read/write policy in conjunction with a high-quality translation model. Traditional methods rely on either a fixed wait-$k$ policy coupled with a standalone wait-$k$ translation model, or an adaptive policy jointly trained with the translation model. In this study, we propose a more flexible approach by decoupling the adaptive policy model from the translation model. Our motivation stems from the observation that a standalone multi-path wait-$k$ model performs competitively with adaptive policies utilized in state-of-the-art SiMT approaches. Specifically, we introduce DaP, a divergence-based adaptive policy, that makes read/write decisions for any translation model based on the potential divergence in translation distributions resulting from future information. DaP extends a frozen wait-$k$ model with lightweight parameters, and is both memory and computation efficient. Experimental results across various benchmarks demonstrate that our approach offers an improved trade-off between translation accuracy and latency, outperforming strong baselines.

INTERACT · 語言模型化 · MoDELS · INFORMS · state-of-the-art ·

2023 年 10 月 23 日

I^3 Retriever: Incorporating Implicit Interaction in Pre-trained Language Models for Passage Retrieval

Qian Dong,Yiding Liu,Qingyao Ai,Haitao Li,Shuaiqiang Wang,Yiqun Liu,Dawei Yin,Shaoping Ma

from arxiv, 10 pages

Passage retrieval is a fundamental task in many information systems, such as web search and question answering, where both efficiency and effectiveness are critical concerns. In recent years, neural retrievers based on pre-trained language models (PLM), such as dual-encoders, have achieved huge success. Yet, studies have found that the performance of dual-encoders are often limited due to the neglecting of the interaction information between queries and candidate passages. Therefore, various interaction paradigms have been proposed to improve the performance of vanilla dual-encoders. Particularly, recent state-of-the-art methods often introduce late-interaction during the model inference process. However, such late-interaction based methods usually bring extensive computation and storage cost on large corpus. Despite their effectiveness, the concern of efficiency and space footprint is still an important factor that limits the application of interaction-based neural retrieval models. To tackle this issue, we incorporate implicit interaction into dual-encoders, and propose I^3 retriever. In particular, our implicit interaction paradigm leverages generated pseudo-queries to simulate query-passage interaction, which jointly optimizes with query and passage encoders in an end-to-end manner. It can be fully pre-computed and cached, and its inference process only involves simple dot product operation of the query vector and passage vector, which makes it as efficient as the vanilla dual encoders. We conduct comprehensive experiments on MSMARCO and TREC2019 Deep Learning Datasets, demonstrating the I^3 retriever's superiority in terms of both effectiveness and efficiency. Moreover, the proposed implicit interaction is compatible with special pre-training and knowledge distillation for passage retrieval, which brings a new state-of-the-art performance.

閾值 · Performer · 優化器 · 縮放 · Performance ·

2023 年 10 月 21 日

An $\textit{M}$-ary Concentration Shift Keying With Common Detection Thresholds For Multi-Transmitter Molecular Communication

Ethungshan Shitiri,Ho-Shin Cho

from arxiv, Submitted to IEEE for possible publication

Concentration shift keying (CSK) is a widely adopted modulation technique for molecular communication-based nanonetworks, which is a key enabler for the Internet of Bio-NanoThings (IoBNT). However, existing methods provide optimal error performance at the cost of high operational complexity that scales poorly as the number of transmitters, $K$, increases. This paper proposes a novel $M$-ary CSK method termed CSK with Common detection Thresholds (CSK-CT). CSK-CT uses $\textit{common}$ thresholds that are sufficiently low to ensure the reliable detection of symbols transmitted by every transmitter, regardless of their distance. We derive closed-form expressions to obtain the common thresholds and release concentrations. To enhance the error performance, we optimize the release concentration using a scaling exponent that further optimizes the common thresholds. We evaluate the performance of CSK-CT in comparison to the benchmark CSK for varying values of $K$ and $M$. In terms of the error probability, CSK-CT offers between $10^{-7}$ and $10^{-4}$, which are a substantial improvement from the $10^{-4}$ to $10^{-3}$ offered by the benchmark. In terms of complexity, CSK-CT is $\textit{O}\big(n\big)$ and does not scale with $K$ but $M$ ($M\ll K$), while the benchmark is $\textit{O}\big(n^2\big)$. Furthermore, CSK-CT showcased the ability to mitigate inter-symbol interference, although this facet warrants further investigation. Due to its low error probability, improved scalability, low complexity, and potential ISI mitigation features, CSK-CT demonstrates benefits in applications of IoBNT focused on data-gathering. Specifically, its utility is well-noted in settings where a computationally strained receiver collects sensitive health-related data from multiple transmitters.

SRDS · 次最優 · Boosting（一種模型訓練加速方式） · 值域 · 全 ·

2023 年 10 月 20 日

Breaking the $O(\sqrt n)$-Bit Barrier: Byzantine Agreement with Polylog Bits Per Party

Elette Boyle,Ran Cohen,Aarushi Goel

from arxiv, Preliminary version appeared in PODC'21; full version appeared in Journal of Cryptology 2023

Byzantine agreement (BA), the task of $n$ parties to agree on one of their input bits in the face of malicious agents, is a powerful primitive that lies at the core of a vast range of distributed protocols. Interestingly, in protocols with the best overall communication, the demands of the parties are highly unbalanced: the amortized cost is $\tilde O(1)$ bits per party, but some parties must send $\Omega(n)$ bits. In best known balanced protocols, the overall communication is sub-optimal, with each party communicating $\tilde O(\sqrt{n})$. In this work, we ask whether asymmetry is inherent for optimizing total communication. Our contributions in this line are as follows: 1) We define a cryptographic primitive, succinctly reconstructed distributed signatures (SRDS), that suffices for constructing $\tilde O(1)$ balanced BA. We provide two constructions of SRDS from different cryptographic and Public-Key Infrastructure (PKI) assumptions. 2) The SRDS-based BA follows a paradigm of boosting from "almost-everywhere" agreement to full agreement, and does so in a single round. We prove that PKI setup and cryptographic assumptions are necessary for such protocols in which every party sends $o(n)$ messages. 3) We further explore connections between a natural approach toward attaining SRDS and average-case succinct non-interactive argument systems (SNARGs) for a particular type of NP-Complete problems (generalizing Subset-Sum and Subset-Product). Our results provide new approaches forward, as well as limitations and barriers, towards minimizing per-party communication of BA. In particular, we construct the first two BA protocols with $\tilde O(1)$ balanced communication, offering a tradeoff between setup and cryptographic assumptions, and answering an open question presented by King and Saia (DISC'09).

Storage · Weight · Processing（編程語言） · 可約的 · Integration ·

2023 年 10 月 20 日

Memory-efficient compression of $\mathcal{DH}^2$-matrices for high-frequency problems

Steffen B?rm,Janne Henningsen

Directional interpolation is a fast and efficient compression technique for high-frequency Helmholtz boundary integral equations, but it requires a very large amount of storage in its original form. Algebraic recompression can significantly reduce the storage requirements and speed up the solution process accordingly. During the recompression process, weight matrices are required to correctly measure the influence of different basis vectors on the final result, and for highly accurate approximations, these weight matrices require more storage than the final compressed matrix. We present a compression method for the weight matrices and demonstrate that it introduces only a controllable error to the overall approximation. Numerical experiments show that the new method leads to a significant reduction in storage requirements.

Networking · 曲率 · INFORMS · Networks · motivation ·

2023 年 10 月 19 日

Efficient set-theoretic algorithms for computing high-order Forman-Ricci curvature on abstract simplicial complexes

Danillo Barros de Souza,Jonatas T. S. da Cunha,Fernando A. N. Santos,Jürgen Jost,Serafim Rodrigues

Differential geometric approaches are ubiquitous in several fields of mathematics, physics and engineering, and their discretizations enable the development of network-based mathematical and computational frameworks, which are essential for large-scale data science. The Forman-Ricci curvature (FRC) - a statistical measure based on Riemannian geometry and designed for networks - is known for its high capacity for extracting geometric information from complex networks. However, extracting information from dense networks is still challenging due to the combinatorial explosion of high-order network structures. Motivated by this challenge we sought a set-theoretic representation theory for high-order network cells and FRC, as well as their associated concepts and properties, which together provide an alternative and efficient formulation for computing high-order FRC in complex networks. We provide a pseudo-code, a software implementation coined FastForman, as well as a benchmark comparison with alternative implementations. Crucially, our representation theory reveals previous computational bottlenecks and also accelerates the computation of FRC. As a consequence, our findings open new research possibilities in complex systems where higher-order geometric computations are required.

圖卷積神經網絡/圖卷積網絡 · 圖 · 圖卷積 · 圖卷積網絡 · 學成 ·

2020 年 3 月 30 日

L^2-GCN: Layer-Wise and Learned Efficient Training of Graph Convolutional Networks

Yuning You,Tianlong Chen,Zhangyang Wang,Yang Shen

from arxiv, CVPR 2020

Graph convolution networks (GCN) are increasingly popular in many applications, yet remain notoriously hard to train over large graph datasets. They need to compute node representations recursively from their neighbors. Current GCN training algorithms suffer from either high computational costs that grow exponentially with the number of layers, or high memory usage for loading the entire graph and node embeddings. In this paper, we propose a novel efficient layer-wise training framework for GCN (L-GCN), that disentangles feature aggregation and feature transformation during training, hence greatly reducing time and memory complexities. We present theoretical analysis for L-GCN under the graph isomorphism framework, that L-GCN leads to as powerful GCNs as the more costly conventional training algorithm does, under mild conditions. We further propose L^2-GCN, which learns a controller for each layer that can automatically adjust the training epochs per layer in L-GCN. Experiments show that L-GCN is faster than state-of-the-arts by at least an order of magnitude, with a consistent of memory usage not dependent on dataset size, while maintaining comparable prediction performance. With the learned controller, L^2-GCN can further cut the training time in half. Our codes are available at //github.com/Shen-Lab/L2-GCN.