
The Physics-Informed Neural Network (PINN) has proven itself a powerful tool for obtaining numerical solutions of nonlinear partial differential equations (PDEs), leveraging the expressivity of deep neural networks and the computing power of modern heterogeneous hardware. However, its training is still time-consuming, especially in multi-query and real-time simulation settings, and its parameterization is often excessive. In this paper, we propose the Generative Pre-Trained PINN (GPT-PINN) to mitigate both challenges in the setting of parametric PDEs. GPT-PINN represents a brand-new meta-learning paradigm for parametric systems. As a network of networks, its outer/meta-network is hyper-reduced, with only one hidden layer containing a significantly reduced number of neurons. Moreover, the activation function at each hidden neuron is a (full) PINN pre-trained at a judiciously selected system configuration. The meta-network adaptively "learns" the parametric dependence of the system and "grows" this hidden layer one neuron at a time. In the end, by encompassing a very small number of networks trained at this set of adaptively selected parameter values, the meta-network is capable of generating surrogate solutions for the parametric system across the entire parameter domain accurately and efficiently.
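To make the architecture concrete, here is a minimal PyTorch-style sketch of such a meta-network; the class name and details are illustrative assumptions, not the authors' code. Each hidden "neuron" is a frozen, pre-trained full PINN, and only the output combination weights are trained when the surrogate is queried at a new parameter value.

```python
import torch

class MetaPINN(torch.nn.Module):
    """Hypothetical sketch of a GPT-PINN-style meta-network.

    The single hidden layer consists of frozen, pre-trained full PINNs
    (one per adaptively selected parameter value); only the output-layer
    weights are trained for a new parameter configuration.
    """
    def __init__(self, pretrained_pinns):
        super().__init__()
        self.neurons = torch.nn.ModuleList(pretrained_pinns)
        for param in self.neurons.parameters():
            param.requires_grad_(False)      # hidden "neurons" stay frozen
        k = len(pretrained_pinns)
        self.weights = torch.nn.Parameter(torch.full((k,), 1.0 / k))

    def forward(self, xt):
        # Surrogate = learned linear combination of the pre-trained PINNs.
        outputs = torch.stack([pinn(xt) for pinn in self.neurons], dim=-1)
        return outputs @ self.weights
```

In the greedy "growth" phase the abstract alludes to, one would sweep a training set of parameter values, pick the one where the surrogate's PDE residual is largest, train a full PINN there, and append it as a new hidden neuron.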

Related content


Multiscale Finite Element Methods (MsFEMs) are now well-established finite element-type approaches dedicated to multiscale problems. They first compute local, oscillatory, problem-dependent basis functions that generate a suitable discretization space, and next perform a Galerkin approximation of the problem on that space. We investigate here how these approaches can be implemented in a non-intrusive way, in order to facilitate their dissemination within industrial codes or non-academic environments. We develop an abstract framework that covers a wide variety of MsFEMs for linear second-order partial differential equations. Non-intrusive MsFEM approaches are developed within the full generality of this framework, which may moreover prove beneficial for steering software development and for improving the theoretical understanding and analysis of MsFEMs.
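As a toy illustration of the two-step structure (local solves, then a coarse Galerkin problem), below is a minimal self-contained 1D sketch for $-(a(x)u')'=f$; it is not taken from the paper, all names are illustrative, and it shows the classical (intrusive) workflow that the paper seeks to wrap non-intrusively. In 1D the local problems $(a\varphi')'=0$ have closed-form solutions, so the effective coarse stiffness of a cell reduces to the harmonic mean of $a$.

```python
import numpy as np

# Toy 1D MsFEM for -(a(x) u')' = f on (0,1), u(0) = u(1) = 0 (illustrative).
# Step 1: local problems (a * phi')' = 0 on each coarse cell. In 1D, a * phi'
#         is constant, so a cell's effective stiffness is 1 / \int_cell dx/a.
# Step 2: Galerkin assembly and solve on the coarse mesh.
eps = 1e-3
a = lambda x: 2.0 + np.sin(2.0 * np.pi * x / eps)   # oscillatory coefficient
f = lambda x: np.ones_like(x)                       # constant source term

n_coarse = 16
xc = np.linspace(0.0, 1.0, n_coarse + 1)
A = np.zeros((n_coarse + 1, n_coarse + 1))
b = np.zeros(n_coarse + 1)
for i in range(n_coarse):
    xs = np.linspace(xc[i], xc[i + 1], 2001)        # fine quadrature per cell
    mid, dx = 0.5 * (xs[:-1] + xs[1:]), np.diff(xs)
    k_eff = 1.0 / np.sum(dx / a(mid))               # harmonic-mean stiffness
    A[i:i + 2, i:i + 2] += k_eff * np.array([[1.0, -1.0], [-1.0, 1.0]])
    b[i:i + 2] += 0.5 * np.sum(dx * f(mid))         # lumped load (sketch only)

u = np.zeros(n_coarse + 1)                          # homogeneous Dirichlet BCs
u[1:-1] = np.linalg.solve(A[1:-1, 1:-1], b[1:-1])
print(u)
```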

The Finite Volume Method (FVM) is widely adopted in many different applications because of its built-in conservation properties, its ability to deal with arbitrary meshes and its computational efficiency. In this work, we consider the Rhie-Chow stabilized Box Method (RCBM) for the approximation of the Stokes problem. The Box Method (BM) is a piecewise linear Petrov-Galerkin formulation on the Voronoi dual mesh of a Delaunay triangulation, whereas the Rhie-Chow (RC) stabilization is a well-known stabilization technique for FVM. The first part of the paper provides a variational formulation of the RC stabilization and discusses the validity of crucial properties relevant for the well-posedness and convergence of RCBM. Moreover, a numerical exploration of the convergence properties of the method on 2D and 3D test cases is presented. The last part of the paper considers the theoretical justification of the well-posedness of RCBM and of the experimentally observed convergence rates. This latter justification hinges upon suitable assumptions, whose validity is numerically explored.
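For reference (background only; the abstract itself states no formulas), the steady incompressible Stokes problem targeted here reads, in standard strong form,

```latex
\begin{aligned}
  -\nu \,\Delta \mathbf{u} + \nabla p &= \mathbf{f} && \text{in } \Omega,\\
  \nabla \cdot \mathbf{u} &= 0 && \text{in } \Omega,\\
  \mathbf{u} &= \mathbf{0} && \text{on } \partial\Omega,
\end{aligned}
```

with velocity $\mathbf{u}$, pressure $p$ and viscosity $\nu$; the Rhie-Chow correction is classically used to suppress spurious pressure modes on collocated meshes.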

We are interested in numerically solving a transitional model derived from the Bloch model. The Bloch equation describes the time evolution of the density matrix of a quantum system forced by an electromagnetic wave. In a high-frequency and low-amplitude regime, it asymptotically reduces to a non-stiff rate equation. As a middle ground, the transitional model governs the diagonal part of the density matrix. It fits into a general setting of linear problems with a high-frequency quasi-periodic forcing and an exponentially decaying forcing. The numerical resolution of such problems is challenging. Adapting high-order averaging techniques to this setting, we separate the slow (rate) dynamics from the fast (oscillatory and decaying) dynamics to derive a new micro-macro problem. We derive estimates on the size of the micro part of the decomposition and of its time derivatives, showing that this new problem is non-stiff. As such, we may solve this micro-macro problem with uniform accuracy using standard numerical schemes. To validate this approach, we present numerical results first on a toy problem and then on the transitional Bloch model.
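Schematically (in our notation, not the paper's), the decomposition takes the form

```latex
u(t) \;=\; \underbrace{v(t)}_{\substack{\text{macro: slow rate dynamics,}\\ \text{non-stiff averaged equation}}}
\;+\;
\underbrace{w(t)}_{\substack{\text{micro: oscillatory + decaying,}\\ \text{uniformly small with its derivatives}}}
```

so the coupled $(v, w)$ system can be integrated by standard schemes with accuracy uniform in the stiffness parameter.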

A common way to evaluate the reliability of dimensionality reduction (DR) embeddings is to quantify how well labeled classes form compact, mutually separated clusters in the embeddings. This approach is based on the assumption that the classes remain clear clusters in the original high-dimensional space. In reality, however, this assumption can be violated: a single class can be fragmented into multiple separated clusters, and multiple classes can be merged into a single cluster. We therefore cannot always guarantee the credibility of evaluations based on class labels. In this paper, we introduce two novel quality measures -- Label-Trustworthiness and Label-Continuity (Label-T&C) -- that advance class-label-based DR evaluation. Instead of assuming that classes are well-clustered in the original space, Label-T&C work by (1) estimating the extent to which classes form clusters in the original and embedded spaces and (2) evaluating the difference between the two. A quantitative evaluation showed that Label-T&C outperform widely used DR evaluation measures (e.g., Trustworthiness and Continuity, Kullback-Leibler divergence) in terms of accuracy in assessing how well DR embeddings preserve cluster structure, and are also scalable. Moreover, we present case studies demonstrating that Label-T&C can be successfully used to reveal the intrinsic characteristics of DR techniques and their hyperparameters.
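The following sketch conveys the two-step idea in a deliberately simplified form; it uses silhouette scores as a stand-in for the paper's cluster-level estimation and is not the authors' implementation of Label-T&C.

```python
import numpy as np
from sklearn.metrics import silhouette_score

def label_cluster_gap(X_high, X_embed, labels):
    """Step (1): estimate how well classes form clusters in the original
    and the embedded space; step (2): report the difference. A large gap
    warns that class labels are unreliable ground truth for judging this
    embedding. (Simplified stand-in, not the actual Label-T&C measures.)
    """
    s_high = silhouette_score(X_high, labels)    # clusterness, original space
    s_embed = silhouette_score(X_embed, labels)  # clusterness, embedded space
    return s_embed - s_high                      # > 0: embedding overstates clusters
```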

Riemannian submanifold optimization with momentum is computationally challenging because, to ensure that the iterates remain on the submanifold, we often need to solve difficult differential equations. Here, we alleviate these difficulties for a class of sparse or structured symmetric positive-definite matrices with the affine-invariant metric. We do so by proposing a generalized version of the Riemannian normal coordinates that dynamically orthonormalizes the metric and locally converts the problem into an unconstrained problem in Euclidean space. We use our approach to simplify existing approaches for structured covariances and develop matrix-inverse-free $2^\text{nd}$-order optimizers for deep learning with low precision by using only matrix multiplications. Code: https://github.com/yorkerlin/StructuredNGD-DL
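For context, the affine-invariant metric on the SPD manifold and its exponential map are (standard background facts, distinct from the paper's generalized normal coordinates):

```latex
g_P(U, V) = \operatorname{tr}\!\left(P^{-1} U \, P^{-1} V\right),
\qquad
\operatorname{Exp}_P(U) = P^{1/2} \exp\!\left(P^{-1/2} U P^{-1/2}\right) P^{1/2},
```

for $P$ symmetric positive-definite and $U, V$ symmetric; the congruence $U \mapsto P^{-1/2} U P^{-1/2}$ is precisely the kind of transformation that orthonormalizes the metric at the current iterate.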

Stabbing Planes (also known as Branch and Cut) is a recently introduced proof system which, informally speaking, extends the DPLL method by branching on integer linear inequalities instead of single variables. The techniques known so far for proving size and depth lower bounds for Stabbing Planes are generalizations of those used for the Cutting Planes proof system. Size lower bounds are established by monotone circuit arguments, while depth lower bounds are found via communication complexity arguments. As such, these bounds apply to lifted versions of combinatorial statements. Rank lower bounds for Cutting Planes are also obtained by geometric arguments called protection lemmas. In this work we introduce two new geometric approaches for proving size/depth lower bounds in Stabbing Planes that work for any formula: (1) the antichain method, relying on Sperner's Theorem, and (2) the covering method, which uses results on essential coverings of the Boolean cube by linear polynomials, in turn relying on Alon's combinatorial Nullstellensatz. We demonstrate their use on classes of combinatorial principles such as the Pigeonhole principle, the Tseitin contradictions and the Linear Ordering Principle. With the antichain method we prove almost linear size lower bounds and optimal logarithmic depth lower bounds for the Pigeonhole principle, and analogous lower bounds for the Tseitin contradictions over the complete graph and for the Linear Ordering Principle. With the covering method we obtain a superlinear size lower bound and a logarithmic depth lower bound for Stabbing Planes proofs of Tseitin contradictions over a grid graph.
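For the reader's convenience, the combinatorial backbone of the antichain method is Sperner's Theorem: every antichain $\mathcal{A}$ in the subset lattice $(2^{[n]}, \subseteq)$ satisfies

```latex
|\mathcal{A}| \;\le\; \binom{n}{\lfloor n/2 \rfloor}.
```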

In many applications, one wishes to obtain extreme eigenvalues and eigenvectors of large Hermitian matrices with efficient and compact algorithms. In particular, orthogonalization-free methods are preferred for large-scale problems, as they find eigenspaces of extreme eigenvalues without explicitly computing orthogonal vectors at each iteration. For the top $p$ eigenvalues, the simplest orthogonalization-free method is to find the best rank-$p$ approximation to a positive semi-definite Hermitian matrix by algorithms solving the unconstrained Burer-Monteiro formulation. We show that the nonlinear conjugate gradient method for the unconstrained Burer-Monteiro formulation is equivalent to a Riemannian conjugate gradient method on a quotient manifold with the Bures-Wasserstein metric, so its global convergence to a stationary point can be proven. Numerical tests suggest that the method is efficient for computing the largest $p$ eigenvalues of large-scale matrices when those eigenvalues are distributed nearly uniformly.
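A minimal numerical sketch of the unconstrained Burer-Monteiro route follows (plain gradient descent here for brevity, where the paper analyzes nonlinear conjugate gradient; the test matrix and step size are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 200, 5
B = rng.standard_normal((n, n + 10))
A = (B @ B.T) / n                      # PSD Hermitian (real symmetric) test matrix

# Unconstrained Burer-Monteiro objective for the best rank-p approximation:
#   f(X) = ||A - X X^T||_F^2,   grad f(X) = 4 (X X^T - A) X.
X = 0.1 * rng.standard_normal((n, p))
lr = 0.1 / np.linalg.eigvalsh(A)[-1]   # conservative step size
for _ in range(3000):
    X -= lr * 4.0 * ((X @ X.T) @ X - A @ X)

# Rayleigh-Ritz on span(X): no per-iteration orthogonalization was needed.
Q, _ = np.linalg.qr(X)
ritz = np.sort(np.linalg.eigvalsh(Q.T @ A @ Q))[::-1]
print(ritz)                            # approximates the top-p eigenvalues
print(np.linalg.eigvalsh(A)[-p:][::-1])
```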

In the rapidly advancing field of multi-modal machine learning (MMML), the convergence of multiple data modalities has the potential to reshape various applications. This paper presents a comprehensive overview of the current state, advancements, and challenges of MMML within the sphere of engineering design. The review begins with a deep dive into five fundamental concepts of MMML: multi-modal information representation, fusion, alignment, translation, and co-learning. Following this, we explore the cutting-edge applications of MMML, placing particular emphasis on tasks pertinent to engineering design, such as cross-modal synthesis, multi-modal prediction, and cross-modal information retrieval. Through this comprehensive overview, we highlight the inherent challenges of adopting MMML in engineering design and offer potential directions for future research. To spur the continued evolution of MMML in engineering design, we advocate concentrated efforts to construct extensive multi-modal design datasets, to develop effective data-driven MMML techniques tailored to design applications, and to enhance the scalability and interpretability of MMML models. As the next generation of intelligent design tools, MMML models hold great promise for shaping how products are designed.

Monte Carlo methods represent a cornerstone of computer science: they allow one to sample high-dimensional distribution functions efficiently. In this paper we consider the extension of Automatic Differentiation (AD) techniques to Monte Carlo processes, addressing the problem of obtaining derivatives (and, in general, the Taylor series) of expectation values. Borrowing ideas from the lattice field theory community, we examine two approaches. One is based on reweighting, while the other is an extension of the Hamiltonian approach typically used by the Hybrid Monte Carlo (HMC) and similar algorithms. We show that the Hamiltonian approach can be understood as a change of variables of the reweighting approach, resulting in much smaller variances for the coefficients of the Taylor series. This work opens the door to finding further variance-reduction techniques for derivatives of expectation values.
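A toy instance of the reweighting route (our example, not the paper's lattice setting): for a normalized family $p_\theta$, differentiating the reweighted expectation at the base parameter yields the connected correlator $\langle O\,\partial_\theta \log p_\theta\rangle - \langle O\rangle\langle \partial_\theta \log p_\theta\rangle$, which a single Monte Carlo sample can estimate.

```python
import numpy as np

# Toy check: p_theta = N(theta, 1) and O(x) = x^2, so exactly
# d/dtheta <x^2> = d/dtheta (theta^2 + 1) = 2 * theta.
rng = np.random.default_rng(1)
theta = 0.7
x = rng.normal(theta, 1.0, size=1_000_000)   # one Monte Carlo sample of p_theta
O = x ** 2
score = x - theta                            # d/dtheta log p_theta for N(theta, 1)

# Connected correlator <O * score> - <O><score>: subtracting the disconnected
# part changes nothing in expectation (here <score> = 0) but reduces variance.
deriv = np.mean(O * score) - np.mean(O) * np.mean(score)
print(deriv, 2.0 * theta)                    # both are approximately 1.4
```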

Knowledge distillation (KD) is the process of transferring knowledge from a large model to a small one. It has gained increasing attention in the natural language processing community, driven by the demand to compress ever-growing language models. In this work, we propose an f-DISTILL framework, which formulates sequence-level knowledge distillation as minimizing a generalized f-divergence function. We propose four distilling variants under our framework and show that the existing SeqKD and ENGINE approaches are approximations of our f-DISTILL methods. We further derive a step-wise decomposition for f-DISTILL, reducing the intractable sequence-level divergence to word-level losses that can be computed tractably. Experiments across four datasets show that our methods outperform existing KD approaches, and that our symmetric distilling losses better force the student to learn from the teacher distribution.
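As an illustration of a symmetric, word-level distilling loss in this spirit (a sketch in our notation, not the paper's released code), the Jensen-Shannon divergence, one symmetric member of the f-divergence family, can be computed per decoding step from teacher and student logits:

```python
import torch

def wordlevel_js_loss(student_logits, teacher_logits):
    """Word-level Jensen-Shannon distilling loss (illustrative sketch).

    Both inputs have shape (batch, seq_len, vocab); the teacher is frozen.
    The intractable sequence-level divergence is replaced by a mean of
    per-word terms, mirroring the step-wise decomposition described above.
    """
    p = torch.softmax(teacher_logits.detach(), dim=-1)  # teacher distribution
    q = torch.softmax(student_logits, dim=-1)           # student distribution
    m = 0.5 * (p + q)
    # JS(p, q) = 0.5 * KL(p || m) + 0.5 * KL(q || m);
    # torch.xlogy handles 0 * log 0 = 0 safely.
    js = 0.5 * (torch.xlogy(p, p) - torch.xlogy(p, m)).sum(-1) \
       + 0.5 * (torch.xlogy(q, q) - torch.xlogy(q, m)).sum(-1)
    return js.mean()                                    # average over words
```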
