国产一本二本三本的区别视频_天天插天天摸久久久_日韩一区二区视频_国产精品久久久久国产网址_亚洲伦理中文字幕_亚州欧美视频一区二区三区_国产亚洲男人的天堂在线观看

The deep-learning-based least squares method has shown successful results in solving high-dimensional non-linear partial differential equations (PDEs). However, this method usually converges slowly. To speed up the convergence of this approach, an active-learning-based sampling algorithm is proposed in this paper. This algorithm actively chooses the most informative training samples from a probability density function based on residual errors to facilitate error reduction. In particular, points with larger residual errors will have more chances of being selected for training. This algorithm imitates the human learning process: learners are likely to spend more time repeatedly studying mistakes than other tasks they have correctly finished. A series of numerical results are illustrated to demonstrate the effectiveness of our active-learning-based sampling in high dimensions to speed up the convergence of the deep-learning-based least squares method.

相關內容

主動學習

關注 240

主(zhu)(zhu)動(dong)學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)是(shi)(shi)機器學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)（更(geng)普遍的(de)(de)說是(shi)(shi)人工(gong)智能）的(de)(de)一個(ge)子領域(yu)，在統(tong)計學(xue)(xue)(xue)(xue)(xue)領域(yu)也(ye)叫(jiao)查詢學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)、最優(you)實驗(yan)設計。“學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)模塊(kuai)(kuai)”和(he)“選(xuan)擇策略”是(shi)(shi)主(zhu)(zhu)動(dong)學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)算法的(de)(de)2個(ge)基本(ben)且重要(yao)的(de)(de)模塊(kuai)(kuai)。主(zhu)(zhu)動(dong)學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)是(shi)(shi)“一種學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)方(fang)(fang)法，在這種方(fang)(fang)法中(zhong)，學(xue)(xue)(xue)(xue)(xue)生會主(zhu)(zhu)動(dong)或體(ti)驗(yan)性地(di)參(can)與學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)過(guo)程，并(bing)且根(gen)據學(xue)(xue)(xue)(xue)(xue)生的(de)(de)參(can)與程度(du)，有不同(tong)程度(du)的(de)(de)主(zhu)(zhu)動(dong)學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)。” （Bonwell＆Eison 1991）Bonwell＆Eison（1991）指出：“學(xue)(xue)(xue)(xue)(xue)生除(chu)了(le)被(bei)動(dong)地(di)聽課以外，還(huan)從事(shi)其(qi)他(ta)(ta)活動(dong)。” 在高等教(jiao)育研(yan)究協會（ASHE）的(de)(de)一份(fen)報(bao)告中(zhong)，作者討論了(le)各種促進主(zhu)(zhu)動(dong)學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)的(de)(de)方(fang)(fang)法。他(ta)(ta)們引用了(le)一些(xie)文獻(xian)，這些(xie)文獻(xian)表明學(xue)(xue)(xue)(xue)(xue)生不僅要(yao)做(zuo)聽，還(huan)必須(xu)做(zuo)更(geng)多的(de)(de)事(shi)情才能學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)。他(ta)(ta)們必須(xu)閱讀，寫作，討論并(bing)參(can)與解決問題。此過(guo)程涉及三個(ge)學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)領域(yu)，即知識，技能和(he)態度(du)（KSA）。這種學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)行為分類法可以被(bei)認為是(shi)(shi)“學(xue)(xue)(xue)(xue)(xue)習(xi)(xi)(xi)過(guo)程的(de)(de)目標”。特(te)別(bie)是(shi)(shi)，學(xue)(xue)(xue)(xue)(xue)生必須(xu)從事(shi)諸如分析(xi)，綜合和(he)評估之(zhi)類的(de)(de)高級思維任務。

知識 (knowledge) · 相互獨立的 · 近似 · 數值分析 ·

2022 年 4 月 18 日

Utilizing Time-Reversibility for Shock Capturing in Nonlinear Hyperbolic Conservation Laws

Tarik Dzanic,Will Trojak,Freddie D. Witherden

from arxiv, 20 pages, 14 figures

In this work, we introduce a novel approach to formulating an artificial viscosity for shock capturing in nonlinear hyperbolic systems by utilizing the property that the solutions of hyperbolic conservation laws are not reversible in time in the vicinity of shocks. The proposed approach does not require any additional governing equations or a priori knowledge of the hyperbolic system in question, is independent of the mesh and approximation order, and requires the use of only one tunable parameter. The primary novelty is that the resulting artificial viscosity is unique for each component of the conservation law which is advantageous for systems in which some components exhibit discontinuities while others do not. The efficacy of the method is shown in numerical experiments of multi-dimensional hyperbolic conservation laws such as nonlinear transport, Euler equations, and ideal magnetohydrodynamics using a high-order discontinuous spectral element method on unstructured grids.

近似 · 維數災難 · 前向 · 后向 · 優化器 ·

2022 年 4 月 18 日

Multilevel Picard approximations for high-dimensional decoupled forward-backward stochastic differential equations

Martin Hutzenthaler,Tuan Anh Nguyen

from arxiv, 39 pages

Backward stochastic differential equations (BSDEs) appear in numeruous applications. Classical approximation methods suffer from the curse of dimensionality and deep learning-based approximation methods are not known to converge to the BSDE solution. Recently, Hutzenthaler et al. (arXiv:2108.10602) introduced a new approximation method for BSDEs whose forward diffusion is Brownian motion and proved that this method converges with essentially optimal rate without suffering from the curse of dimensionality. The central object of this article is to extend this result to general forward diffusions. The main challenge is that we need to establish convergence in temporal-spatial H\"older norms since the forward diffusion cannot be sampled exactly in general.

提議分布 · 估計/估計量 · 重要性采樣 · 方差 · 樣本 ·

2022 年 4 月 18 日

Selection of proposal distributions for multiple importance sampling

Vivekananda Roy,Evangelos Evangelou

The naive importance sampling (IS) estimator generally does not work well in examples involving simultaneous inference on several targets, as the importance weights can take arbitrarily large values, making the estimator highly unstable. In such situations, alternative multiple IS estimators involving samples from multiple proposal distributions are preferred. Just like the naive IS, the success of these multiple IS estimators crucially depends on the choice of the proposal distributions. The selection of these proposal distributions is the focus of this article. We propose three methods: (i) a geometric space filling approach, (ii) a minimax variance approach, and (iii) a maximum entropy approach. The first two methods are applicable to any IS estimator, whereas the third approach is described in the context of Doss's (2010) two-stage IS estimator. For the first method, we propose a suitable measure of 'closeness' based on the symmetric Kullback-Leibler divergence, while the second and third approaches use estimates of asymptotic variances of Doss's (2010) IS estimator and Geyer's (1994) reverse logistic regression estimator, respectively. Thus, when samples from the proposal distributions are obtained by running Markov chains, we provide consistent spectral variance estimators for these asymptotic variances. The proposed methods for selecting proposal densities are illustrated using various detailed examples.

子采樣 · 嶺回歸 · 優化器 · 隨機采樣 · Performer ·

2022 年 4 月 18 日

Optimal Subsampling for High-dimensional Ridge Regression

Hanyu Li,Chengmei Niu

We investigate the feature compression of high-dimensional ridge regression using the optimal subsampling technique. Specifically, based on the basic framework of random sampling algorithm on feature for ridge regression and the A-optimal design criterion, we first obtain a set of optimal subsampling probabilities. Considering that the obtained probabilities are uneconomical, we then propose the nearly optimal ones. With these probabilities, a two step iterative algorithm is established which has lower computational cost and higher accuracy. We provide theoretical analysis and numerical experiments to support the proposed methods. Numerical results demonstrate the decent performance of our methods.

Neural Networks · 學成 · Networking · 損失函數（機器學習） · 近似 ·

2022 年 4 月 18 日

A Deep Learning Galerkin Method for the Closed-Loop Geothermal System

Wen Zhang,Jian Li

from arxiv, 29 pages, 7 figures, 1 tables

There has been an arising trend of adopting deep learning methods to study partial differential equations (PDEs). This article is to propose a Deep Learning Galerkin Method (DGM) for the closed-loop geothermal system, which is a new coupled multi-physics PDEs and mainly consists of a framework of underground heat exchange pipelines to extract the geothermal heat from the geothermal reservoir. This method is a natural combination of Galerkin Method and machine learning with the solution approximated by a neural network instead of a linear combination of basis functions. We train the neural network by randomly sampling the spatiotemporal points and minimize loss function to satisfy the differential operators, initial condition, boundary and interface conditions. Moreover, the approximate ability of the neural network is proved by the convergence of the loss function and the convergence of the neural network to the exact solution in L^2 norm under certain conditions. Finally, some numerical examples are carried out to demonstrate the approximation ability of the neural networks intuitively.

廣義線性模型 · 遷移學習 · 學成 · 線性模型 · INFORMS ·

2022 年 4 月 17 日

Transfer Learning under High-dimensional Generalized Linear Models

Ye Tian,Yang Feng

from arxiv, 94 pages, 11 figures

In this work, we study the transfer learning problem under high-dimensional generalized linear models (GLMs), which aim to improve the fit on target data by borrowing information from useful source data. Given which sources to transfer, we propose a transfer learning algorithm on GLM, and derive its $\ell_1/\ell_2$-estimation error bounds as well as a bound for a prediction error measure. The theoretical analysis shows that when the target and source are sufficiently close to each other, these bounds could be improved over those of the classical penalized estimator using only target data under mild conditions. When we don't know which sources to transfer, an algorithm-free transferable source detection approach is introduced to detect informative sources. The detection consistency is proved under the high-dimensional GLM transfer learning setting. We also propose an algorithm to construct confidence intervals of each coefficient component, and the corresponding theories are provided. Extensive simulations and a real-data experiment verify the effectiveness of our algorithms. We implement the proposed GLM transfer learning algorithms in a new R package glmtrans, which is available on CRAN.

動力系統 · 邊緣化 · MoDELS · 估計誤差 · Performer ·

2022 年 4 月 15 日

Space-sequential particle filters for high-dimensional dynamical systems described by stochastic differential equations

Deniz Akyildiz,Dan Crisan,Joaquin Miguez

We introduce a novel methodology for particle filtering in dynamical systems where the evolution of the signal of interest is described by a SDE and observations are collected instantaneously at prescribed time instants. The new approach includes the discretisation of the SDE and the design of efficient particle filters for the resulting discrete-time state-space model. The discretisation scheme converges with weak order 1 and it is devised to create a sequential dependence structure along the coordinates of the discrete-time state vector. We introduce a class of space-sequential particle filters that exploits this structure to improve performance when the system dimension is large. This is numerically illustrated by a set of computer simulations for a stochastic Lorenz 96 system with additive noise. The new space-sequential particle filters attain approximately constant estimation errors as the dimension of the Lorenz 96 system is increased, with a computational cost that increases polynomially, rather than exponentially, with the system dimension. Besides the new numerical scheme and particle filters, we provide in this paper a general framework for discrete-time filtering in continuous-time dynamical systems described by a SDE and instantaneous observations. Provided that the SDE is discretised using a weakly-convergent scheme, we prove that the marginal posterior laws of the resulting discrete-time state-space model converge to the posterior marginal posterior laws of the original continuous-time state-space model under a suitably defined metric. This result is general and not restricted to the numerical scheme or particle filters specifically studied in this manuscript.

模型選擇 · MoDELS · 蒙特卡羅 · 似然 · 邊緣似然函數 ·

2022 年 4 月 15 日

Proximal nested sampling for high-dimensional Bayesian model selection

Xiaohao Cai,Jason D. McEwen,Marcelo Pereyra

Bayesian model selection provides a powerful framework for objectively comparing models directly from observed data, without reference to ground truth data. However, Bayesian model selection requires the computation of the marginal likelihood (model evidence), which is computationally challenging, prohibiting its use in many high-dimensional Bayesian inverse problems. With Bayesian imaging applications in mind, in this work we present the proximal nested sampling methodology to objectively compare alternative Bayesian imaging models for applications that use images to inform decisions under uncertainty. The methodology is based on nested sampling, a Monte Carlo approach specialised for model comparison, and exploits proximal Markov chain Monte Carlo techniques to scale efficiently to large problems and to tackle models that are log-concave and not necessarily smooth (e.g., involving l_1 or total-variation priors). The proposed approach can be applied computationally to problems of dimension O(10^6) and beyond, making it suitable for high-dimensional inverse imaging problems. It is validated on large Gaussian models, for which the likelihood is available analytically, and subsequently illustrated on a range of imaging problems where it is used to analyse different choices of dictionary and measurement model.

圖 · 學成 · MoDELS · Extensibility · 深度學習 ·

2022 年 2 月 24 日

Bayesian Deep Learning for Graphs

Federico Errica

from arxiv, PhD Thesis

The adaptive processing of structured data is a long-standing research topic in machine learning that investigates how to automatically learn a mapping from a structured input to outputs of various nature. Recently, there has been an increasing interest in the adaptive processing of graphs, which led to the development of different neural network-based methodologies. In this thesis, we take a different route and develop a Bayesian Deep Learning framework for graph learning. The dissertation begins with a review of the principles over which most of the methods in the field are built, followed by a study on graph classification reproducibility issues. We then proceed to bridge the basic ideas of deep learning for graphs with the Bayesian world, by building our deep architectures in an incremental fashion. This framework allows us to consider graphs with discrete and continuous edge features, producing unsupervised embeddings rich enough to reach the state of the art on several classification tasks. Our approach is also amenable to a Bayesian nonparametric extension that automatizes the choice of almost all model's hyper-parameters. Two real-world applications demonstrate the efficacy of deep learning for graphs. The first concerns the prediction of information-theoretic quantities for molecular simulations with supervised neural models. After that, we exploit our Bayesian models to solve a malware-classification task while being robust to intra-procedural code obfuscation techniques. We conclude the dissertation with an attempt to blend the best of the neural and Bayesian worlds together. The resulting hybrid model is able to predict multimodal distributions conditioned on input graphs, with the consequent ability to model stochasticity and uncertainty better than most works. Overall, we aim to provide a Bayesian perspective into the articulated research field of deep learning for graphs.

MoDELS · 學成 · Networking · 動力系統 · Neural Networks ·

2022 年 2 月 4 日

On Neural Differential Equations

Patrick Kidger

from arxiv, Doctoral thesis, Mathematical Institute, University of Oxford. 231 pages

The conjoining of dynamical systems and deep learning has become a topic of great interest. In particular, neural differential equations (NDEs) demonstrate that neural networks and differential equation are two sides of the same coin. Traditional parameterised differential equations are a special case. Many popular neural network architectures, such as residual networks and recurrent networks, are discretisations. NDEs are suitable for tackling generative problems, dynamical systems, and time series (particularly in physics, finance, ...) and are thus of interest to both modern machine learning and traditional mathematical modelling. NDEs offer high-capacity function approximation, strong priors on model space, the ability to handle irregular data, memory efficiency, and a wealth of available theory on both sides. This doctoral thesis provides an in-depth survey of the field. Topics include: neural ordinary differential equations (e.g. for hybrid neural/mechanistic modelling of physical systems); neural controlled differential equations (e.g. for learning functions of irregular time series); and neural stochastic differential equations (e.g. to produce generative models capable of representing complex stochastic dynamics, or sampling from complex high-dimensional distributions). Further topics include: numerical methods for NDEs (e.g. reversible differential equations solvers, backpropagation through differential equations, Brownian reconstruction); symbolic regression for dynamical systems (e.g. via regularised evolution); and deep implicit models (e.g. deep equilibrium models, differentiable optimisation). We anticipate this thesis will be of interest to anyone interested in the marriage of deep learning with dynamical systems, and hope it will provide a useful reference for the current state of the art.