
We introduce two algorithms based on a policy iteration method to numerically solve time-dependent Mean Field Game systems of partial differential equations with non-separable Hamiltonians. We prove the convergence of these algorithms on sufficiently small time intervals using the Banach fixed point method, and we show that the convergence rates are linear. We illustrate our theoretical results with numerical examples, and we discuss the performance of the proposed algorithms.
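
A minimal sketch of the fixed-point structure behind such a policy iteration: given the current policy, solve the HJB equation, solve the Fokker-Planck equation, then update the policy, and repeat until the iterates stabilise. The two PDE "solves" below are stand-ins (contractive affine maps on R^n, so the Banach fixed-point argument applies and convergence is linear); they are not the paper's discretisation.

```python
import numpy as np

def policy_iteration(solve_hjb, solve_fp, update_policy, q0,
                     tol=1e-10, max_iter=200):
    q = q0
    for k in range(max_iter):
        u = solve_hjb(q)             # value function for the frozen policy
        m = solve_fp(q)              # population density under the frozen policy
        q_new = update_policy(u, m)  # pointwise minimisation of the Hamiltonian
        err = np.max(np.abs(q_new - q))
        q = q_new
        if err < tol:
            break
    return q, k, err

# Toy stand-ins: contraction factor < 1, so the composite map is contractive.
rng = np.random.default_rng(0)
n = 50
A = 0.4 * rng.standard_normal((n, n)) / np.sqrt(n)
b = rng.standard_normal(n)
solve_hjb = lambda q: A @ q + b
solve_fp = lambda q: A.T @ q
update_policy = lambda u, m: 0.5 * (u + m)

q, k, err = policy_iteration(solve_hjb, solve_fp, update_policy, np.zeros(n))
print(f"converged in {k} iterations, residual {err:.2e}")
```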

Related content

We study the Hamilton cycle problem with input a random graph G=G(n,p) in two settings. In the first, G is given to us in the form of randomly ordered adjacency lists, while in the second we are given the adjacency matrix of G. In each setting we give a deterministic algorithm that w.h.p. either finds a Hamilton cycle or returns a certificate that no such cycle exists, for p > 0. The running times of our algorithms are w.h.p. O(n) and O(n/p) respectively, each being best possible in its own setting.
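
For intuition, here is the classical Pósa-style rotation-extension heuristic that algorithms of this kind build on; this is a hypothetical illustration, not the paper's certifying O(n)/O(n/p) algorithm.

```python
import random

def hamilton_cycle(adj, n, rng=random.Random(0), budget=20000):
    path = [0]
    on_path = {0}
    for _ in range(budget):
        end = path[-1]
        if len(path) == n and path[0] in adj[end]:
            return path  # closing edge exists: Hamilton cycle found
        ext = [v for v in adj[end] if v not in on_path]
        if ext:
            v = rng.choice(ext)
            path.append(v); on_path.add(v)
        else:
            # rotation: pick a neighbour path[i] of the endpoint and
            # reverse the tail path[i+1:], creating a new endpoint
            nbrs = [i for i, u in enumerate(path[:-2]) if u in adj[end]]
            if not nbrs:
                return None
            i = rng.choice(nbrs)
            path[i + 1:] = reversed(path[i + 1:])
    return None

# Random graph G(n, p) as adjacency sets.
n, p = 200, 0.1
rng = random.Random(1)
adj = {v: set() for v in range(n)}
for u in range(n):
    for v in range(u + 1, n):
        if rng.random() < p:
            adj[u].add(v); adj[v].add(u)
print("found" if hamilton_cycle(adj, n) else "not found")
```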

We propose and analyse an augmented mixed finite element method for the Navier--Stokes equations written in terms of velocity, vorticity, and pressure with non-constant viscosity and no-slip boundary conditions. The weak formulation includes least-squares terms arising from the constitutive equation and from the incompressibility condition, and we use a fixed point strategy to show the existence and uniqueness of continuous and discrete solutions under the assumption of sufficiently small data. The method is constructed using any compatible finite element pair (conforming or non-conforming) for velocity and pressure, as dictated by Stokes inf-sup stability, while for the vorticity any generic discrete space (of arbitrary order) can be used. We establish optimal a priori error estimates. Finally, we provide a set of numerical tests in 2D and 3D illustrating the behaviour of the scheme as well as verifying the theoretical convergence rates.
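
The fixed point strategy can be sketched at the algebraic level as a Picard iteration: freeze the convecting velocity, solve the resulting linear saddle-point system, and repeat. The "assembled" operators below are toy stand-ins, not an actual finite element assembly.

```python
import numpy as np

def picard(assemble, x0, tol=1e-10, max_iter=100):
    x = x0
    for k in range(max_iter):
        A, b = assemble(x)           # linearised system with frozen velocity
        x_new = np.linalg.solve(A, b)
        if np.linalg.norm(x_new - x) < tol * (1 + np.linalg.norm(x)):
            return x_new, k
        x = x_new
    return x, max_iter

rng = np.random.default_rng(2)
n = 30
K = np.eye(n) + 0.1 * rng.standard_normal((n, n)) / np.sqrt(n)  # "Stokes" block
f = rng.standard_normal(n)

def assemble(x):
    # small nonlinear "convective" perturbation; small data => contraction
    N = 0.05 * np.outer(np.tanh(x), np.ones(n)) / n
    return K + N, f

x, k = picard(assemble, np.zeros(n))
print(f"Picard converged in {k} iterations")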

In this paper, we propose a semigroup method for solving high-dimensional elliptic partial differential equations (PDEs) and the associated eigenvalue problems based on neural networks. For the PDE problems, we reformulate the original equations as variational problems with the help of semigroup operators and then solve the variational problems with neural network (NN) parameterization. The main advantages are that no mixed second-order derivative computation is needed during the stochastic gradient descent training and that the boundary conditions are taken into account automatically by the semigroup operator. Unlike popular methods such as PINN \cite{raissi2019physics} and Deep Ritz \cite{weinan2018deep}, where the Dirichlet boundary condition is enforced solely through penalty functions and thus alters the true solution, the proposed method addresses the boundary conditions without penalty functions and yields the correct solution even when penalty functions are added, thanks to the semigroup operator. For eigenvalue problems, a primal-dual method is proposed that efficiently resolves the constraint with a simple scalar dual variable, resulting in a faster algorithm than the BSDE solver \cite{han2020solving} on certain problems, such as the eigenvalue problem associated with the linear Schr\"odinger operator. Numerical results are provided to demonstrate the performance of the proposed methods.
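
A hypothetical 1d toy illustrating the semigroup idea (not the paper's full scheme; in particular the killed-diffusion treatment of Dirichlet boundaries is omitted): for -u'' = f on the line, the heat semigroup gives the fixed-point relation u = e^{tΔ}u + ∫_0^t e^{sΔ} f ds, and approximating the semigroup by Monte Carlo over Gaussian increments yields a residual loss with no second derivatives of the network.

```python
import torch

torch.manual_seed(0)
net = torch.nn.Sequential(torch.nn.Linear(1, 32), torch.nn.Tanh(),
                          torch.nn.Linear(32, 1))
opt = torch.optim.Adam(net.parameters(), lr=1e-3)
f = lambda x: torch.sin(x)      # u = sin is the bounded solution of -u'' = f
t, mc, batch = 0.05, 32, 256

for step in range(2000):
    x = 4 * torch.rand(batch, 1) - 2           # collocation points in [-2, 2]
    z = torch.randn(batch, mc, 1)
    # e^{t Laplacian} u(x) ~ average of u over increments sqrt(2t) Z
    semi_u = net(x.unsqueeze(1) + (2 * t) ** 0.5 * z).mean(dim=1)
    # midpoint rule for \int_0^t e^{s Laplacian} f ds (increments sqrt(t) Z)
    semi_f = t * f(x.unsqueeze(1) + t ** 0.5 * z).mean(dim=1)
    loss = ((net(x) - semi_u - semi_f) ** 2).mean()
    opt.zero_grad(); loss.backward(); opt.step()

print("final residual loss:", loss.item())
```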

A new approach for solving stiff boundary value problems for systems of ordinary differential equations is presented. Its idea essentially generalizes and extends that of arXiv:1601.04272v8. The approach can be viewed as a methodological framework that allows one to enhance the "stiffness resistance" capabilities of virtually all known numerical methods for solving two-point BVPs. The latter is demonstrated on the example of the {\it trapezoidal scheme}, with the corresponding C++ source code available at \url{https://github.com/imathsoft/MathSoftDevelopment}. Results of numerical experiments are provided to support the theoretical conclusions.
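
For reference, here is the plain trapezoidal scheme that such a framework would enhance, applied to a toy stiff two-point BVP u'' = λ²u on [0,1] with u(0)=1, u(1)=e^{-λ} (whose exact solution is e^{-λx}), written as a first-order system and solved as one linear system.

```python
import numpy as np

lam, n = 20.0, 400
h = 1.0 / n
A = np.array([[0.0, 1.0], [lam**2, 0.0]])   # y' = A y with y = (u, u')
I2 = np.eye(2)

M = np.zeros((2 * (n + 1), 2 * (n + 1)))
rhs = np.zeros(2 * (n + 1))
for i in range(n):   # trapezoidal step: y_{i+1} - y_i = h/2 (A y_i + A y_{i+1})
    r = slice(2 * i, 2 * i + 2)
    M[r, 2 * i:2 * i + 2] = -(I2 + h / 2 * A)
    M[r, 2 * i + 2:2 * i + 4] = I2 - h / 2 * A
M[-2, 0] = 1.0; rhs[-2] = 1.0                   # u(0) = 1
M[-1, 2 * n] = 1.0; rhs[-1] = np.exp(-lam)      # u(1) = exp(-lam)
y = np.linalg.solve(M, rhs).reshape(n + 1, 2)

exact = np.exp(-lam * np.linspace(0, 1, n + 1))
print("max error:", np.abs(y[:, 0] - exact).max())
```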

The system of generalized absolute value equations (GAVE) has attracted increasing attention in the optimization community. In this paper, by introducing a smoothing function, we develop a smoothing Newton algorithm with non-monotone line search to solve the GAVE. We show that the non-monotone algorithm is globally and locally quadratically convergent under a weaker assumption than those made in most existing algorithms for solving the GAVE. Numerical results are given to demonstrate the viability and efficiency of the approach.
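
A sketch of a smoothing Newton iteration for the GAVE Ax + B|x| = b, with |x| smoothed componentwise by φ_μ(x) = sqrt(x² + μ²). A plain monotone Armijo backtracking is used here for brevity; the paper's algorithm uses a non-monotone line search.

```python
import numpy as np

def smoothing_newton(A, B, b, mu=1.0, tol=1e-10, max_iter=100):
    res = lambda x, mu: A @ x + B @ np.sqrt(x**2 + mu**2) - b
    x = np.zeros_like(b)
    for _ in range(max_iter):
        if np.linalg.norm(A @ x + B @ np.abs(x) - b) < tol:
            break
        phi = np.sqrt(x**2 + mu**2)
        F = res(x, mu)
        J = A + B * (x / phi)          # Jacobian: A + B @ diag(x / phi)
        d = np.linalg.solve(J, -F)
        t = 1.0                        # Armijo backtracking on the residual
        while t > 1e-10 and np.linalg.norm(res(x + t * d, mu)) \
                > (1 - 1e-4 * t) * np.linalg.norm(F):
            t /= 2
        x = x + t * d
        mu = max(0.5 * mu, 1e-14)      # drive the smoothing parameter to 0
    return x

rng = np.random.default_rng(3)
n = 40
A = 10 * np.eye(n) + 0.1 * rng.standard_normal((n, n))
B = 0.3 * rng.standard_normal((n, n))  # sigma_min(A) > ||B||: unique solution
x_true = rng.standard_normal(n)
b = A @ x_true + B @ np.abs(x_true)
x = smoothing_newton(A, B, b)
print("residual:", np.linalg.norm(A @ x + B @ np.abs(x) - b))
```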

To overcome topological constraints and improve the expressiveness of normalizing flow architectures, Wu, K\"ohler and No\'e introduced stochastic normalizing flows, which combine deterministic, learnable flow transformations with stochastic sampling methods. In this paper, we consider stochastic normalizing flows from a Markov chain point of view. In particular, we replace transition densities by general Markov kernels and establish proofs via Radon-Nikodym derivatives, which allows us to incorporate distributions without densities in a sound way. Furthermore, we generalize the results to sampling from posterior distributions, as required in inverse problems. The performance of the proposed conditional stochastic normalizing flow is demonstrated by numerical examples.
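
A minimal 1d toy of the layer structure (a deterministic affine layer followed by Metropolis-Hastings kernel steps): in the paper's Markov-chain view both kinds of layer are Markov kernels, and importance weights come from Radon-Nikodym derivatives; the weight bookkeeping is omitted here for brevity.

```python
import numpy as np

rng = np.random.default_rng(4)
log_target = lambda x: -0.5 * (x - 3.0) ** 2   # N(3, 1) up to a constant

x = rng.standard_normal(10_000)                # latent samples ~ N(0, 1)
x = 1.0 * x + 2.0                              # deterministic affine layer

for _ in range(50):                            # stochastic MH layers
    prop = x + 0.5 * rng.standard_normal(x.size)
    accept = np.log(rng.random(x.size)) < log_target(prop) - log_target(x)
    x = np.where(accept, prop, x)

print("sample mean ~", round(x.mean(), 3), "(target mean 3.0)")
```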

We consider a biochemical model that consists of a system of partial differential equations based on reaction terms and subject to non-homogeneous Dirichlet boundary conditions. The model is discretised using the gradient discretisation method (GDM), a framework covering a large class of conforming and non-conforming schemes. Under classical regularity assumptions on the exact solutions, the GDM enables us to establish the existence of the model solutions in a weak sense, and strong convergence for the approximate solution and its approximate gradient. A numerical test employing a finite volume method is presented to demonstrate the behaviour of the model's solutions.
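
In the spirit of the numerical test, a finite-volume sketch for a 1d reaction-diffusion toy u_t = D u_xx + r(u) with non-homogeneous Dirichlet data (this is an illustrative stand-in, not the paper's biochemical system): implicit in diffusion, explicit in the reaction term.

```python
import numpy as np

D, L, n, dt, steps = 0.1, 1.0, 100, 1e-3, 500
h = L / n
uL, uR = 1.0, 0.2                         # Dirichlet boundary values
r = lambda u: u * (1 - u)                 # logistic reaction term

# Implicit diffusion matrix; boundary cells see a half-cell distance h/2,
# hence the factor 3 = 2 (boundary flux) + 1 (interior flux) on the diagonal.
A = np.zeros((n, n))
for i in range(n):
    A[i, i] = 1 + dt * D * (2 if 0 < i < n - 1 else 3) / h**2
    if i > 0: A[i, i - 1] = -dt * D / h**2
    if i < n - 1: A[i, i + 1] = -dt * D / h**2

u = np.zeros(n)
for _ in range(steps):
    rhs = u + dt * r(u)
    rhs[0] += dt * D * 2 * uL / h**2      # non-homogeneous boundary fluxes
    rhs[-1] += dt * D * 2 * uR / h**2
    u = np.linalg.solve(A, rhs)

print("boundary-adjacent values:", u[0], u[-1])
```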

A fascinating aspect of nature lies in its ability to produce a large and diverse collection of organisms that are all high-performing in their niche. By contrast, most AI algorithms focus on finding a single efficient solution to a given problem. Aiming for diversity in addition to performance is a convenient way to deal with the exploration-exploitation trade-off that plays a central role in learning. It also allows for increased robustness when the returned collection contains several working solutions to the considered problem, making it well-suited for real applications such as robotics. Quality-Diversity (QD) methods are evolutionary algorithms designed for this purpose. This paper proposes a novel algorithm, QD-PG, which combines the strength of Policy Gradient algorithms and Quality-Diversity approaches to produce a collection of diverse and high-performing neural policies in continuous control environments. The main contribution of this work is the introduction of a Diversity Policy Gradient (DPG) that exploits information at the time-step level to drive policies towards more diversity in a sample-efficient manner. Specifically, QD-PG selects neural controllers from a MAP-Elites grid and uses two gradient-based mutation operators to improve both quality and diversity, resulting in stable population updates. Our results demonstrate that QD-PG generates collections of diverse solutions that solve challenging exploration and control problems while being two orders of magnitude more sample-efficient than its evolutionary competitors.
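
A schematic MAP-Elites loop with two gradient-based mutation operators, in the spirit of QD-PG. Real QD-PG trains neural controllers with learned quality and diversity critics; in this toy, policies are 2d parameter vectors, fitness and behaviour descriptors are analytic, and both "policy gradients" are exact gradients of surrogate objectives.

```python
import numpy as np

rng = np.random.default_rng(5)
fitness = lambda th: -np.sum((th - 1.0) ** 2)   # quality objective
descriptor = lambda th: np.tanh(th)             # behaviour in [-1, 1]^2
cells = 10                                      # grid resolution per axis
grid = {}                                       # cell -> (fitness, params)

def cell_of(d):
    return tuple(np.clip(((d + 1) / 2 * cells).astype(int), 0, cells - 1))

def insert(th):
    f, c = fitness(th), cell_of(descriptor(th))
    if c not in grid or grid[c][0] < f:
        grid[c] = (f, th)

for _ in range(20):                             # random bootstrap
    insert(rng.standard_normal(2))

for it in range(2000):
    _, th = grid[list(grid)[rng.integers(len(grid))]]
    if it % 2 == 0:   # quality mutation: ascend the fitness gradient
        th = th + 0.05 * (-2 * (th - 1.0))
    else:             # diversity mutation: crude repulsion from the archive
        others = np.array([descriptor(v[1]) for v in grid.values()])
        th = th + 0.05 * (descriptor(th) - others.mean(axis=0))
    insert(th)

print(f"{len(grid)} / {cells * cells} cells filled; "
      f"best fitness {max(v[0] for v in grid.values()):.3f}")
```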

Policy gradient (PG) methods are popular reinforcement learning (RL) methods in which a baseline is often applied to reduce the variance of gradient estimates. In multi-agent RL (MARL), although the PG theorem can be naturally extended, the effectiveness of multi-agent PG (MAPG) methods degrades as the variance of gradient estimates increases rapidly with the number of agents. In this paper, we offer a rigorous analysis of MAPG methods by, firstly, quantifying the contributions of the number of agents and the agents' explorations to the variance of MAPG estimators. Based on this analysis, we derive the optimal baseline (OB) that achieves the minimal variance. In comparison to the OB, we measure the excess variance of existing MARL algorithms such as vanilla MAPG and COMA. When deep neural networks are used, we also propose a surrogate version of the OB, which can be seamlessly plugged into any existing PG method in MARL. On benchmarks of Multi-Agent MuJoCo and StarCraft challenges, our OB technique effectively stabilises training and improves the performance of multi-agent PPO and COMA algorithms by a significant margin.
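
To see why the choice of baseline matters, here is a toy demonstration on a Gaussian bandit of the classical variance-minimising baseline for a score-function estimator g = ∇log π · (R - b), namely b* = E[s²R]/E[s²] with score s; the paper derives the analogue for per-agent MAPG estimators. Any constant b leaves the estimator unbiased since E[s] = 0.

```python
import numpy as np

rng = np.random.default_rng(6)
mu, n = 0.5, 200_000
a = mu + rng.standard_normal(n)        # actions ~ N(mu, 1)
R = 2.0 * a + rng.standard_normal(n)   # noisy returns
s = a - mu                             # score: d/dmu log N(a; mu, 1)

grad_var = lambda b: np.var(s * (R - b))

b_mean = R.mean()                                # vanilla baseline E[R]
b_opt = (s**2 * R).sum() / (s**2).sum()          # optimal baseline
print(f"var with E[R] baseline    : {grad_var(b_mean):.4f}")
print(f"var with optimal baseline : {grad_var(b_opt):.4f}")
```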

We propose accelerated randomized coordinate descent algorithms for stochastic optimization and online learning. Our algorithms have significantly lower per-iteration complexity than the known accelerated gradient algorithms. The proposed algorithms for online learning achieve better regret performance than the known randomized online coordinate descent algorithms. Furthermore, the proposed algorithms for stochastic optimization achieve convergence rates as good as those of the best known randomized coordinate descent algorithms. We also present simulation results to demonstrate the performance of the proposed algorithms.
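
For context, a plain (unaccelerated) randomized coordinate descent loop on a strongly convex quadratic; the proposed accelerated variants add momentum sequences on top of exactly this kind of single-coordinate update (their parameter schedules are omitted here). Each step touches one coordinate, which is the source of the low per-iteration cost.

```python
import numpy as np

rng = np.random.default_rng(7)
n = 100
G = rng.standard_normal((n, n))
Q = G @ G.T / n + np.eye(n)        # positive definite Hessian
b = rng.standard_normal(n)
x_star = np.linalg.solve(Q, b)
f = lambda x: 0.5 * x @ Q @ x - b @ x

x = np.zeros(n)
g = Q @ x - b                      # full gradient, kept up to date
for _ in range(20_000):
    i = rng.integers(n)            # uniform coordinate sampling
    delta = -g[i] / Q[i, i]        # exact minimisation along coordinate i
    x[i] += delta
    g += delta * Q[:, i]           # rank-one gradient update

print("objective gap:", f(x) - f(x_star))
```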
