
We propose and analyze a new dynamical system with \textit{a closed-loop control law} in a Hilbert space $\mathcal{H}$, aiming to shed light on the acceleration phenomenon for \textit{monotone inclusion} problems, a problem class that unifies a broad range of optimization, saddle point and variational inequality (VI) problems under a single framework. Given a maximal monotone operator $A: \mathcal{H} \rightrightarrows \mathcal{H}$, we study a closed-loop control system governed by the operator $I - (I + \lambda(t)A)^{-1}$, where $\lambda(\cdot)$ is tuned by solving the algebraic equation $\lambda(t)\|(I + \lambda(t)A)^{-1}x(t) - x(t)\|^{p-1} = \theta$ for some $\theta \in (0, 1)$. Our first contribution is to prove the existence and uniqueness of a global solution via the Cauchy-Lipschitz theorem. We present a Lyapunov function that allows us to establish the weak convergence of trajectories, as well as strong convergence results under additional conditions. We establish a global ergodic rate of $O(t^{-(p+1)/2})$ in terms of a gap function and a global pointwise rate of $O(t^{-p/2})$ in terms of a residue function. Local linear convergence is established in terms of a distance function under an error bound condition. Further, we provide an algorithmic framework based on an implicit discretization of our system in a Euclidean setting, generalizing the large-step HPE framework of~\citet{Monteiro-2012-Iteration}. While the discrete-time analysis is a simplification and generalization of the previous analysis for bounded domains, it is motivated by the aforementioned continuous-time analysis, illustrating the fundamental role that closed-loop control plays in acceleration for monotone inclusion. A highlight of our analysis is a set of new results concerning $p$-th order tensor algorithms for monotone inclusion problems, which complement the recent analysis for saddle point and VI problems.
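The closed-loop rule above can be made concrete in a minimal sketch (assumptions: $A$ is a linear monotone operator given by a matrix whose symmetric part is positive definite, and the scalar equation is solved by bisection; none of this is taken from the paper's implementation). The resolvent is then a linear solve, and the left-hand side $\lambda\,\|(I+\lambda A)^{-1}x - x\|^{p-1}$ is increasing in $\lambda$, so a root-finding pass recovers $\lambda(t)$:

```python
import numpy as np

def resolvent(A, lam, x):
    """J_lam(x) = (I + lam*A)^{-1} x for a linear operator A."""
    n = len(x)
    return np.linalg.solve(np.eye(n) + lam * A, x)

def solve_lambda(A, x, p=2, theta=0.5, lo=1e-8, hi=1e8, tol=1e-10):
    """Solve lam * ||J_lam(x) - x||^(p-1) = theta by geometric bisection;
    the left-hand side is increasing in lam, from 0 toward lam*||x||^(p-1)."""
    phi = lambda lam: lam * np.linalg.norm(resolvent(A, lam, x) - x) ** (p - 1)
    for _ in range(200):
        mid = np.sqrt(lo * hi)
        if phi(mid) < theta:
            lo = mid
        else:
            hi = mid
        if hi - lo < tol * lo:
            break
    return 0.5 * (lo + hi)

A = np.array([[2.0, 1.0], [-1.0, 2.0]])   # symmetric part is PD, hence monotone
x = np.array([1.0, -1.0])
lam = solve_lambda(A, x)                   # p = 2, theta = 0.5
residual = lam * np.linalg.norm(resolvent(A, lam, x) - x)
```

The hypothetical `solve_lambda` helper is only meant to show that the algebraic equation is cheap to resolve once the resolvent is available.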

Related content

In mathematics, a saddle point or minimax point is a point on the surface of the graph of a function at which the slopes (derivatives) in orthogonal directions are all zero, but which is not a local extremum of the function. A saddle point is a critical point that is a relative minimum along one axial direction (between peaks) and a relative maximum along the crossing axis.
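The textbook example of this definition is $f(x, y) = x^2 - y^2$ at the origin: the gradient vanishes there, while the Hessian has one positive and one negative eigenvalue, so the point is a minimum along $x$ and a maximum along $y$. A quick numeric check:

```python
import numpy as np

# f(x, y) = x^2 - y^2: gradient and (constant) Hessian
grad = lambda x, y: np.array([2 * x, -2 * y])
H = np.array([[2.0, 0.0], [0.0, -2.0]])

g0 = grad(0.0, 0.0)              # gradient is zero -> critical point
eigs = np.linalg.eigvalsh(H)     # opposite-sign eigenvalues -> saddle, not extremum
```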

Like many other biological processes, calcium dynamics in neurons containing an endoplasmic reticulum are governed by diffusion-reaction equations on interface-separated domains. Interface conditions are typically described by systems of ordinary differential equations that provide fluxes across the interfaces. Using the calcium model as an example of this class of ODE-flux boundary interface problems, we prove the existence, uniqueness and boundedness of the solution by applying the comparison theorem, the fundamental solution of the parabolic operator, and a strategy used in Picard's existence theorem. We then propose and analyze an efficient implicit-explicit finite element scheme that is implicit for the parabolic operator and explicit for the nonlinear terms. We show that the stability does not depend on the spatial mesh size, and we obtain the optimal convergence rate in the $H^1$ norm. Numerical experiments illustrate the theoretical results.
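The implicit-explicit splitting idea can be sketched on a much simpler stand-in problem (an assumed 1D finite-difference analogue, not the paper's FEM scheme or calcium model): $u_t = D u_{xx} + u(1-u)$ with homogeneous Dirichlet conditions, treating diffusion with backward Euler and the nonlinear reaction explicitly, so each step is one linear tridiagonal solve with no nonlinear iteration:

```python
import numpy as np

D, N, dt, steps = 0.1, 50, 1e-3, 200
h = 1.0 / N
x = np.linspace(0, 1, N + 1)
u = np.sin(np.pi * x)                          # initial condition, zero on boundary

# matrix (I - dt*D*L) on interior nodes, L = second-difference operator
main = (1 + 2 * dt * D / h**2) * np.ones(N - 1)
off = (-dt * D / h**2) * np.ones(N - 2)
M = np.diag(main) + np.diag(off, 1) + np.diag(off, -1)

for _ in range(steps):
    rhs = u[1:-1] + dt * u[1:-1] * (1 - u[1:-1])   # explicit nonlinear reaction
    u[1:-1] = np.linalg.solve(M, rhs)              # implicit diffusion
```

Because only the stiff diffusion term is implicit, the linear system is the same at every step, which is the efficiency the abstract alludes to.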

In this paper, a numerical scheme for a nonlinear McKendrick-von Foerster equation with diffusion in age (MV-D) with the Dirichlet boundary condition is proposed. The main idea behind the scheme is to apply a discretization based on the method of characteristics to the convection part and the finite difference method to the remaining terms. The nonlocal terms are handled by quadrature methods. As a result, an implicit scheme is obtained for the boundary value problem under consideration. The consistency and convergence of the proposed numerical scheme are established. Moreover, numerical simulations are presented to validate the theoretical results.
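To illustrate the quadrature treatment of the nonlocal terms (with assumed notation, since the abstract does not fix one): in age-structured models the renewal term has the form $B(t) = \int_0^{a_{\max}} \beta(a)\,u(a,t)\,da$, which the scheme replaces by a quadrature rule on the age grid. A trapezoidal-rule sketch with hypothetical $\beta$ and $u$:

```python
import numpy as np

a_max, N = 10.0, 200
a = np.linspace(0.0, a_max, N + 1)
beta = np.exp(-a)                 # hypothetical fertility rate beta(a)
u = np.exp(-0.5 * a)              # hypothetical age density at one time level

f = beta * u                      # integrand e^{-1.5 a}
B = np.sum(0.5 * (f[1:] + f[:-1]) * np.diff(a))   # composite trapezoidal rule
exact = (1 - np.exp(-1.5 * a_max)) / 1.5          # closed-form integral
```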

We consider an adaptive multiresolution-based lattice Boltzmann scheme, which we have recently introduced and studied from the perspective of error control and the theory of equivalent equations. This numerical strategy leads to high compression rates and error control, and its high accuracy has been demonstrated on uniform and dynamically adaptive grids. However, one key issue with non-uniform meshes in the framework of lattice Boltzmann schemes is to properly handle acoustic waves passing through a level jump of the grid, which usually yields spurious effects, in particular reflected waves. In this paper, we propose a simple one-dimensional test-case for the linear wave equation with a fixed adapted mesh characterized by a potentially large level jump. We investigate this configuration with our original strategy and prove that we can handle and control the amplitude of the reflected wave, which is of fourth order in the space step of the finest mesh. Numerical illustrations show that the proposed strategy outperforms the existing methods in the literature and allow us to assess the ability of the method to handle the mesh jump properly.
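For readers unfamiliar with the scheme family, here is a minimal D1Q2 lattice Boltzmann sketch for linear advection $u_t + c\,u_x = 0$ on a *uniform* periodic grid (an assumed illustration only; the paper's adaptive multiresolution and level-jump treatment are not reproduced). Two distributions stream at lattice speeds $\pm\lambda$ and relax toward an advection equilibrium:

```python
import numpy as np

N, c, lam, omega, steps = 128, 0.5, 1.0, 1.5, 100
x = np.arange(N) / N
u = np.exp(-200 * (x - 0.5) ** 2)          # initial Gaussian pulse

# distributions moving at +lam and -lam, initialized at equilibrium
fp = 0.5 * u * (1 + c / lam)
fm = 0.5 * u * (1 - c / lam)

mass0 = (fp + fm).sum()
for _ in range(steps):
    rho = fp + fm
    # collision: relax toward the advection equilibrium (conserves rho)
    fp += omega * (0.5 * rho * (1 + c / lam) - fp)
    fm += omega * (0.5 * rho * (1 - c / lam) - fm)
    # streaming: shift one cell per step in each direction (periodic)
    fp = np.roll(fp, 1)
    fm = np.roll(fm, -1)
u_final = fp + fm
```

On this uniform mesh the pulse advects cleanly; the difficulty the abstract addresses is precisely what happens to such a wave when the streaming step crosses a grid-level jump.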

We couple the L1 discretization of the Caputo derivative in time with a spectral Galerkin method in space to devise a scheme that solves quasilinear subdiffusion equations. Both the diffusivity and the source are allowed to be nonlinear functions of the solution. We prove the method's stability and convergence, with spectral accuracy in space; the temporal order depends on the solution's regularity in time. Further, we support our results with numerical simulations that utilize parallelism for the spatial discretization. Moreover, as a side result we find asymptotically exact values of the error constants, along with their remainders, for discretizations of the Caputo derivative and fractional integrals. These constants are the smallest possible, improving previously established results from the literature.
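The L1 discretization itself is short enough to sketch. For $\alpha \in (0,1)$ it approximates the Caputo derivative by $D^\alpha u(t_n) \approx \frac{\Delta t^{-\alpha}}{\Gamma(2-\alpha)} \sum_{k=0}^{n-1} b_k\,(u_{n-k} - u_{n-k-1})$ with weights $b_k = (k+1)^{1-\alpha} - k^{1-\alpha}$; since the scheme interpolates $u$ piecewise-linearly, it is exact for $u(t) = t$, whose Caputo derivative is $t^{1-\alpha}/\Gamma(2-\alpha)$ (a standard sanity check, not taken from the paper):

```python
import numpy as np
from math import gamma

def l1_caputo(u, dt, alpha):
    """L1 approximation of the Caputo derivative of order alpha at t_n = n*dt,
    given samples u = [u_0, ..., u_n]."""
    n = len(u) - 1
    k = np.arange(n)
    b = (k + 1) ** (1 - alpha) - k ** (1 - alpha)
    diffs = u[1:][::-1] - u[:-1][::-1]       # u_{n-k} - u_{n-k-1} for k = 0..n-1
    return dt ** (-alpha) / gamma(2 - alpha) * np.dot(b, diffs)

alpha, dt, n = 0.5, 0.01, 100
t = dt * np.arange(n + 1)
approx = l1_caputo(t, dt, alpha)                  # u(t) = t, evaluated at t_n = 1
exact = t[-1] ** (1 - alpha) / gamma(2 - alpha)
```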

We consider infinite-horizon discounted Markov decision problems with finite state and action spaces. We show that with direct parametrization in the policy space, the weighted value function, although non-convex in general, is both quasi-convex and quasi-concave. While quasi-convexity helps explain the convergence of policy gradient methods to global optima, quasi-concavity hints at their convergence guarantees using arbitrarily large step sizes that are not dictated by the Lipschitz constant characterizing smoothness of the value function. In particular, we show that when using geometrically increasing step sizes, a general class of policy mirror descent methods, including the natural policy gradient method and a projected Q-descent method, all enjoy a linear rate of convergence without relying on entropy or other strongly convex regularization. In addition, we develop a theory of weak gradient-mapping dominance and use it to prove a sharper sublinear convergence rate of the projected policy gradient method. Finally, we also analyze the convergence rate of an inexact policy mirror descent method and estimate its sample complexity under a simple generative model.
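A toy illustration of the geometric step-size schedule (an assumed setup, not the paper's experiments): policy mirror descent with the KL divergence reduces to multiplicative-weights updates $\pi(a|s) \propto \pi(a|s)\exp(\eta_k Q^\pi(s,a))$, and we take $\eta_k = 2^k$ with exact policy evaluation on a tiny random 2-state, 2-action discounted MDP, comparing against the value-iteration optimum:

```python
import numpy as np

rng = np.random.default_rng(0)
S, A, gam = 2, 2, 0.9
P = rng.dirichlet(np.ones(S), size=(S, A))   # P[s, a] = next-state distribution
R = rng.random((S, A))                        # rewards in [0, 1]

def evaluate(pi):
    """Exact policy evaluation: V = (I - gam*P_pi)^{-1} R_pi, Q = R + gam*P V."""
    Ppi = np.einsum('sa,san->sn', pi, P)
    Rpi = (pi * R).sum(axis=1)
    V = np.linalg.solve(np.eye(S) - gam * Ppi, Rpi)
    return V, R + gam * P @ V

pi = np.full((S, A), 1.0 / A)
for k in range(25):
    _, Q = evaluate(pi)
    # KL mirror step with eta = 2^k; subtracting the row max keeps exp stable
    pi = pi * np.exp((2.0 ** k) * (Q - Q.max(axis=1, keepdims=True)))
    pi /= pi.sum(axis=1, keepdims=True)
Vpmd, _ = evaluate(pi)

Vstar = np.zeros(S)                           # value-iteration baseline
for _ in range(2000):
    Vstar = (R + gam * P @ Vstar).max(axis=1)
```

As the step size grows, the mirror step approaches the greedy policy-improvement step, which is one intuition for the linear rate.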

We study the bilinearly coupled minimax problem: $\min_{x} \max_{y} f(x) + y^\top A x - h(y)$, where $f$ and $h$ are both strongly convex smooth functions and admit first-order gradient oracles. Surprisingly, no known first-order algorithms have hitherto achieved the lower complexity bound of $\Omega((\sqrt{\frac{L_x}{\mu_x}} + \frac{\|A\|}{\sqrt{\mu_x \mu_y}} + \sqrt{\frac{L_y}{\mu_y}}) \log(\frac1{\varepsilon}))$ for solving this problem up to an $\varepsilon$ primal-dual gap in the general parameter regime, where $L_x, L_y,\mu_x,\mu_y$ are the corresponding smoothness and strong convexity constants. We close this gap by devising the first optimal algorithm, the Lifted Primal-Dual (LPD) method. Our method lifts the objective into an extended form that allows both the smooth terms and the bilinear term to be handled optimally and seamlessly with the same primal-dual framework. Besides optimality, our method yields a desirably simple single-loop algorithm that uses only one gradient oracle call per iteration. Moreover, when $f$ is just convex, the same algorithm applied to a smoothed objective achieves the nearly optimal iteration complexity. We also provide a direct single-loop algorithm, using the LPD method, that achieves the iteration complexity of $O(\sqrt{\frac{L_x}{\varepsilon}} + \frac{\|A\|}{\sqrt{\mu_y \varepsilon}} + \sqrt{\frac{L_y}{\varepsilon}})$. Numerical experiments on quadratic minimax problems and policy evaluation problems further demonstrate the fast convergence of our algorithm in practice.
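For context, a baseline sketch of the problem class (standard primal-dual hybrid gradient, *not* the LPD method): take the simple quadratics $f(x) = \frac{\mu_x}{2}\|x\|^2$ and $h(y) = \frac{\mu_y}{2}\|y\|^2$, whose unique saddle point is $(0,0)$, and iterate proximal steps with $\tau\sigma\|A\|^2 < 1$; the proximal maps of quadratics are closed form:

```python
import numpy as np

rng = np.random.default_rng(1)
n, mu_x, mu_y = 5, 1.0, 1.0
A = rng.standard_normal((n, n))
tau = sigma = 0.9 / np.linalg.norm(A, 2)   # ensures tau*sigma*||A||^2 = 0.81 < 1

x = rng.standard_normal(n)
y = rng.standard_normal(n)
for _ in range(500):
    x_new = (x - tau * A.T @ y) / (1 + tau * mu_x)              # prox of tau*f
    y = (y + sigma * A @ (2 * x_new - x)) / (1 + sigma * mu_y)  # prox of sigma*h
    x = x_new
```

This baseline converges linearly on such instances but does not attain the lower bound in the general parameter regime, which is the gap the LPD method closes.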

Controllable generation is one of the key requirements for successful adoption of deep generative models in real-world applications, but it still remains a great challenge. In particular, the compositional ability to generate novel concept combinations is out of reach for most current models. In this work, we use energy-based models (EBMs) to handle compositional generation over a set of attributes. To make them scalable to high-resolution image generation, we introduce an EBM in the latent space of a pre-trained generative model such as StyleGAN. We propose a novel EBM formulation representing the joint distribution of data and attributes together, and we show how sampling from it is formulated as solving an ordinary differential equation (ODE). Given a pre-trained generator, all we need for controllable generation is to train an attribute classifier. Sampling with ODEs is done efficiently in the latent space and is robust to hyperparameters. Thus, our method is simple, fast to train, and efficient to sample. Experimental results show that our method outperforms the state-of-the-art in both conditional sampling and sequential editing. In compositional generation, our method excels at zero-shot generation of unseen attribute combinations. Also, by composing energy functions with logical operators, this work is the first to achieve such compositionality in generating photo-realistic images of resolution 1024x1024.
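A stripped-down illustration of energy-guided sampling (assumed toy, not the paper's sampler or energy): move a latent code along the gradient-flow ODE $\dot z = -\nabla E(z)$ for a quadratic energy $E(z) = \frac12\|z - \mu\|^2$, integrated with forward Euler; in the paper's setting $E$ would instead combine the latent prior with attribute-classifier terms:

```python
import numpy as np

mu = np.array([1.0, -2.0])                  # hypothetical low-energy target
energy = lambda z: 0.5 * np.sum((z - mu) ** 2)
grad = lambda z: z - mu

z = np.array([5.0, 5.0])                    # initial latent code
e0 = energy(z)
for _ in range(100):
    z = z - 0.1 * grad(z)                   # forward-Euler step of the ODE
```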

We present and analyze a momentum-based gradient method for training linear classifiers with an exponentially-tailed loss (e.g., the exponential or logistic loss), which maximizes the classification margin on separable data at a rate of $\widetilde{\mathcal{O}}(1/t^2)$. This contrasts with a rate of $\mathcal{O}(1/\log(t))$ for standard gradient descent, and $\mathcal{O}(1/t)$ for normalized gradient descent. This momentum-based method is derived via the convex dual of the maximum-margin problem, and specifically by applying Nesterov acceleration to this dual, which yields a simple and intuitive method in the primal. This dual view can also be used to derive a stochastic variant, which performs adaptive non-uniform sampling via the dual variables.
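The acceleration mechanism invoked here can be seen on an assumed toy problem (not the paper's margin-maximization dual): Nesterov momentum versus plain gradient descent on a smooth convex quadratic $f(w) = \frac12 w^\top H w$, comparing suboptimality after the same number of iterations:

```python
import numpy as np

rng = np.random.default_rng(2)
B = rng.standard_normal((20, 20))
H = B.T @ B + 1e-3 * np.eye(20)           # PSD, ill-conditioned
L = np.linalg.norm(H, 2)                  # smoothness constant
f = lambda w: 0.5 * w @ H @ w             # minimum value is 0
w0 = rng.standard_normal(20)

# plain gradient descent with step 1/L
w = w0.copy()
for _ in range(300):
    w -= (1 / L) * (H @ w)
gd_gap = f(w)

# Nesterov's accelerated gradient (convex schedule t/(t+3))
w, w_prev = w0.copy(), w0.copy()
for t in range(300):
    v = w + (t / (t + 3)) * (w - w_prev)  # momentum extrapolation
    w_prev = w
    w = v - (1 / L) * (H @ v)
agd_gap = f(w)
```

The accelerated iterate reaches a much smaller objective gap, mirroring the $\widetilde{\mathcal{O}}(1/t^2)$ versus slower rates in the abstract.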

We study the problem of learning in the stochastic shortest path (SSP) setting, where an agent seeks to minimize the expected cost accumulated before reaching a goal state. We design a novel model-based algorithm EB-SSP that carefully skews the empirical transitions and perturbs the empirical costs with an exploration bonus to guarantee both optimism and convergence of the associated value iteration scheme. We prove that EB-SSP achieves the minimax regret rate $\widetilde{O}(B_{\star} \sqrt{S A K})$, where $K$ is the number of episodes, $S$ is the number of states, $A$ is the number of actions and $B_{\star}$ bounds the expected cumulative cost of the optimal policy from any state, thus closing the gap with the lower bound. Interestingly, EB-SSP obtains this result while being parameter-free, i.e., it does not require any prior knowledge of $B_{\star}$, nor of $T_{\star}$ which bounds the expected time-to-goal of the optimal policy from any state. Furthermore, we illustrate various cases (e.g., positive costs, or general costs when an order-accurate estimate of $T_{\star}$ is available) where the regret only contains a logarithmic dependence on $T_{\star}$, thus yielding the first horizon-free regret bound beyond the finite-horizon MDP setting.
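To fix intuition for the SSP objective (an assumed toy instance, without the exploration bonuses or skewed transitions of EB-SSP): value iteration for a deterministic chain SSP with states $0,\dots,4$, goal state $4$, actions left/right, and unit cost per step pins $V(\text{goal}) = 0$ and converges to the hitting cost $V(s) = 4 - s$:

```python
import numpy as np

S, goal = 5, 4
V = np.zeros(S)
for _ in range(100):
    Vnew = np.zeros(S)
    for s in range(S - 1):                    # the goal state stays at cost 0
        left, right = max(s - 1, 0), min(s + 1, S - 1)
        Vnew[s] = 1 + min(V[left], V[right])  # unit cost plus best successor
    V = Vnew
```

EB-SSP runs a perturbed, optimistic version of exactly this kind of Bellman backup on empirical quantities.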

In this paper, we study the optimal convergence rates for distributed convex optimization problems over networks. We model the communication restrictions imposed by the network as a set of affine constraints and provide optimal complexity bounds for four different setups, namely when the function $F(x) \triangleq \sum_{i=1}^{m} f_i(x)$ is: (i) strongly convex and smooth, (ii) strongly convex, (iii) smooth, or (iv) just convex. Our results show that Nesterov's accelerated gradient descent on the dual problem can be executed in a distributed manner and obtains the same optimal rates as in the centralized version of the problem (up to constant or logarithmic factors) with an additional cost related to the spectral gap of the interaction matrix. Finally, we discuss some extensions of the proposed setup, such as proximal-friendly functions, time-varying graphs, and improvement of the condition numbers.
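The dual approach can be sketched on an assumed toy consensus instance: minimize $\sum_i \frac12 (x_i - a_i)^2$ subject to the constraint $W x = 0$, where $W$ is a graph Laplacian ($Wx = 0$ iff all $x_i$ agree on a connected graph). Plain dual (gradient) ascent is shown below; the paper accelerates this dual with Nesterov's method, paying the spectral-gap-related cost:

```python
import numpy as np

a = np.array([1.0, 3.0, 5.0, 7.0])            # local targets; optimum is mean(a)
# Laplacian of the path graph 1-2-3-4
W = np.array([[ 1, -1,  0,  0],
              [-1,  2, -1,  0],
              [ 0, -1,  2, -1],
              [ 0,  0, -1,  1]], dtype=float)

lam = np.zeros(4)                              # dual variable
for _ in range(3000):
    x = a - W @ lam                            # primal minimizer of the Lagrangian
    lam = lam + 0.1 * (W @ x)                  # dual ascent on the constraint W x = 0
```

Both updates only require communication with graph neighbors (multiplication by $W$), which is why the dual method admits a distributed implementation.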
