久久久久久久精品少妇9999_91超碰人妻偷情在线播放_最新2021国产精品国产精品_日本乱人伦片中文字幕三区_自拍亚洲一区制服丝袜欧美_欧美日韩一区_久久久久天天射久久久天天

We introduce the notion of universal graphs as a tool for constructing algorithms solving games of infinite duration such as parity games and mean payoff games. In the first part we develop the theory of universal graphs, with two goals: showing an equivalence and normalisation result between different recently introduced related models, and constructing generic value iteration algorithms for any positionally determined objective. In the second part we give four applications: to parity games, to mean payoff games, and to combinations of them (in the form of disjunctions of objectives). For each of these four cases we construct algorithms achieving or improving over the best known time and space complexity.

相關內容

圖

關注 6

近似 · 隨機變量 · 視覺識別系統 · Processing（編程語言） · 類別 ·

2021 年 10 月 12 日

A monotone scheme for nonlinear partial integro-differential equations with the convergence rate of $α$-stable limit theorem under sublinear expectation

Mingshang Hu,Lianzi Jiang,Gechun Liang

from arxiv, 24 pages

In this paper, we propose a monotone approximation scheme for a class of fully nonlinear partial integro-differential equations (PIDEs) which characterize the nonlinear $\alpha$-stable L\'{e}vy processes under sublinear expectation space with $\alpha \in(1,2)$. Two main results are obtained: (i) the error bounds for the monotone approximation scheme of nonlinear PIDEs, and (ii) the convergence rates of a generalized central limit theorem of Bayraktar-Munk for $\alpha$-stable random variables under sublinear expectation. Our proofs use and extend techniques introduced by Krylov and Barles-Jakobsen.

多重集 · 情景 · 連結 · 線性的 · MoDELS ·

2021 年 10 月 11 日

Free Commutative Monoids in Homotopy Type Theory

Vikraman Choudhury,Marcelo Fiore

from arxiv, Under submission

We develop a constructive theory of finite multisets, defining them as free commutative monoids in Homotopy Type Theory. We formalise two algebraic presentations of this construction using 1-HITs, establishing the universal property for each and thereby their equivalence. These presentations correspond to equational theories including a commutation axiom. In this setting, we prove important structural combinatorial properties of singleton multisets arising from concatenations and projections of multisets. This is done in generality, without assuming decidable equality on the carrier set. Further, as applications, we present a constructive formalisation of the relational model of differential linear logic and use it to characterise the equality type of multisets. This leads us to the introduction of a novel conditional equational presentation of the finite-multiset construction.

泛化理論 · Neural Networks · 確定性模型 · INFORMS · state-of-the-art ·

2021 年 10 月 11 日

A General Framework for the Disintegration of PAC-Bayesian Bounds

Paul Viallard,Pascal Germain,Amaury Habrard,Emilie Morvant

PAC-Bayesian bounds are known to be tight and informative when studying the generalization ability of randomized classifiers. However, when applied to some family of deterministic models such as neural networks, they require a loose and costly derandomization step. As an alternative to this step, we introduce new PAC-Bayesian generalization bounds that have the originality to provide disintegrated bounds, i.e., they give guarantees over one single hypothesis instead of the usual averaged analysis. Our bounds are easily optimizable and can be used to design learning algorithms. We illustrate the interest of our result on neural networks and show a significant practical improvement over the state-of-the-art framework.

貪心逐層預訓練 · 貪心 · 結構化學習 · 學成 · CC ·

2021 年 10 月 10 日

Structure learning in polynomial time: Greedy algorithms, Bregman information, and exponential families

Goutham Rajendran,Bohdan Kivva,Ming Gao,Bryon Aragam

from arxiv, 36 pages, 9 figures

Greedy algorithms have long been a workhorse for learning graphical models, and more broadly for learning statistical models with sparse structure. In the context of learning directed acyclic graphs, greedy algorithms are popular despite their worst-case exponential runtime. In practice, however, they are very efficient. We provide new insight into this phenomenon by studying a general greedy score-based algorithm for learning DAGs. Unlike edge-greedy algorithms such as the popular GES and hill-climbing algorithms, our approach is vertex-greedy and requires at most a polynomial number of score evaluations. We then show how recent polynomial-time algorithms for learning DAG models are a special case of this algorithm, thereby illustrating how these order-based algorithms can be rigourously interpreted as score-based algorithms. This observation suggests new score functions and optimality conditions based on the duality between Bregman divergences and exponential families, which we explore in detail. Explicit sample and computational complexity bounds are derived. Finally, we provide extensive experiments suggesting that this algorithm indeed optimizes the score in a variety of settings.

回合 · Performer · 參數空間 · 優化器 · 在線 ·

2021 年 10 月 10 日

Competitive Perimeter Defense of Conical Environments

Shivam Bajaj,Eric Torng,Shaunak D. Bopardikar,Alexander Von Moll,Isaac Weintraub,Eloy Garcia,David W. Casbeer

We consider a perimeter defense problem in a planar conical environment in which a single vehicle, having a finite capture radius, aims to defend a concentric perimeter from mobile intruders. The intruders are arbitrarily released at the circumference of the environment and move radially toward the perimeter with fixed speed. We present a competitive analysis approach to this problem by measuring the performance of multiple online algorithms for the vehicle against arbitrary inputs, relative to an optimal offline algorithm that has access to all future inputs. In particular, we first establish a necessary condition on the parameter space to guarantee finite competitiveness of any algorithm, and then characterize a parameter regime in which the competitive ratio is guaranteed to be at least 2 for any algorithm. We then design and analyze three online algorithms and characterize parameter regimes for which they have finite competitive ratios. Specifically, our first two algorithms are provably 1, and 2-competitive, respectively, whereas our third algorithm exhibits a finite competitive ratio that depends on the problem parameters. Finally, we provide numerous parameter space plots providing insights into the relative performance of our algorithms.

優化器 · 泛函 · 方陣 · 控制器 · 歐氏空間 ·

2021 年 10 月 9 日

A Control-Theoretic Perspective on Optimal High-Order Optimization

Tianyi Lin,Michael. I. Jordan

from arxiv, Accepted by Mathematical Programming Series A; 45 pages

We provide a control-theoretic perspective on optimal tensor algorithms for minimizing a convex function in a finite-dimensional Euclidean space. Given a function $\Phi: \mathbb{R}^d \rightarrow \mathbb{R}$ that is convex and twice continuously differentiable, we study a closed-loop control system that is governed by the operators $\nabla \Phi$ and $\nabla^2 \Phi$ together with a feedback control law $\lambda(\cdot)$ satisfying the algebraic equation $(\lambda(t))^p\|\nabla\Phi(x(t))\|^{p-1} = \theta$ for some $\theta \in (0, 1)$. Our first contribution is to prove the existence and uniqueness of a local solution to this system via the Banach fixed-point theorem. We present a simple yet nontrivial Lyapunov function that allows us to establish the existence and uniqueness of a global solution under certain regularity conditions and analyze the convergence properties of trajectories. The rate of convergence is $O(1/t^{(3p+1)/2})$ in terms of objective function gap and $O(1/t^{3p})$ in terms of squared gradient norm. Our second contribution is to provide two algorithmic frameworks obtained from discretization of our continuous-time system, one of which generalizes the large-step A-HPE framework and the other of which leads to a new optimal $p$-th order tensor algorithm. While our discrete-time analysis can be seen as a simplification and generalization of~\citet{Monteiro-2013-Accelerated}, it is largely motivated by the aforementioned continuous-time analysis, demonstrating the fundamental role that the feedback control plays in optimal acceleration and the clear advantage that the continuous-time perspective brings to algorithmic design. A highlight of our analysis is that we show that all of the $p$-th order optimal tensor algorithms that we discuss minimize the squared gradient norm at a rate of $O(k^{-3p})$, which complements the recent analysis.

可辨認的 · 情景 · 路徑 ·

2021 年 10 月 9 日

Zig-Zag Modules: Cosheaves and K-Theory

Ryan E. Grady,Anna Schenfisch

from arxiv, 26 pages, comments welcome

Persistence modules have a natural home in the setting of stratified spaces and constructible cosheaves. In this article, we first give explicit constructible cosheaves for common data-motivated persistence modules, namely, for modules that arise from zig-zag filtrations (including monotone filtrations), and for augmented persistence modules (which encode the data of instantaneous events). We then identify an equivalence of categories between a particular notion of zig-zag modules and the combinatorial entrance path category on stratified $\mathbb{R}$. Finally, we compute the algebraic $K$-theory of generalized zig-zag modules and describe connections to both Euler curves and $K_0$ of the monoid of persistence diagrams as described by Bubenik and Elchesen.

離散化 · 泛函 · 鞍點 · Lipschitz · Performer ·

2021 年 10 月 8 日

The Johnson-Nédélec FEM-BEM Coupling for magnetostatic problems in the isogeometric framework

Mehdi Elasmi,Christoph Erath,Stefan Kurz

We consider a Johnson-N\'ed\'elec FEM-BEM coupling, which is a direct and non-symmetric coupling of finite and boundary element methods, in order to solve interface problems for the magnetostatic Maxwell's equations with the magnetic vector potential ansatz. In the FEM-domain, equations may be non-linear, whereas they are exclusively linear in the BEM-part to guarantee the existence of a fundamental solution. First, the weak problem is formulated in quotient spaces to avoid resolving to a saddle point problem. Second, we establish in this setting well-posedness of the arising problem using the framework of Lipschitz and strongly monotone operators as well as a stability result for a special type of non-linearity, which is typically considered in magnetostatic applications. Then, the discretization is performed in the isogeometric context, i.e., the same type of basis functions that are used for geometry design are considered as ansatz functions for the discrete setting. In particular, NURBS are employed for geometry considerations, and B-Splines, which can be understood as a special type of NURBS, for analysis purposes. In this context, we derive a priori estimates w.r.t. h-refinement, and point out to an interesting behavior of BEM, which consists in an amelioration of the convergence rates, when a functional of the solution is evaluated in the exterior BEM-domain. This improvement may lead to a doubling of the convergence rate under certain assumptions. Finally, we end the paper with a numerical example to illustrate the theoretical results, along with a conclusion and an outlook.

塑造 · Continuity · MoDELS · 相似度 · 情景 ·

2021 年 10 月 8 日

A constructive theory of shape

Vladimir García-Morales

from arxiv, 19 pages, 10 figures. Discussion on a convexity property added. Accepted to Chaos. Sol. Fract

We formulate a theory of shape valid for objects of arbitrary dimension whose contours are path connected. We apply this theory to the design and modeling of viable trajectories of complex dynamical systems. Infinite families of qualitatively similar shapes are constructed giving as input a finite ordered set of characteristic points (landmarks) and the value of a continuous parameter $\kappa \in (0,\infty)$. We prove that all shapes belonging to the same family are located within the convex hull of the landmarks. The theory is constructive in the sense that it provides a systematic means to build a mathematical model for any shape taken from the physical world. We illustrate this with a variety of examples: (chaotic) time series, plane curves, space filling curves, knots and strange attractors.

樣本復雜度 · 策略搜索 · 估計/估計量 · 泛函 · 評論員 ·

2021 年 10 月 7 日

On the Sample Complexity of Actor-Critic Method for Reinforcement Learning with Function Approximation

Harshat Kumar,Alec Koppel,Alejandro Ribeiro

Reinforcement learning, mathematically described by Markov Decision Problems, may be approached either through dynamic programming or policy search. Actor-critic algorithms combine the merits of both approaches by alternating between steps to estimate the value function and policy gradient updates. Due to the fact that the updates exhibit correlated noise and biased gradient updates, only the asymptotic behavior of actor-critic is known by connecting its behavior to dynamical systems. This work puts forth a new variant of actor-critic that employs Monte Carlo rollouts during the policy search updates, which results in controllable bias that depends on the number of critic evaluations. As a result, we are able to provide for the first time the convergence rate of actor-critic algorithms when the policy search step employs policy gradient, agnostic to the choice of policy evaluation technique. In particular, we establish conditions under which the sample complexity is comparable to stochastic gradient method for non-convex problems or slower as a result of the critic estimation error, which is the main complexity bottleneck. These results hold in continuous state and action spaces with linear function approximation for the value function. We then specialize these conceptual results to the case where the critic is estimated by Temporal Difference, Gradient Temporal Difference, and Accelerated Gradient Temporal Difference. These learning rates are then corroborated on a navigation problem involving an obstacle, providing insight into the interplay between optimization and generalization in reinforcement learning.