国产成人精品三级在线,国产三级片视频欣赏,久久亚免费偷拍视频,偷偷久久久久久久久网站,久久久人人妻人人爽人人射

Semi-lagrangian schemes for discretization of the dynamic programming principle are based on a time discretization projected on a state-space grid. The use of a structured grid makes this approach not feasible for high-dimensional problems due to the curse of dimensionality. Here, we present a new approach for infinite horizon optimal control problems where the value function is computed using Radial Basis Functions (RBF) by the Shepard's moving least squares approximation method on scattered grids. We propose a new method to generate a scattered mesh driven by the dynamics and the selection of the shape parameter in the RBF using an optimization routine. This mesh will help to localize the problem and approximate the dynamic programming principle in high dimension. Error estimates for the value function are also provided. Numerical tests for high dimensional problems will show the effectiveness of the proposed method.

相關內容

價值函數

關注 0

優化器 · 正則化項 · Extensibility · 近似 · 生成模型 ·

2021 年 10 月 6 日

On Transportation of Mini-batches: A Hierarchical Approach

Khai Nguyen,Dang Nguyen,Quoc Nguyen,Tung Pham,Hung Bui,Dinh Phung,Trung Le,Nhat Ho

from arxiv, 43 pages, 17 figures

Mini-batch optimal transport (m-OT) has been successfully used in practical applications that involve probability measures with a very high number of supports. The m-OT solves several smaller optimal transport problems and then returns the average of their costs and transportation plans. Despite its scalability advantage, the m-OT does not consider the relationship between mini-batches which leads to undesirable estimation. Moreover, the m-OT does not approximate a proper metric between probability measures since the identity property is not satisfied. To address these problems, we propose a novel mini-batching scheme for optimal transport, named Batch of Mini-batches Optimal Transport (BoMb-OT), that finds the optimal coupling between mini-batches and it can be seen as an approximation to a well-defined distance on the space of probability measures. Furthermore, we show that the m-OT is a limit of the entropic regularized version of the BoMb-OT when the regularized parameter goes to infinity. Finally, we present the new algorithms of the BoMb-OT in various applications, such as deep generative models and deep domain adaptation. From extensive experiments, we observe that the BoMb-OT achieves a favorable performance in deep learning models such as deep generative models and deep domain adaptation. In other applications such as approximate Bayesian computation, color transfer, and gradient flow, the BoMb-OT also yields either a lower quantitative result or a better qualitative result than the m-OT.

優化器 · 泛函 · 方陣 · 控制器 · 歐氏空間 ·

2021 年 10 月 6 日

A Control-Theoretic Perspective on Optimal High-Order Optimization

Tianyi Lin,Michael. I. Jordan

from arxiv, Accepted by Mathematical Programming Series A; 45 pages

We provide a control-theoretic perspective on optimal tensor algorithms for minimizing a convex function in a finite-dimensional Euclidean space. Given a function $\Phi: \mathbb{R}^d \rightarrow \mathbb{R}$ that is convex and twice continuously differentiable, we study a closed-loop control system that is governed by the operators $\nabla \Phi$ and $\nabla^2 \Phi$ together with a feedback control law $\lambda(\cdot)$ satisfying the algebraic equation $(\lambda(t))^p\|\nabla\Phi(x(t))\|^{p-1} = \theta$ for some $\theta \in (0, 1)$. Our first contribution is to prove the existence and uniqueness of a local solution to this system via the Banach fixed-point theorem. We present a simple yet nontrivial Lyapunov function that allows us to establish the existence and uniqueness of a global solution under certain regularity conditions and analyze the convergence properties of trajectories. The rate of convergence is $O(1/t^{(3p+1)/2})$ in terms of objective function gap and $O(1/t^{3p})$ in terms of squared gradient norm. Our second contribution is to provide two algorithmic frameworks obtained from discretization of our continuous-time system, one of which generalizes the large-step A-HPE framework and the other of which leads to a new optimal $p$-th order tensor algorithm. While our discrete-time analysis can be seen as a simplification and generalization of~\citet{Monteiro-2013-Accelerated}, it is largely motivated by the aforementioned continuous-time analysis, demonstrating the fundamental role that the feedback control plays in optimal acceleration and the clear advantage that the continuous-time perspective brings to algorithmic design. A highlight of our analysis is that we show that all of the $p$-th order optimal tensor algorithms that we discuss minimize the squared gradient norm at a rate of $O(k^{-3p})$, which complements the recent analysis.

估計/估計量 · 離散化 · Conformer · 確切的 · 有向 ·

2021 年 10 月 5 日

Improved error estimates of hybridizable interior penalty methods using a variable penalty for highly anisotropic diffusion problems

Gregory Etangsale,Marwan Fahs,Vincent Fontaine,Nalitiana Rajaonison

from arxiv, 11 pages, 7 figures, 1 table

In this paper, we derive improved a priori error estimates for families of hybridizable interior penalty discontinuous Galerkin (H-IP) methods using a variable penalty for second-order elliptic problems. The strategy is to use a penalization function of the form $\mathcal{O}(1/h^{1+\delta})$, where $h$ denotes the mesh size and $\delta$ is a user-dependent parameter. We then quantify its direct impact on the convergence analysis, namely, the (strong) consistency, discrete coercivity, and boundedness (with $h^{\delta}$-dependency), and we derive updated error estimates for both discrete energy- and $L^{2}$-norms. The originality of the error analysis relies specifically on the use of conforming interpolants of the exact solution. All theoretical results are supported by numerical evidence.

控制器 · 推斷 · 泛函 · 學成 · Continuity ·

2021 年 10 月 4 日

Inverse Reinforcement Learning: A Control Lyapunov Approach

Samuel Tesfazgi,Armin Lederer,Sandra Hirche

from arxiv, This work has been accepted for presentation at, and publication in the proceedings of, the 2021 IEEE Conference on Decision and Control (CDC)

Inferring the intent of an intelligent agent from demonstrations and subsequently predicting its behavior, is a critical task in many collaborative settings. A common approach to solve this problem is the framework of inverse reinforcement learning (IRL), where the observed agent, e.g., a human demonstrator, is assumed to behave according to an intrinsic cost function that reflects its intent and informs its control actions. In this work, we reformulate the IRL inference problem to learning control Lyapunov functions (CLF) from demonstrations by exploiting the inverse optimality property, which states that every CLF is also a meaningful value function. Moreover, the derived CLF formulation directly guarantees stability of inferred control policies. We show the flexibility of our proposed method by learning from goal-directed movement demonstrations in a continuous environment.

優化器 · 估計/估計量 · Integration · CASE · 確切的 ·

2021 年 10 月 4 日

Analysis of nonconforming IFE methods and a new scheme for elliptic interface problems

Haifeng Ji,Feng Wang,Jinru Chen,Zhilin Li

In this paper, an important discovery has been found for nonconforming immersed finite element (IFE) methods using the integral values on edges as degrees of freedom for solving elliptic interface problems. We show that those IFE methods without penalties are not guaranteed to converge optimally if the tangential derivative of the exact solution and the jump of the coefficient are not zero on the interface. A nontrivial counter example is also provided to support our theoretical analysis. To recover the optimal convergence rates, we develop a new nonconforming IFE method with additional terms locally on interface edges. The new method is parameter-free which removes the limitation of the conventional partially penalized IFE method. We show the IFE basis functions are unisolvent on arbitrary triangles which is not considered in the literature. Furthermore, different from multipoint Taylor expansions, we derive the optimal approximation capabilities of both the Crouzeix-Raviart and the rotated-$Q_1$ IFE spaces via a unified approach which can handle the case of variable coefficients easily. Finally, optimal error estimates in both $H^1$- and $L^2$- norms are proved and confirmed with numerical experiments.

優化器 · 近似 · 控制器 · 離散化 · 分段 ·

2021 年 10 月 4 日

Discrete-time approximation for stochastic optimal control problems under the $G$-expectation framework

Lianzi Jiang

from arxiv, 20 pages

In this paper, we propose a class of discrete-time approximation schemes for stochastic optimal control problems under the $G$-expectation framework. The proposed schemes are constructed recursively based on piecewise constant policy. We prove the convergence of the discrete schemes and determine the convergence rates. Several numerical examples are presented to illustrate the effectiveness of the obtained results.

泛函 · 線性的 · 線性模型 · 統計量 · 再生核希爾伯特空間 ·

2021 年 10 月 3 日

A new RKHS-based global testing for functional linear model

Jianjun Xu,Wenquan Cui

from arxiv, there was an error

This article studies global testing of the slope function in functional linear regression model in the framework of reproducing kernel Hilbert space. We propose a new testing statistic based on smoothness regularization estimators. The asymptotic distribution of the testing statistic is established under null hypothesis. It is shown that the null asymptotic distribution is determined jointly by the reproducing kernel and the covariance function. Our theoretical analysis shows that the proposed testing is consistent over a class of smooth local alternatives. Despite the generality of the method of regularization, we show the procedure is easily implementable. Numerical examples are provided to demonstrate the empirical advantages over the competing methods.

離散化 · 拉格朗日乘子 · Performer · prototype · 相互獨立的 ·

2021 年 10 月 3 日

Fully implicit local time-stepping methods for advection-diffusion problems in mixed formulations

Thi-Thao-Phuong Hoang

from arxiv, 33 pages

This paper is concerned with numerical solution of transport problems in heterogeneous porous media. A semi-discrete continuous-in-time formulation of the linear advection-diffusion equation is obtained by using a mixed hybrid finite element method, in which the flux variable represents both the advective and diffusive flux, and the Lagrange multiplier arising from the hybridization is used for the discretization of the advective term. Based on global-in-time and nonoverlapping domain decomposition, we propose two implicit local time-stepping methods to solve the semi-discrete problem. The first method uses the time-dependent Steklov-Poincar\'e type operator and the second uses the optimized Schwarz waveform relaxation (OSWR) with Robin transmission conditions. For each method, we formulate a space-time interface problem which is solved iteratively. Each iteration involves solving the subdomain problems independently and globally in time; thus, different time steps can be used in the subdomains. The convergence of the fully discrete OSWR algorithm with nonmatching time grids is proved. Numerical results for problems with various Pecl\'et numbers and discontinuous coefficients, including a prototype for the simulation of the underground storage of nuclear waste, are presented to illustrate the performance of the proposed local time-stepping methods.

Performer · 近似 · 樣本 · Continuity · 協方差矩陣 ·

2021 年 10 月 1 日

Best Principal Submatrix Selection for the Maximum Entropy Sampling Problem: Scalable Algorithms and Performance Guarantees

Yongchun Li,Weijun Xie

from arxiv, 62 pages

This paper studies a classic maximum entropy sampling problem (MESP), which aims to select the most informative principal submatrix of a prespecified size from a covariance matrix. MESP has been widely applied to many areas, including healthcare, power system, manufacturing and data science. By investigating its Lagrangian dual and primal characterization, we derive a novel convex integer program for MESP and show that its continuous relaxation yields a near-optimal solution. The results motivate us to study an efficient sampling algorithm and develop its approximation bound for MESP, which improves the best-known bound in literature. We then provide an efficient deterministic implementation of the sampling algorithm with the same approximation bound. By developing new mathematical tools for the singular matrices and analyzing the Lagrangian dual of the proposed convex integer program, we investigate the widely-used local search algorithm and prove its first-known approximation bound for MESP. The proof techniques further inspire us with an efficient implementation of the local search algorithm. Our numerical experiments demonstrate that these approximation algorithms can efficiently solve medium-sized and large-scale instances to near-optimality. Our proposed algorithms are coded and released as open-source software. Finally, we extend the analyses to the A-Optimal MESP (A-MESP), where the objective is to minimize the trace of the inverse of the selected principal submatrix.

可約的 · 控制器 · 線性的 · MoDELS · Principle ·

2021 年 10 月 1 日

Dynamic Models of Spherical Parallel Robots for Model-Based Control Schemes

Ali Hassani,Abbas Bataleblu,S. A. Khalilpour,Hamid D. Taghirad,Philippe Cardou

from arxiv, 21 pages, 7 figures

In this paper, derivation of different forms of dynamic formulation of spherical parallel robots (SPRs) is investigated. These formulations include the explicit dynamic forms, linear regressor, and Slotine-Li (SL) regressor, which are required for the design and implementation of the vast majority of model-based controllers and dynamic parameters identification schemes. To this end, the implicit dynamic of SPRs is first formulated using the principle of virtual work in task-space, and then by using an extension, their explicit dynamic formulation is derived. The dynamic equation is then analytically reformulated into linear and S-L regression form with respect to the inertial parameters, and by using the Gauss-Jordan procedure, it is reduced to a unique and closed-form structure. Finally, to illustrate the effectiveness of the proposed method, two different SPRs, namely, the ARAS-Diamond, and the 3-RRR, are examined as the case studies. The obtained results are verified by using the MSC-ADAMS software, and are shared to interested audience for public access.