亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<li id='ucaMk'></li>

_{^{<dd id='SEzpA'><tbody id='2hIey'><td id='nXNYv'><optgroup id='rOyqL'><strong id='Z2C3O'></strong></optgroup><address id='zjjqM'><ul id='HI1p3'></ul></address><big id='1sSs9'></big></td><table id='7ap6T'></table></tbody><pre id='7eVzr'></pre></dd><span id='CHOzH'><b id='1bGLb'></b></span>}}


<dfn id='ZRlPA'><optgroup id='ZhjjW'></optgroup></dfn><tfoot id='cXL6J'><bdo id='ddT4G'><div id='llECA'></div><i id='iLE2d'><dt id='9nZZw'></dt></i></bdo></tfoot>

_{<fieldset id='3mGiA'></fieldset>}

·

離散化 · 線性的 · 優化器 · 二次規劃 · Microsoft Surface ·

2022 年 12 月 31 日

A Linear Time Algorithm for the Optimal Discrete IRS Beamforming

Shuyi Ren,Kaiming Shen,Xin Li,Xin Chen,Zhi-Quan Luo

from arxiv, 5 pages

It remains an open problem to find the optimal configuration of phase shifts under the discrete constraint for intelligent reflecting surface (IRS) in polynomial time. The above problem is widely believed to be difficult because it is not linked to any known combinatorial problems that can be solved efficiently. The branch-and-bound algorithms and the approximation algorithms constitute the best results in this area. Nevertheless, this work shows that the global optimum can actually be reached in linear time on average in terms of the number of reflective elements (REs) of IRS. The main idea is to geometrically interpret the discrete beamforming problem as choosing the optimal point on the unit circle. Although the number of possible combinations of phase shifts grows exponentially with the number of REs, it turns out that there are only a linear number of circular arcs that possibly contain the optimal point. Furthermore, the proposed algorithm can be viewed as a novel approach to a special case of the discrete quadratic program (QP).

相關內容

離散化

離(li)散化(hua)

Minimax · 線性的 · Markov · 泛函 · 優化器 ·

2023 年 3 月 1 日

Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game

Wei Xiong,Han Zhong,Chengshuai Shi,Cong Shen,Liwei Wang,Tong Zhang

Offline reinforcement learning (RL) aims at learning an optimal strategy using a pre-collected dataset without further interactions with the environment. While various algorithms have been proposed for offline RL in the previous literature, the minimax optimality has only been (nearly) established for tabular Markov decision processes (MDPs). In this paper, we focus on offline RL with linear function approximation and propose a new pessimism-based algorithm for offline linear MDP. At the core of our algorithm is the uncertainty decomposition via a reference function, which is new in the literature of offline RL under linear function approximation. Theoretical analysis demonstrates that our algorithm can match the performance lower bound up to logarithmic factors. We also extend our techniques to the two-player zero-sum Markov games (MGs), and establish a new performance lower bound for MGs, which tightens the existing result, and verifies the nearly minimax optimality of the proposed algorithm. To the best of our knowledge, these are the first computationally efficient and nearly minimax optimal algorithms for offline single-agent MDPs and MGs with linear function approximation.

Integration · 控制器 · 重要性采樣 · Performer · 優化器 ·

2023 年 3 月 1 日

Model Predictive Optimized Path Integral Strategies

Dylan M. Asmar,Ransalu Senanayake,Shawn Manuel,Mykel J. Kochenderfer

from arxiv, Repository: //github.com/sisl/MPOPIS. Accepted to ICRA 2023

We generalize the derivation of model predictive path integral control (MPPI) to allow for a single joint distribution across controls in the control sequence. This reformation allows for the implementation of adaptive importance sampling (AIS) algorithms into the original importance sampling step while still maintaining the benefits of MPPI such as working with arbitrary system dynamics and cost functions. The benefit of optimizing the proposal distribution by integrating AIS at each control step is demonstrated in simulated environments including controlling multiple cars around a track. The new algorithm is more sample efficient than MPPI, achieving better performance with fewer samples. This performance disparity grows as the dimension of the action space increases. Results from simulations suggest the new algorithm can be used as an anytime algorithm, increasing the value of control at each iteration versus relying on a large set of samples.

AdaGrad · 優化器 · Lipschitz · 生成方法 · 線性的 ·

2023 年 2 月 28 日

High Probability Convergence of Stochastic Gradient Methods

Zijian Liu,Ta Duy Nguyen,Thien Hang Nguyen,Alina Ene,Huy Lê Nguyen

from arxiv, This paper subsumes arXiv paper arxiv:2210.00679

In this work, we describe a generic approach to show convergence with high probability for both stochastic convex and non-convex optimization with sub-Gaussian noise. In previous works for convex optimization, either the convergence is only in expectation or the bound depends on the diameter of the domain. Instead, we show high probability convergence with bounds depending on the initial distance to the optimal solution. The algorithms use step sizes analogous to the standard settings and are universal to Lipschitz functions, smooth functions, and their linear combinations. This method can be applied to the non-convex case. We demonstrate an $O((1+\sigma^{2}\log(1/\delta))/T+\sigma/\sqrt{T})$ convergence rate when the number of iterations $T$ is known and an $O((1+\sigma^{2}\log(T/\delta))/\sqrt{T})$ convergence rate when $T$ is unknown for SGD, where $1-\delta$ is the desired success probability. These bounds improve over existing bounds in the literature. Additionally, we demonstrate that our techniques can be used to obtain high probability bound for AdaGrad-Norm (Ward et al., 2019) that removes the bounded gradients assumption from previous works. Furthermore, our technique for AdaGrad-Norm extends to the standard per-coordinate AdaGrad algorithm (Duchi et al., 2011), providing the first noise-adapted high probability convergence for AdaGrad.

泛化理論 · 泛化誤差 · 優化器 · 泛化誤差上界 · 極大 ·

2023 年 2 月 28 日

Asymptotically Optimal Generalization Error Bounds for Noisy, Iterative Algorithms

Ibrahim Issa,Amedeo Roberto Esposito,Michael Gastpar

from arxiv, Submitted to COLT 2023

We adopt an information-theoretic framework to analyze the generalization behavior of the class of iterative, noisy learning algorithms. This class is particularly suitable for study under information-theoretic metrics as the algorithms are inherently randomized, and it includes commonly used algorithms such as Stochastic Gradient Langevin Dynamics (SGLD). Herein, we use the maximal leakage (equivalently, the Sibson mutual information of order infinity) metric, as it is simple to analyze, and it implies both bounds on the probability of having a large generalization error and on its expected value. We show that, if the update function (e.g., gradient) is bounded in $L_2$-norm, then adding isotropic Gaussian noise leads to optimal generalization bounds: indeed, the input and output of the learning algorithm in this case are asymptotically statistically independent. Furthermore, we demonstrate how the assumptions on the update function affect the optimal (in the sense of minimizing the induced maximal leakage) choice of the noise. Finally, we compute explicit tight upper bounds on the induced maximal leakage for several scenarios of interest.

線性的 · 優化器 · 分離的 · 正則化項 · 稀疏 ·

2023 年 2 月 28 日

An active-set method for sparse approximations. Part I: Separable $\ell_1$ terms

Spyridon Pougkakiotis,Jacek Gondzio,Dionysios S. Kalogerias

In this paper we present an active-set method for the solution of $\ell_1$-regularized convex quadratic optimization problems. It is derived by combining a proximal method of multipliers (PMM) strategy with a standard semismooth Newton method (SSN). The resulting linear systems are solved using a Krylov-subspace method, accelerated by certain general-purpose preconditioners which are shown to be optimal with respect to the proximal parameters. Practical efficiency is further improved by warm-starting the algorithm using a proximal alternating direction method of multipliers. We show that the outer PMM achieves global convergence under mere feasibility assumptions. Under additional standard assumptions, the PMM scheme achieves global linear and local superlinear convergence. The SSN scheme is locally superlinearly convergent, assuming that its associated linear systems are solved accurately enough, and globally convergent under certain additional regularity assumptions. We provide numerical evidence to demonstrate the effectiveness of the approach by comparing it against OSQP and IP-PMM (an ADMM and a regularized IPM solver, respectively) on several elastic-net linear regression and $L^1$-regularized PDE-constrained optimization problems.

優化器 · Analysis · 標量 · Continuity · 離散化 ·

2023 年 2 月 27 日

On the Calculation of the Brinkman Penalization Term in Density-Based Topology Optimization of Fluid-Dependent Problems

Mohamed Abdelhamid,Aleksander Czekanski

In topology optimization of fluid-dependent problems, there is a need to interpolate within the design domain between fluid and solid in a continuous fashion. In density-based methods, the concept of inverse permeability in the form of a volumetric force is utilized to enforce zero fluid velocity in non-fluid regions. This volumetric force consists of a scalar term multiplied by the fluid velocity. This scalar term takes a value between two limits as determined by a convex interpolation function. The maximum inverse permeability limit is typically chosen through a trial and error analysis of the initial form of the optimization problem; such that the fields resolved resemble those obtained through an analysis of a pure fluid domain with a body-fitted mesh. In this work, we investigate the dependency of the maximum inverse permeability limit on the mesh size and the flow conditions through analyzing the Navier-Stokes equation in its strong as well as discretized finite element forms. We use numerical experiments to verify and characterize these dependencies.

Analysis · 泛函 · 可約的 · 漢明距離 · 查準率/準確率 ·

2023 年 2 月 27 日

Runtime Analysis for Permutation-based Evolutionary Algorithms

Benjamin Doerr,Yassine Ghannane,Marouane Ibn Brahim

from arxiv, Journal version of our paper at GECCO 2022. 51 pages. arXiv admin note: substantial text overlap with arXiv:2204.07637

While the theoretical analysis of evolutionary algorithms (EAs) has made significant progress for pseudo-Boolean optimization problems in the last 25 years, only sporadic theoretical results exist on how EAs solve permutation-based problems. To overcome the lack of permutation-based benchmark problems, we propose a general way to transfer the classic pseudo-Boolean benchmarks into benchmarks defined on sets of permutations. We then conduct a rigorous runtime analysis of the permutation-based $(1+1)$ EA proposed by Scharnow, Tinnefeld, and Wegener (2004) on the analogues of the LeadingOnes and Jump benchmarks. The latter shows that, different from bit-strings, it is not only the Hamming distance that determines how difficult it is to mutate a permutation $\sigma$ into another one $\tau$, but also the precise cycle structure of $\sigma \tau^{-1}$. For this reason, we also regard the more symmetric scramble mutation operator. We observe that it not only leads to simpler proofs, but also reduces the runtime on jump functions with odd jump size by a factor of $\Theta(n)$. Finally, we show that a heavy-tailed version of the scramble operator, as in the bit-string case, leads to a speed-up of order $m^{\Theta(m)}$ on jump functions with jump size $m$. A short empirical analysis confirms these findings, but also reveals that small implementation details like the rate of void mutations can make an important difference.

PCA · Projection · 優化器 · 極大 · 相似度 ·

2023 年 2 月 24 日

On Finite-Step Convergence of the Non-Greedy Algorithm for $L_1$-Norm PCA and Beyond

The non-greedy algorithm for $L_1$-norm PCA proposed in \cite{nie2011robust} is revisited and its convergence properties are studied. The algorithm is first interpreted as a conditional subgradient or an alternating maximization method. By treating it as a conditional subgradient, the iterative points generated by the algorithm will not change in finitely many steps under a certain full-rank assumption; such an assumption can be removed when the projection dimension is one. By treating the algorithm as an alternating maximization, it is proved that the objective value will not change after at most $\left\lceil \frac{F^{\max}}{\tau_0} \right\rceil$ steps. The stopping point satisfies certain optimality conditions. Then, a variant algorithm with improved convergence properties is studied. The iterative points generated by the algorithm will not change after at most $\left\lceil \frac{2F^{\max}}{\tau} \right\rceil$ steps and the stopping point also satisfies certain optimality conditions given a small enough $\tau$. Similar finite-step convergence is also established for a slight modification of the PAMe proposed in \cite{wang2021linear} very recently under a full-rank assumption. Such an assumption can also be removed when the projection dimension is one.

優化器 · 全局優化 · 離散化 · Networking · 設計 ·

2023 年 2 月 24 日

Globally Optimal Resource Allocation Design for IRS-Assisted Multiuser Networks with Discrete Phase Shifts

Yifei Wu,Dongfang Xu,Derrick Wing Kwan Ng,Robert Schober,Wolfgang Gerstacker

Intelligent reflecting surfaces (IRSs) are envisioned as a low-cost solution to achieve high spectral and energy efficiency in future communication systems due to their ability to customize wireless propagation environments. Although resource allocation design for IRS-assisted multiuser wireless communication systems has been exhaustively investigated in the literature, the optimal design and performance of such systems are still not well understood. To fill this gap, in this paper, we study optimal resource allocation for IRS-assisted multiuser multiple-input single-output (MISO) systems. In particular, we jointly optimize the beamforming at the base station (BS) and the discrete IRS phase shifts to minimize the total transmit power. For attaining the globally optimal solution of the formulated non-convex combinatorial optimization problem, we develop a resource allocation algorithm with guaranteed convergence based on Schur's complement and the generalized Bender's decomposition. Our numerical results reveal that the proposed algorithm can significantly reduce the BS transmit power compared to the state-of-the-art suboptimal alternating optimization-based approach, especially for moderate-to-large numbers of IRS elements.

Weight · 控制器 · 類別 · 優化器 · Extensibility ·

2023 年 2 月 24 日

Optimal controller synthesis for timed systems

Damien Busatto-Gaston,Benjamin Monmege,Pierre-Alain Reynier

from arxiv, arXiv admin note: text overlap with arXiv:1812.01062 author note: indeed, this is a journal paper that contains and expands on our previous conference papers

Weighted timed games are zero-sum games played by two players on a timed automaton equipped with weights, where one player wants to minimise the cumulative weight while reaching a target. Used in a reactive synthesis perspective, this quantitative extension of timed games allows one to measure the quality of controllers in real-time systems. Weighted timed games are notoriously difficult and quickly undecidable, even when restricted to non-negative weights. For non-negative weights, the largest class that can be analysed has been introduced by Bouyer, Jaziri and Markey in 2015. Though the value problem is undecidable, the authors show how to approximate the value by considering regions with a refined granularity. In this work, we extend this class to incorporate negative weights, allowing one to model energy for instance, and prove that the value can still be approximated, with the same complexity. A small restriction also allows us to obtain a class of decidable weighted timed games with negative weights and an arbitrary number of clocks. In addition, we show that a symbolic algorithm, relying on the paradigm of value iteration, can be used as an approximation/computation schema over these classes. We also consider the special case of untimed weighted games, where the same fragments are solvable in polynomial time: this contrasts with the pseudo-polynomial complexity, known so far, for weighted games without restrictions.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

二(er)次規劃(hua)

Microsoft Surface

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<form id='u60g5'></form>

<bdo id='u60g5'><sup id='u60g5'><div id='u60g5'><bdo id='u60g5'></bdo></div></sup></bdo>