The first part of the cumulative thesis contains the numerical analysis of different $hp$-finite element discretizations related to two different weak formulations of a model problem in elastoplasticity with linear kinematic hardening. The weak formulation either takes the form of a variational inequality of the second kind, including a non-differentiable plasticity functional, or represents a mixed formulation, in which the non-smooth plasticity functional is resolved by a Lagrange multiplier. As the non-differentiability of the plasticity functional causes many difficulties in the numerical analysis and in the computation of a discrete solution, it seems advantageous to consider discretizations of the mixed formulation. In a first work, an a priori error analysis of a higher-order finite element discretization of the mixed formulation (explicitly including the discretization of the Lagrange multiplier) is presented. The relations between the three different $hp$-discretizations are studied in a second work, where a reliable a posteriori error estimator that also satisfies some (local) efficiency estimates is derived. In a third work, an efficient semi-smooth Newton solver is proposed, which is obtained by reformulating a discretization of the mixed formulation as a system of decoupled nonlinear equations. The second part of the thesis introduces a new $hp$-adaptive algorithm for solving variational equations, in which the automatic mesh refinement does not rely on an a posteriori error estimator or smoothness indicators but is based on comparing locally predicted error reductions.
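As a generic illustration of the semi-smooth Newton idea (a minimal scalar sketch, not the decoupled elastoplasticity system from the thesis), one replaces the Jacobian of a non-differentiable residual by an element of its generalized derivative:

```python
# Minimal semi-smooth Newton sketch for a scalar nonsmooth equation
# F(x) = x + max(x, 0) - 1 = 0, with root x = 0.5.  This only illustrates
# the generic idea of using an element of the generalized (Clarke)
# derivative in place of the Jacobian; it is not the problem-specific,
# vector-valued solver proposed in the thesis.

def F(x):
    return x + max(x, 0.0) - 1.0

def G(x):
    # One element of the generalized derivative of F at x.
    return 2.0 if x > 0.0 else 1.0

x = -3.0                       # arbitrary starting guess
for _ in range(50):
    step = F(x) / G(x)
    x -= step
    if abs(step) < 1e-12:      # stop once the Newton step is negligible
        break

print(x)                       # ~0.5 after a few iterations
```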
We study the sample complexity of learning an $\epsilon$-optimal policy in an average-reward Markov decision process (MDP) under a generative model. For weakly communicating MDPs, we establish the complexity bound $\tilde{O}\left(SA\frac{H}{\epsilon^2}\right)$, where $H$ is the span of the bias function of the optimal policy and $SA$ is the cardinality of the state-action space. Our result is the first that is minimax optimal (up to log factors) in all parameters $S$, $A$, $H$, and $\epsilon$, improving on existing work that either assumes uniformly bounded mixing times for all policies or has suboptimal dependence on the parameters. We further investigate sample complexity in general (non-weakly-communicating) average-reward MDPs. We argue that a new transient time parameter $B$ is necessary, establish an $\tilde{O}\left(SA\frac{B+H}{\epsilon^2}\right)$ complexity bound, and prove a matching (up to log factors) minimax lower bound. Both results are based on reducing the average-reward MDP to a discounted MDP, which requires new ideas in the general setting. To establish the optimality of this reduction, we develop improved bounds for $\gamma$-discounted MDPs, showing that $\tilde{O}\left(SA\frac{H}{(1-\gamma)^2\epsilon^2}\right)$ samples suffice to learn an $\epsilon$-optimal policy in weakly communicating MDPs in the regime $\gamma\geq 1-1/H$, and that $\tilde{O}\left(SA\frac{B+H}{(1-\gamma)^2\epsilon^2}\right)$ samples suffice in general MDPs when $\gamma\geq 1-\frac{1}{B+H}$. Both results circumvent the well-known lower bound of $\tilde{\Omega}\left(SA\frac{1}{(1-\gamma)^3\epsilon^2}\right)$ for arbitrary $\gamma$-discounted MDPs. Our analysis develops upper bounds on certain instance-dependent variance parameters in terms of the span and transient time parameters. The weakly communicating bounds are tighter than those based on the mixing time or diameter of the MDP and may be of broader use.
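As a rough piece of bookkeeping (a heuristic sketch, not the paper's actual argument), the scaling can be seen as follows: if the discount is chosen so that $1-\gamma \approx \epsilon/H$ and the discounted problem is solved to accuracy $\epsilon_\gamma \approx \epsilon/(1-\gamma)$, the regime in which discounted optimality plausibly transfers to gain optimality, then the discounted bound evaluates to
\[
SA\,\frac{H}{(1-\gamma)^2\,\epsilon_\gamma^2} \;=\; SA\,\frac{H}{(1-\gamma)^2}\cdot\frac{(1-\gamma)^2}{\epsilon^2} \;=\; SA\,\frac{H}{\epsilon^2},
\]
which recovers the stated $\tilde{O}\left(SA\frac{H}{\epsilon^2}\right)$ average-reward rate up to constants and logarithmic factors.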
We consider the penalized distributionally robust optimization (DRO) problem with a closed, convex uncertainty set, a setting that encompasses the $f$-DRO, Wasserstein-DRO, and spectral/$L$-risk formulations used in practice. We present Drago, a stochastic primal-dual algorithm that achieves a state-of-the-art linear convergence rate on strongly convex-strongly concave DRO problems. The method combines both randomized and cyclic components with mini-batching, which effectively handles the unique asymmetric nature of the primal and dual problems in DRO. We support our theoretical results with numerical benchmarks in classification and regression.
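For concreteness, the following is a minimal full-batch sketch of a penalized DRO objective with a KL penalty and its closed-form inner maximization; it only illustrates the problem class and is not the Drago algorithm (no mini-batching, no randomized or cyclic components, no variance reduction), and the loss, penalty weight, and step size below are illustrative assumptions:

```python
# Minimal (full-batch) primal-dual sketch for KL-penalized DRO on a
# least-squares loss:
#   min_w  max_{q in simplex}  sum_i q_i * l_i(w)  -  nu * KL(q || uniform).
# With a KL penalty, the inner maximum has a closed-form softmax solution.

import numpy as np

rng = np.random.default_rng(0)
n, d = 200, 5
X = rng.normal(size=(n, d))
y = X @ rng.normal(size=d) + 0.1 * rng.normal(size=n)

nu, lr = 0.5, 0.05                          # KL penalty weight, primal step size
w = np.zeros(d)

for _ in range(500):
    losses = 0.5 * (X @ w - y) ** 2         # per-example losses l_i(w)
    q = np.exp((losses - losses.max()) / nu)  # closed-form dual variable
    q /= q.sum()
    grad = X.T @ (q * (X @ w - y))          # gradient of the q-weighted loss
    w -= lr * grad

print(np.round(w, 3))                       # approximate robust solution
```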
Conventional matrix completion methods approximate the missing values by assuming the matrix to be low-rank, which leads to a linear approximation of missing values. It has been shown that enhanced performance can be attained by using nonlinear estimators such as deep neural networks. Deep fully connected neural networks (FCNNs), one of the most suitable architectures for matrix completion, suffer from over-fitting due to their high capacity, which leads to low generalizability. In this paper, we control over-fitting by regularizing the FCNN model in terms of the $\ell_{1}$ norm of intermediate representations and the nuclear norm of weight matrices. The resulting regularized objective function becomes nonsmooth and nonconvex, so existing gradient-based methods cannot be applied to our model. We propose a variant of the proximal gradient method and investigate its convergence to a critical point. In the initial epochs of FCNN training, the regularization terms are ignored, and their effect is gradually increased over subsequent epochs. This gradual addition of the nonsmooth regularization terms is the main reason for the better performance of the deep neural network with nonsmooth regularization terms (DNN-NSR) algorithm. Our simulations indicate the superiority of the proposed algorithm over existing linear and nonlinear algorithms.
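The two proximal maps behind these regularizers, together with a simple increasing weight schedule, can be sketched as follows (an illustration of the ingredients only, not the full DNN-NSR training procedure; the linear ramp shape is an assumption):

```python
# Proximal maps for l1 and nuclear-norm regularization, plus an
# illustrative schedule that ramps the regularization weight up over epochs.

import numpy as np

def prox_l1(z, t):
    """Soft-thresholding: prox of t * ||.||_1 (applied to activations)."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def prox_nuclear(W, t):
    """Singular-value thresholding: prox of t * ||.||_* (applied to weights)."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    return U @ np.diag(np.maximum(s - t, 0.0)) @ Vt

def reg_weight(epoch, num_epochs, lam_max):
    """Illustrative linear ramp: regularization off early, full weight late."""
    return lam_max * epoch / num_epochs

rng = np.random.default_rng(0)
z = rng.normal(size=10)                     # a vector of activations
W = rng.normal(size=(8, 8))                 # a weight matrix
lam = reg_weight(epoch=30, num_epochs=100, lam_max=0.5)
print(prox_l1(z, lam))                      # sparsified activations
print(np.linalg.matrix_rank(prox_nuclear(W, lam), tol=1e-8))
```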
Solutions to differential equations, which are used to model physical systems, are computed numerically by solving a set of discretized equations. This set of discretized equations is reduced to a large linear system, whose solution is typically found using an iterative solver. We start with an initial guess, $x_0$, and iterate the algorithm to obtain a sequence of solution vectors, $x_k$, which are approximations to the exact solution of the linear system, $x$. The iterative algorithm is said to converge to $x$, over the field of reals, if and only if $x_k$ converges to $x$ as $k \to \infty$. In this paper, we formally prove the asymptotic convergence of a particular class of iterative methods, the stationary iterative methods, in the Coq theorem prover. We formalize the necessary and sufficient conditions for iterative convergence and extend this result to two classical iterative methods: the Gauss--Seidel method and the Jacobi method. For the Gauss--Seidel method, we also formalize a set of easily testable conditions for iterative convergence, called the Reich theorem, for a particular matrix structure, and apply it to a model problem based on the one-dimensional heat equation. We also apply the main theorem of iterative convergence to prove convergence of the Jacobi method on the same model problem.
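For illustration, here is a small floating-point sketch of the Jacobi iteration on the tridiagonal system arising from the one-dimensional heat equation model problem (an ordinary numerical computation, not the Coq formalization; the grid size and source term are placeholders):

```python
# Jacobi iteration on the tridiagonal system from a centered finite-difference
# discretization of the 1D steady heat equation -u'' = f with zero boundary
# values.  Convergence holds iff the spectral radius of D^{-1} R is < 1.

import numpy as np

N = 20                                     # interior grid points
h = 1.0 / (N + 1)
A = 2.0 * np.eye(N) - np.eye(N, k=1) - np.eye(N, k=-1)
f = np.ones(N) * h**2                      # constant source term, scaled by h^2

D = np.diag(np.diag(A))                    # diagonal part
R = A - D                                  # off-diagonal part
x = np.zeros(N)                            # initial guess x_0

for k in range(5000):
    x_new = np.linalg.solve(D, f - R @ x)  # x_{k+1} = D^{-1} (f - R x_k)
    if np.linalg.norm(x_new - x, ord=np.inf) < 1e-12:
        x = x_new
        break
    x = x_new

print(k, np.linalg.norm(A @ x - f, ord=np.inf))   # iterations used, residual
```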
We study the computational complexity of counting constraint satisfaction problems (#CSPs) whose constraints assign complex numbers to Boolean inputs when the corresponding constraint hypergraphs are acyclic. These problems are called acyclic #CSPs or, succinctly, #ACSPs. We wish to determine the computational complexity of all such #ACSPs when arbitrary unary constraints are freely available. Depending on whether we additionally allow or disallow the free use of the specific constraint XOR (binary disequality), we present two complexity classifications of #ACSPs according to the types of constraints used. When XOR is freely available, we obtain a complete dichotomy classification. By contrast, when XOR is not available for free, we obtain a trichotomy classification. To deal with the acyclic nature of constraints in these classifications, we develop a new technical tool called acyclic-T-constructibility, or AT-constructibility, and exploit it to analyze a complexity upper bound for each #ACSP.
We build on recent research on polynomial randomized approximation (PRAX) algorithms for the hard problems of NFA universality and NFA equivalence. Loosely speaking, PRAX algorithms sample from infinite domains to achieve any desired accuracy $\delta$. In the spirit of experimental mathematics, we extend the concept of PRAX algorithms so that they apply to the emptiness and universality problems in any domain whose instances admit a tractable distribution, as defined in this paper. A technical result here is that a linear (in $1/\delta$) number of samples is sufficient, as opposed to the quadratic number of samples required in previous papers. We show how the improved and generalized PRAX algorithms apply to universality and emptiness problems in various domains: ordinary automata, tautology testing of propositions, 2D automata, and solution sets of certain Diophantine equations.
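A minimal Monte Carlo sketch in the spirit of such sampling-based tests (illustrative only; the word distribution, the confidence parameter eta, and the toy language below are assumptions, not the PRAX algorithms of the paper):

```python
# Sampling-based universality test: draw words from a tractable distribution;
# a rejected word certifies non-universality, otherwise report "probably
# universal".  If the distribution puts mass >= delta on rejected words, the
# chance of missing all of them in N samples is <= (1 - delta)^N <= exp(-N*delta),
# so N = ceil(ln(1/eta) / delta) samples (linear in 1/delta) give confidence 1 - eta.

import math, random

def sample_word(alphabet="ab", stop=0.25, rng=random):
    """Geometric-length random word: a simple 'tractable' distribution."""
    w = []
    while rng.random() > stop:
        w.append(rng.choice(alphabet))
    return "".join(w)

def accepts(word):
    """Toy language: words with an even number of 'a's (not universal)."""
    return word.count("a") % 2 == 0

def approx_universal(accepts, delta=0.01, eta=0.001, rng=random):
    n_samples = math.ceil(math.log(1 / eta) / delta)
    for _ in range(n_samples):
        w = sample_word(rng=rng)
        if not accepts(w):
            return False, w           # definite counterexample
    return True, None                 # "probably universal" up to (delta, eta)

print(approx_universal(accepts))
```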
We give a procedure for computing group-level $(\epsilon, \delta)$-DP guarantees for DP-SGD, when using Poisson sampling or fixed batch size sampling. Up to discretization errors in the implementation, the DP guarantees computed by this procedure are tight (assuming we release every intermediate iterate).
The underlying data distributions of natural language, programming code, and mathematical symbols vary vastly, presenting a complex challenge for large language models (LLMs) that strive to achieve high performance across all three domains simultaneously. Achieving a very high level of proficiency within a specific domain often requires extensive training on relevant corpora, which is typically accompanied by a sacrifice in performance in other domains. In this paper, we propose to directly fuse models that are already highly specialized. The proposed fusing framework, UltraFuser, consists of three distinct specialists that have already been sufficiently trained on language, coding, and mathematics. A token-level gating mechanism is introduced to blend the specialists' outputs. A two-stage training strategy, accompanied by balanced sampling, is designed to ensure stability. To effectively train the fused model, we further construct a high-quality supervised instruction tuning dataset, UltraChat 2, which includes text, code, and mathematical content. This dataset comprises approximately 300,000 instructions and covers a wide range of topics in each domain. Experiments show that our model can simultaneously achieve mastery of the three crucial domains.
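A minimal sketch of token-level gating over three frozen specialists is shown below; the linear stand-in specialists, hidden size, and vocabulary size are placeholders, and this is not the actual UltraFuser implementation:

```python
# Token-level gating: a small gate network produces per-token mixing weights
# over the specialists' output logits.

import torch
import torch.nn as nn

class TokenGatedFusion(nn.Module):
    def __init__(self, specialists, hidden_size):
        super().__init__()
        self.specialists = nn.ModuleList(specialists)     # e.g., language / code / math
        self.gate = nn.Linear(hidden_size, len(specialists))

    def forward(self, hidden_states, *specialist_inputs):
        # Each specialist produces per-token logits: (batch, seq, vocab).
        logits = torch.stack(
            [m(x) for m, x in zip(self.specialists, specialist_inputs)], dim=-1
        )                                                  # (batch, seq, vocab, 3)
        # Per-token mixing weights from the shared hidden states.
        weights = torch.softmax(self.gate(hidden_states), dim=-1)   # (batch, seq, 3)
        return (logits * weights.unsqueeze(2)).sum(dim=-1)          # (batch, seq, vocab)

# Toy usage with linear "specialists" standing in for full LLMs.
B, T, H, V = 2, 4, 8, 16
model = TokenGatedFusion([nn.Linear(H, V) for _ in range(3)], hidden_size=H)
x = torch.randn(B, T, H)
print(model(x, x, x, x).shape)    # torch.Size([2, 4, 16])
```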
Accurate and interpretable prediction of future events in time-series data often requires capturing the representative patterns (referred to as states) underpinning the observed data. To this end, most existing studies focus on the representation and recognition of states but ignore the changing transitional relations among them. In this paper, we present the evolutionary state graph, a dynamic graph structure designed to systematically represent the evolving relations (edges) among states (nodes) over time. We analyze dynamic graphs constructed from time-series data and show that changes in the graph structure (e.g., edges connecting certain state nodes) can inform the occurrence of events (i.e., time-series fluctuations). Inspired by this, we propose a novel graph neural network model, the Evolutionary State Graph Network (EvoNet), which encodes the evolutionary state graph for accurate and interpretable time-series event prediction. Specifically, EvoNet models both node-level (state-to-state) and graph-level (segment-to-segment) propagation, and captures node-graph (state-to-segment) interactions over time. Experimental results on five real-world datasets show that our approach not only achieves clear improvements over 11 baselines, but also provides more insight for explaining the results of event prediction.
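To make the data structure concrete, the following sketch builds a sequence of state-transition snapshots from a univariate series, using quantile binning of segment means as a stand-in for a learned state recognizer (illustrative only; the segment length, number of states, and binning rule are assumptions, and this is not the EvoNet model):

```python
# Build an "evolutionary state graph": segments are mapped to discrete states,
# and each pair of consecutive segments contributes one edge at that time step.

import numpy as np

def evolutionary_state_graph(series, seg_len=24, n_states=4):
    n_seg = len(series) // seg_len
    seg_means = series[: n_seg * seg_len].reshape(n_seg, seg_len).mean(axis=1)
    # State of each segment: which quantile bin its mean falls into.
    bins = np.quantile(seg_means, np.linspace(0, 1, n_states + 1)[1:-1])
    states = np.digitize(seg_means, bins)                 # values in {0..n_states-1}
    # One (n_states x n_states) adjacency snapshot per transition in time.
    snapshots = np.zeros((n_seg - 1, n_states, n_states), dtype=int)
    for t in range(n_seg - 1):
        snapshots[t, states[t], states[t + 1]] += 1
    return states, snapshots

rng = np.random.default_rng(0)
x = np.sin(np.linspace(0, 20 * np.pi, 24 * 60)) + 0.3 * rng.normal(size=24 * 60)
states, snapshots = evolutionary_state_graph(x)
print(states[:10], snapshots.sum(axis=0))   # state sequence, aggregated transitions
```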
Most deep learning-based models for speech enhancement have focused mainly on estimating the magnitude of the spectrogram while reusing the phase from the noisy speech for reconstruction, owing to the difficulty of estimating the phase of clean speech. To improve speech enhancement performance, we tackle the phase estimation problem in three ways. First, we propose Deep Complex U-Net, an advanced U-Net-structured model incorporating well-defined complex-valued building blocks to deal with complex-valued spectrograms. Second, we propose a polar-coordinate-wise complex-valued masking method that reflects the distribution of complex ideal ratio masks. Third, we define a novel loss function, the weighted source-to-distortion ratio (wSDR) loss, which is designed to directly correlate with a quantitative evaluation measure. Our model was evaluated on a mixture of the Voice Bank corpus and the DEMAND database, which has been widely used by many deep learning models for speech enhancement. Ablation experiments conducted on the mixed dataset show that all three proposed approaches are empirically valid. Experimental results show that the proposed method achieves state-of-the-art performance on all metrics, outperforming previous approaches by a large margin.
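A sketch of a weighted SDR-style loss in this spirit is shown below; the cosine-similarity form of the SDR term and the energy-based weight alpha follow a commonly reported variant and should be read as an assumption rather than the paper's exact definition:

```python
# Weighted SDR-style loss: weight the SDR terms of the estimated clean signal
# and of the implied noise estimate by their relative energies.

import numpy as np

def neg_cosine(a, b, eps=1e-8):
    """-<a, b> / (||a|| ||b||): bounded in [-1, 1], lower is better."""
    return -np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + eps)

def wsdr_loss(noisy, clean, estimate, eps=1e-8):
    noise = noisy - clean              # true noise component
    noise_est = noisy - estimate       # noise implied by the estimate
    alpha = np.sum(clean**2) / (np.sum(clean**2) + np.sum(noise**2) + eps)
    return alpha * neg_cosine(clean, estimate) + (1 - alpha) * neg_cosine(noise, noise_est)

# Toy check: the loss is smallest when the estimate equals the clean signal.
rng = np.random.default_rng(0)
clean = rng.normal(size=16000)
noisy = clean + 0.5 * rng.normal(size=16000)
print(wsdr_loss(noisy, clean, clean), wsdr_loss(noisy, clean, noisy))
```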