亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<dir id='4tl4F'><del id='rJooI'><del id='mC1Og'></del><pre id='RRzdj'><pre id='pIJNg'><option id='zZqpb'><address id='YDfCu'></address><bdo id='NrpLG'><tr id='vEsxP'><acronym id='r6HJA'><pre id='0QR1L'></pre></acronym><div id='uUIWf'></div></tr></bdo></option></pre><small id='Cs9mT'><address id='SxxtC'><u id='zBXGX'><legend id='Uqm8Z'><option id='DsQZb'><abbr id='iWo1a'></abbr><li id='4TDGe'><pre id='cFfN7'></pre></li></option></legend><select id='5Inha'></select></u></address></small></pre></del><sup id='I3kc1'></sup><blockquote id='dIK7L'><dt id='jMjlL'></dt></blockquote><blockquote id='7iGtJ'></blockquote></dir><tt id='Q9rCV'></tt><u id='olJa6'><tt id='6eJYD'><form id='pLla7'></form></tt><td id='viG65'><dt id='Aw4ba'></dt></td></u>

<code id='BAz4e'><i id='8oh2B'><q id='2MgW1'><legend id='EXCHv'><pre id='XGMOC'><style id='aE1EV'><acronym id='ziHaR'><i id='ssy2R'><form id='3R9ji'><option id='4eRfg'><center id='dU6NW'></center></option></form></i></acronym></style><tt id='5XUBd'></tt></pre></legend></q></i></code><center id='CyNgb'></center>

<dd id='k8A6A'></dd>

<style id='ypFI6'></style><sub id='SlKty'><dfn id='LS22a'><abbr id='3E1N6'><big id='OOj5F'><bdo id='8ddM9'></bdo></big></abbr></dfn></sub>_{<dir id='Ph9vi'></dir>}

·

Processing（編程語言） · 相關系數 · MoDELS · Continuity · 相同 ·

2021 年 12 月 30 日

From the multi-terms urn model to the self-exciting negative binomial distribution and Hawkes processes

Masato Hisakado,Kodai Hattori,Shintaro Mori

from arxiv, 17 pages, 3 figures

This study considers a new multi-term urn process that has a correlation in the same term and temporal correlation. The objective is to clarify the relationship between the urn model and the Hawkes process. Correlation in the same term is represented by the P\'{o}lya urn model and the temporal correlation is incorporated by introducing the conditional initial condition. In the double-scaling limit of this urn process, the self-exciting negative binomial distribution (SE-NBD) process, which is a marked point process, is obtained. In the standard continuous limit, this process becomes the Hawkes process, which has no correlation in the same term. The difference is the variance of the intensity function in that the phase transition from the steady to the non-steady state can be observed. The critical point, at which the power law distribution is obtained, is the same for the Hawkes and the urn processes. These two processes are used to analyze empirical data of financial default to estimate the parameters of the model. For the default portfolio, the results produced by the urn process are superior to those obtained with the Hawkes process and confirm self-excitation.

相關內容

Processing（編程語言）

Processing（編程(cheng)語言(yan)）

Processing 是(shi)一門開源編(bian)程語言和與(yu)之配套的集成開發(fa)環(huan)境（IDE）的名(ming)稱(cheng)。Processing 在電子藝(yi)術和視覺設計社區被(bei)用(yong)來教授(shou)編(bian)程基礎，并運用(yong)于大量(liang)的新媒(mei)體和互動藝(yi)術作品中(zhong)。

馬爾可夫鏈 · 近似 · 圖 · 混合 · 均勻采樣 ·

2022 年 4 月 20 日

Approximate Sampling and Counting of Graphs with Near-$P$-stable Degree Intervals

Péter L. Erd?s,Tamás Róbert Mezei,István Miklós

from arxiv, 23 pages

The approximate uniform sampling of graph realizations with a given degree sequence is an everyday task in several social science, computer science, engineering etc. projects. One approach is using Markov chains. The best available current result about the well-studied switch Markov chain is that it is rapidly mixing on P-stable degree sequences (see DOI:10.1016/j.ejc.2021.103421). The switch Markov chain does not change any degree sequence. However, there are cases where degree intervals are specified rather than a single degree sequence. (A natural scenario where this problem arises is in hypothesis testing on social networks that are only partially observed.) Rechner, Strowick, and M\"uller-Hannemann introduced in 2018 the notion of degree interval Markov chain which uses three (separately well-studied) local operations (switch, hinge-flip and toggle), and employing on degree sequence realizations where any two sequences under scrutiny have very small coordinate-wise distance. Recently Amanatidis and Kleer published a beautiful paper (arXiv:2110.09068), showing that the degree interval Markov chain is rapidly mixing if the sequences are coming from a system of very thin intervals which are centered not far from a regular degree sequence. In this paper we extend substantially their result, showing that the degree interval Markov chain is rapidly mixing if the intervals are centred at P-stable degree sequences.

流形 · 近似 · 離散化 · 核化 · 核矩陣 ·

2022 年 4 月 19 日

Graph-theoretic algorithms for Kolmogorov operators: Approximating solutions and their gradients in elliptic and parabolic problems on manifolds

Andrew D. Davis,Dimitrios Giannakis

We employ kernel-based approaches that use samples from a probability distribution to approximate a Kolmogorov operator on a manifold. The self-tuning variable-bandwidth kernel method [Berry & Harlim, Appl. Comput. Harmon. Anal., 40(1):68--96, 2016] computes a large, sparse matrix that approximates the differential operator. Here, we use the eigendecomposition of the discretization to (i) invert the operator, solving a differential equation, and (ii) represent gradient vector fields on the manifold. These methods only require samples from the underlying distribution and, therefore, can be applied in high dimensions or on geometrically complex manifolds when spatial discretizations are not available. We also employ an efficient $k$-$d$ tree algorithm to compute the sparse kernel matrix, which is a computational bottleneck.

平穩分布 · 估計/估計量 · 平穩的 · 學成 · 約束 ·

2022 年 4 月 19 日

COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation

Jongmin Lee,Cosmin Paduraru,Daniel J. Mankowitz,Nicolas Heess,Doina Precup,Kee-Eung Kim,Arthur Guez

from arxiv, 24 pages, 6 figures, Accepted at ICLR 2022 (spotlight)

We consider the offline constrained reinforcement learning (RL) problem, in which the agent aims to compute a policy that maximizes expected return while satisfying given cost constraints, learning only from a pre-collected dataset. This problem setting is appealing in many real-world scenarios, where direct interaction with the environment is costly or risky, and where the resulting policy should comply with safety constraints. However, it is challenging to compute a policy that guarantees satisfying the cost constraints in the offline RL setting, since the off-policy evaluation inherently has an estimation error. In this paper, we present an offline constrained RL algorithm that optimizes the policy in the space of the stationary distribution. Our algorithm, COptiDICE, directly estimates the stationary distribution corrections of the optimal policy with respect to returns, while constraining the cost upper bound, with the goal of yielding a cost-conservative policy for actual constraint satisfaction. Experimental results show that COptiDICE attains better policies in terms of constraint satisfaction and return-maximization, outperforming baseline algorithms.

潛變量/隱變量 · 生成模型 · MoDELS · Performer · 學成 ·

2022 年 4 月 18 日

Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models

Ali Ghadirzadeh,Petra Poklukar,Karol Arndt,Chelsea Finn,Ville Kyrki,Danica Kragic,M?rten Bj?rkman

from arxiv, arXiv admin note: substantial text overlap with arXiv:2007.13134

We present a data-efficient framework for solving sequential decision-making problems which exploits the combination of reinforcement learning (RL) and latent variable generative models. The framework, called GenRL, trains deep policies by introducing an action latent variable such that the feed-forward policy search can be divided into two parts: (i) training a sub-policy that outputs a distribution over the action latent variable given a state of the system, and (ii) unsupervised training of a generative model that outputs a sequence of motor actions conditioned on the latent action variable. GenRL enables safe exploration and alleviates the data-inefficiency problem as it exploits prior knowledge about valid sequences of motor actions. Moreover, we provide a set of measures for evaluation of generative models such that we are able to predict the performance of the RL policy training prior to the actual training on a physical robot. We experimentally determine the characteristics of generative models that have most influence on the performance of the final policy training on two robotics tasks: shooting a hockey puck and throwing a basketball. Furthermore, we empirically demonstrate that GenRL is the only method which can safely and efficiently solve the robotics tasks compared to two state-of-the-art RL methods.

MoDELS · 模型評估 · 論文 · 數值分析 ·

2022 年 4 月 18 日

An Iterative Decoupled Algorithm with Unconditional Stability for Biot Model

Huipeng Gu,Mingchao Cai,Jingzhi Li

This paper is concerned with numerical algorithms for Biot model. By introducing an intermediate variable, the classical 2-field Biot model is written into a 3-field formulation. Based on such a 3-field formulation, we propose a coupled algorithm, some time-extrapolation based decoupled algorithms, and an iterative decoupled algorithm. Our focus is the analysis of the iterative decoupled algorithm. It is shown that the convergence of the iterative decoupled algorithm requires no extra assumptions on physical parameters or stabilization parameters. Numerical experiments are provided to demonstrate the accuracy and efficiency of the proposed method.

講稿 · MoDELS · 結點 · PODC · Networking ·

2022 年 4 月 18 日

Distributed MST Computation in the Sleeping Model: Awake-Optimal Algorithms and Lower Bounds

John Augustine,William K. Moses Jr.,Gopal Pandurangan

from arxiv, 28 pages, 1 table, 5 figures, abstract modified to fit arXiv constraints

We study the distributed minimum spanning tree (MST) problem, a fundamental problem in distributed computing. It is well-known that distributed MST can be solved in $\tilde{O}(D+\sqrt{n})$ rounds in the standard CONGEST model (where $n$ is the network size and $D$ is the network diameter) and this is essentially the best possible round complexity (up to logarithmic factors). However, in resource-constrained networks such as ad hoc wireless and sensor networks, nodes spending so much time can lead to significant spending of resources such as energy. Motivated by the above consideration, we study distributed algorithms for MST under the \emph{sleeping model} [Chatterjee et al., PODC 2020], a model for design and analysis of resource-efficient distributed algorithms. In the sleeping model, a node can be in one of two modes in any round -- \emph{sleeping} or \emph{awake} (unlike the traditional model where nodes are always awake). Only the rounds in which a node is \emph{awake} are counted, while \emph{sleeping} rounds are ignored. A node spends resources only in the awake rounds and hence the main goal is to minimize the \emph{awake complexity} of a distributed algorithm, the worst-case number of rounds any node is awake. We present deterministic and randomized distributed MST algorithms that have an \emph{optimal} awake complexity of $O(\log n)$ time with a matching lower bound. We also show that our randomized awake-optimal algorithm has essentially the best possible round complexity by presenting a lower bound of $\tilde{\Omega}(n)$ on the product of the awake and round complexity of any distributed algorithm (including randomized) that outputs an MST, where $\tilde{\Omega}$ hides a $1/(\text{polylog } n)$ factor.

子空間 · 優化器 · 動力系統 · 可辨認的 · 線性組合 ·

2022 年 4 月 18 日

A dynamical systems based framework for dimension reduction

Ryeongkyung Yoon,Braxton Osting

from arxiv, 26 pages, 7 figures

We propose a novel framework for learning a low-dimensional representation of data based on nonlinear dynamical systems, which we call dynamical dimension reduction (DDR). In the DDR model, each point is evolved via a nonlinear flow towards a lower-dimensional subspace; the projection onto the subspace gives the low-dimensional embedding. Training the model involves identifying the nonlinear flow and the subspace. Following the equation discovery method, we represent the vector field that defines the flow using a linear combination of dictionary elements, where each element is a pre-specified linear/nonlinear candidate function. A regularization term for the average total kinetic energy is also introduced and motivated by optimal transport theory. We prove that the resulting optimization problem is well-posed and establish several properties of the DDR method. We also show how the DDR method can be trained using a gradient-based optimization method, where the gradients are computed using the adjoint method from optimal control theory. The DDR method is implemented and compared on synthetic and example datasets to other dimension reductions methods, including PCA, t-SNE, and Umap.

分離的 · MoDELS · 秩 · INFORMS · 信息理論 ·

2022 年 4 月 16 日

Cannikin's Law in Tensor Modeling: A Rank Study for Entanglement and Separability in Tensor Complexity and Model Capacity

from arxiv, 7 pages

This study clarifies the proper criteria to assess the modeling capacity of a general tensor model. The work analyze the problem based on the study of tensor ranks, which is not a well-defined quantity for higher order tensors. To process, the author introduces the separability issue to discuss the Cannikin's law of tensor modeling. Interestingly, a connection between entanglement studied in information theory and tensor analysis is established, shedding new light on the theoretical understanding for modeling capacity problems.

Single-Shot · 可約的 · Performer · Extensibility · 優化器 ·

2022 年 4 月 15 日

Single-shot Embedding Dimension Search in Recommender System

Liang Qu,Yonghong Ye,Ningzhi Tang,Lixin Zhang,Yuhui Shi,Hongzhi Yin

As a crucial component of most modern deep recommender systems, feature embedding maps high-dimensional sparse user/item features into low-dimensional dense embeddings. However, these embeddings are usually assigned a unified dimension, which suffers from the following issues: (1) high memory usage and computation cost. (2) sub-optimal performance due to inferior dimension assignments. In order to alleviate the above issues, some works focus on automated embedding dimension search by formulating it as hyper-parameter optimization or embedding pruning problems. However, they either require well-designed search space for hyperparameters or need time-consuming optimization procedures. In this paper, we propose a Single-Shot Embedding Dimension Search method, called SSEDS, which can efficiently assign dimensions for each feature field via a single-shot embedding pruning operation while maintaining the recommendation accuracy of the model. Specifically, it introduces a criterion for identifying the importance of each embedding dimension for each feature field. As a result, SSEDS could automatically obtain mixed-dimensional embeddings by explicitly reducing redundant embedding dimensions based on the corresponding dimension importance ranking and the predefined parameter budget. Furthermore, the proposed SSEDS is model-agnostic, meaning that it could be integrated into different base recommendation models. The extensive offline experiments are conducted on two widely used public datasets for CTR prediction tasks, and the results demonstrate that SSEDS can still achieve strong recommendation performance even if it has reduced 90\% parameters. Moreover, SSEDS has also been deployed on the WeChat Subscription platform for practical recommendation services. The 7-day online A/B test results show that SSEDS can significantly improve the performance of the online recommendation model.

樣本 · 類別 · 損失 · Performer · SimPLe ·

2019 年 1 月 16 日

Class-Balanced Loss Based on Effective Number of Samples

Yin Cui,Menglin Jia,Tsung-Yi Lin,Yang Song,Serge Belongie

from arxiv, Code is available at: //github.com/richardaecn/class-balanced-loss

With the rapid increase of large-scale, real-world datasets, it becomes critical to address the problem of long-tailed data distribution (i.e., a few classes account for most of the data, while most classes are under-represented). Existing solutions typically adopt class re-balancing strategies such as re-sampling and re-weighting based on the number of observations for each class. In this work, we argue that as the number of samples increases, the additional benefit of a newly added data point will diminish. We introduce a novel theoretical framework to measure data overlap by associating with each sample a small neighboring region rather than a single point. The effective number of samples is defined as the volume of samples and can be calculated by a simple formula $(1-\beta^{n})/(1-\beta)$, where $n$ is the number of samples and $\beta \in [0,1)$ is a hyperparameter. We design a re-weighting scheme that uses the effective number of samples for each class to re-balance the loss, thereby yielding a class-balanced loss. Comprehensive experiments are conducted on artificially induced long-tailed CIFAR datasets and large-scale datasets including ImageNet and iNaturalist. Our results show that when trained with the proposed class-balanced loss, the network is able to achieve significant performance gains on long-tailed datasets.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

Processing（編程語言）

相(xiang)關系(xi)數(shu)

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<tfoot id='ck5lt'></tfoot>

<legend id='ck5lt'><style id='ck5lt'><dir id='ck5lt'><q id='ck5lt'></q></dir></style></legend>

<i id='ck5lt'><tr id='ck5lt'><dt id='ck5lt'><q id='ck5lt'><span id='ck5lt'><b id='ck5lt'><form id='ck5lt'><ins id='ck5lt'></ins><ul id='ck5lt'></ul><sub id='ck5lt'></sub></form><legend id='ck5lt'></legend><bdo id='ck5lt'><pre id='ck5lt'><center id='ck5lt'></center></pre></bdo></b><th id='ck5lt'></th></span></q></dt></tr></i><div id='ck5lt'><tfoot id='ck5lt'></tfoot><dl id='ck5lt'><fieldset id='ck5lt'></fieldset></dl></div>