亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

·

CCS · Processing（編程語言） · 貝葉斯風險 · 可約的 · 控制器 ·

2021 年 8 月 6 日

Anomaly Search with Multiple Plays under Delay and Switching Costs

Tidhar Lambez,Kobi Cohen

from arxiv, 30 pages, 7 figures (14 images)

The problem of searching for $L$ anomalous processes among $M$ processes is considered. At each time, the decision maker can observe a subset of $K$ processes (i.e., multiple plays). The measurement drawn when observing a process follows one of two different distributions, depending on whether the process is normal or abnormal. The goal is to design a policy that minimizes the Bayes risk which balances between the sample complexity, detection errors, and the switching cost associated with switching across processes. We develop a policy, dubbed consecutive controlled sensing (CCS), to achieve this goal. On the one hand, by contrast to existing studies on controlled sensing, the CCS policy senses processes consecutively to reduce the switching cost. On the other hand, the policy controls the sensing operation in a closed-loop manner to switch between processes when necessary to guarantee reliable inference. We prove theoretically that CCS is asymptotically optimal in terms of minimizing the Bayes risk as the detection error approaches zero (i.e., the sample complexity increases). Simulation results demonstrate strong performance of CCS in the finite regime as well.

相關內容

CCS

CCS：ACM Conference on Computer and Communications Security。 Explanation：計算機和通信安全會議。 Publisher：ACM。 SIT：

Machine Learning · 學成 · Continuity · 全 · Processing（編程語言） ·

2021 年 10 月 5 日

The Potential of Machine Learning to Enhance Computational Fluid Dynamics

Ricardo Vinuesa,Steven L. Brunton

from arxiv, 13 pages, 5 figures

Machine learning is rapidly becoming a core technology for scientific computing, with numerous opportunities to advance the field of computational fluid dynamics. This paper highlights some of the areas of highest potential impact, including to accelerate direct numerical simulations, to improve turbulence closure modelling, and to develop enhanced reduced-order models. In each of these areas, it is possible to improve machine learning capabilities by incorporating physics into the process, and in turn, to improve the simulation of fluids to uncover new physical understanding. Despite the promise of machine learning described here, we also note that classical methods are often more efficient for many tasks. We also emphasize that in order to harness the full potential of machine learning to improve computational fluid dynamics, it is essential for the community to continue to establish benchmark systems and best practices for open-source software, data sharing, and reproducible research.

可約的 · 在線推斷 · 累積分布函數 · INFORMS · Performer ·

2021 年 10 月 4 日

Online multiple testing with super-uniformity reward

Sebastian D?hler,Iqraa Meah,Etienne Roquain

Valid online inference is an important problem in contemporary multiple testing research, to which various solutions have been proposed recently. It is well-known that these methods can suffer from a significant loss of power if the null $p$-values are conservative. This occurs frequently, for instance whenever discrete tests are performed. To reduce conservatism, we introduce the method of super-uniformity reward (SURE). This approach works by incorporating information about the individual null cumulative distribution functions (or upper bounds of them), which we assume to be available. Our approach yields several new "rewarded" procedures that theoretically control online error criteria based either on the family-wise error rate (FWER) or the marginal false discovery rate (mFDR). We prove that the rewarded procedures uniformly improve upon the non-rewarded ones, and illustrate their performance for simulated and real data.

估計/估計量 · 優化器 · 置信度 · 估計誤差 · 寬度 ·

2021 年 10 月 3 日

Optimal Change-Point Detection with Training Sequences in the Large and Moderate Deviations Regimes

Haiyun He,Qiaosheng Zhang,Vincent Y. F. Tan

from arxiv, 27 pages, 11 figures

This paper investigates a novel offline change-point detection problem from an information-theoretic perspective. In contrast to most related works, we assume that the knowledge of the underlying pre- and post-change distributions are not known and can only be learned from the training sequences which are available. We further require the probability of the \emph{estimation error} to decay either exponentially or sub-exponentially fast (corresponding respectively to the large and moderate deviations regimes in information theory parlance). Based on the training sequences as well as the test sequence consisting of a single change-point, we design a change-point estimator and further show that this estimator is optimal by establishing matching (strong) converses. This leads to a full characterization of the optimal confidence width (i.e., half the width of the confidence interval within which the true change-point is located at with high probability) as a function of the undetected error, under both the large and moderate deviations regimes.

Facebook AI Research · 在線 · Continuity · INFORMS · 情景 ·

2021 年 10 月 2 日

Online Market Equilibrium with Application to Fair Division

Yuan Gao,Christian Kroer,Alex Peysakhovich

from arxiv, An earlier version was accepted at NeurIPS 2021

Computing market equilibria is a problem of both theoretical and applied interest. Much research to date focuses on the case of static Fisher markets with full information on buyers' utility functions and item supplies. Motivated by real-world markets, we consider an online setting: individuals have linear, additive utility functions; items arrive sequentially and must be allocated and priced irrevocably. We define the notion of an online market equilibrium in such a market as time-indexed allocations and prices which guarantee buyer optimality and market clearance in hindsight. We propose a simple, scalable and interpretable allocation and pricing dynamics termed as PACE. When items are drawn i.i.d. from an unknown distribution (with a possibly continuous support), we show that PACE leads to an online market equilibrium asymptotically. In particular, PACE ensures that buyers' time-averaged utilities converge to the equilibrium utilities w.r.t. a static market with item supplies being the unknown distribution and that buyers' time-averaged expenditures converge to their per-period budget. Hence, many desirable properties of market equilibrium-based fair division such as no envy, Pareto optimality, and the proportional-share guarantee are also attained asymptotically in the online setting. Next, we extend the dynamics to handle quasilinear buyer utilities, which gives the first online algorithm for computing first-price pacing equilibria. Finally, numerical experiments on real and synthetic datasets show that the dynamics converges quickly under various metrics.

多樣性 · 多樣性度量/差異性度量 · 情景 · 演化計算 · 進化計算 ·

2021 年 10 月 1 日

Evolving Diverse Sets of Tours for the Travelling Salesperson Problem

Anh Viet Do,Jakob Bossek,Aneta Neumann,Frank Neumann

from arxiv, 11 pages, 3 tables, 3 figures, published in GECCO '20; proof of Theorem 1 corrected

Evolving diverse sets of high quality solutions has gained increasing interest in the evolutionary computation literature in recent years. With this paper, we contribute to this area of research by examining evolutionary diversity optimisation approaches for the classical Traveling Salesperson Problem (TSP). We study the impact of using different diversity measures for a given set of tours and the ability of evolutionary algorithms to obtain a diverse set of high quality solutions when adopting these measures. Our studies show that a large variety of diverse high quality tours can be achieved by using our approaches. Furthermore, we compare our approaches in terms of theoretical properties and the final set of tours obtained by the evolutionary diversity optimisation algorithm.

異常檢測 · MoDELS · 可約的 · 判別器 · 判別方法 ·

2019 年 1 月 28 日

Anomaly DetectionWith Multiple-Hypotheses Predictions

Duc Tam Nguyen,Zhongyu Lou,Michael Klar,Thomas Brox

In one-class-learning tasks, only the normal case (foreground) can be modeled with data, whereas the variation of all possible anomalies is too erratic to be described by samples. Thus, due to the lack of representative data, the wide-spread discriminative approaches cannot cover such learning tasks, and rather generative models, which attempt to learn the input density of the foreground, are used. However, generative models suffer from a large input dimensionality (as in images) and are typically inefficient learners. We propose to learn the data distribution of the foreground more efficiently with a multi-hypotheses autoencoder. Moreover, the model is criticized by a discriminator, which prevents artificial data modes not supported by data, and enforces diversity across hypotheses. Our multiple-hypothesesbased anomaly detection framework allows the reliable identification of out-of-distribution samples. For anomaly detection on CIFAR-10, it yields up to 3.9% points improvement over previously reported results. On a real anomaly detection task, the approach reduces the error of the baseline models from 6.8% to 1.5%.

學成 · 深度強化學習 · 強化學習 · 回合 · 查準率/準確率 ·

2018 年 10 月 16 日

At Human Speed: Deep Reinforcement Learning with Action Delay

Vlad Firoiu,Tina Ju,Josh Tenenbaum

There has been a recent explosion in the capabilities of game-playing artificial intelligence. Many classes of tasks, from video games to motor control to board games, are now solvable by fairly generic algorithms, based on deep learning and reinforcement learning, that learn to play from experience with minimal prior knowledge. However, these machines often do not win through intelligence alone -- they possess vastly superior speed and precision, allowing them to act in ways a human never could. To level the playing field, we restrict the machine's reaction time to a human level, and find that standard deep reinforcement learning methods quickly drop in performance. We propose a solution to the action delay problem inspired by human perception -- to endow agents with a neural predictive model of the environment which "undoes" the delay inherent in their environment -- and demonstrate its efficacy against professional players in Super Smash Bros. Melee, a popular console fighting game.

估計/估計量 · 學成 · 混淆矩陣 · 無偏 · 強化學習 ·

2018 年 10 月 5 日

Reinforcement Learning with Perturbed Rewards

Jingkang Wang,Yang Liu,Bo Li

Recent studies have shown the vulnerability of reinforcement learning (RL) models in noisy settings. The sources of noises differ across scenarios. For instance, in practice, the observed reward channel is often subject to noise (e.g., when observed rewards are collected through sensors), and thus observed rewards may not be credible as a result. Also, in applications such as robotics, a deep reinforcement learning (DRL) algorithm can be manipulated to produce arbitrary errors. In this paper, we consider noisy RL problems where observed rewards by RL agents are generated with a reward confusion matrix. We call such observed rewards as perturbed rewards. We develop an unbiased reward estimator aided robust RL framework that enables RL agents to learn in noisy environments while observing only perturbed rewards. Our framework draws upon approaches for supervised learning with noisy data. The core ideas of our solution include estimating a reward confusion matrix and defining a set of unbiased surrogate rewards. We prove the convergence and sample complexity of our approach. Extensive experiments on different DRL platforms show that policies based on our estimated surrogate reward can achieve higher expected rewards, and converge faster than existing baselines. For instance, the state-of-the-art PPO algorithm is able to obtain 67.5% and 46.7% improvements in average on five Atari games, when the error rates are 10% and 30% respectively.

MoDELS · SimPLe · CC · 模型評估 · 高斯混合（模型） ·

2018 年 2 月 24 日

The Search Problem in Mixture Models

Avik Ray,Joe Neeman,Sujay Sanghavi,Sanjay Shakkottai

We consider the task of learning the parameters of a {\em single} component of a mixture model, for the case when we are given {\em side information} about that component, we call this the "search problem" in mixture models. We would like to solve this with computational and sample complexity lower than solving the overall original problem, where one learns parameters of all components. Our main contributions are the development of a simple but general model for the notion of side information, and a corresponding simple matrix-based algorithm for solving the search problem in this general setting. We then specialize this model and algorithm to four common scenarios: Gaussian mixture models, LDA topic models, subspace clustering, and mixed linear regression. For each one of these we show that if (and only if) the side information is informative, we obtain parameter estimates with greater accuracy, and also improved computation complexity than existing moment based mixture model algorithms (e.g. tensor methods). We also illustrate several natural ways one can obtain such side information, for specific problem instances. Our experiments on real data sets (NY Times, Yelp, BSDS500) further demonstrate the practicality of our algorithms showing significant improvement in runtime and accuracy.

主動學習 · binary · 圖像分割 · 平滑先驗 · 學成 ·

2018 年 1 月 16 日

Geometry in Active Learning for Binary and Multi-class Image Segmentation

Ksenia Konyushkova,Raphael Sznitman,Pascal Fua

from arxiv, Extension of our previous paper arXiv:1508.04955

We propose an Active Learning approach to image segmentation that exploits geometric priors to streamline the annotation process. We demonstrate this for both background-foreground and multi-class segmentation tasks in 2D images and 3D image volumes. Our approach combines geometric smoothness priors in the image space with more traditional uncertainty measures to estimate which pixels or voxels are most in need of annotation. For multi-class settings, we additionally introduce two novel criteria for uncertainty. In the 3D case, we use the resulting uncertainty measure to show the annotator voxels lying on the same planar patch, which makes batch annotation much easier than if they were randomly distributed in the volume. The planar patch is found using a branch-and-bound algorithm that finds a patch with the most informative instances. We evaluate our approach on Electron Microscopy and Magnetic Resonance image volumes, as well as on regular images of horses and faces. We demonstrate a substantial performance increase over state-of-the-art approaches.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

Processing（編程語言）

貝葉斯風險

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<tr id='p12wi'><strong id='p12wi'></strong><small id='p12wi'></small><button id='p12wi'></button><li id='p12wi'><noscript id='p12wi'><big id='p12wi'></big><dt id='p12wi'></dt></noscript></li></tr><ol id='p12wi'><option id='p12wi'><table id='p12wi'><blockquote id='p12wi'><tbody id='p12wi'></tbody></blockquote></table></option></ol><u id='p12wi'></u><kbd id='p12wi'><kbd id='p12wi'></kbd></kbd>

<code id='p12wi'><strong id='p12wi'></strong></code>

<fieldset id='p12wi'></fieldset>

<span id='p12wi'></span>

<ins id='p12wi'></ins>

<acronym id='p12wi'><em id='p12wi'></em><td id='p12wi'><div id='p12wi'></div></td></acronym><address id='p12wi'><big id='p12wi'><big id='p12wi'></big><legend id='p12wi'></legend></big></address>

<i id='p12wi'><div id='p12wi'><ins id='p12wi'></ins></div></i>

<i id='p12wi'></i>