Talk Title: Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion
Abstract: We study dynamic risk-sensitive portfolio allocation in a regime-switching credit market with default contagion. The state space of the Markovian regime-switching process is assumed to be a countably infinite set. To characterize the value function of the risk-sensitive stochastic control problem, we investigate the corresponding recursive infinite-dimensional nonlinear dynamic programming equations (DPEs) based on default states. We proceed in two stages. First, applying the theory of monotone dynamical systems, we establish the existence and uniqueness of classical solutions to the recursive DPEs by a truncation argument in the finite state space, and we characterize the associated optimal feedback strategy by developing a rigorous verification theorem. Second, building on the first-stage results, we construct a sequence of approximating risk-sensitive control problems with finite state spaces and prove that the resulting smooth value functions converge to the classical solution of the system of DPEs. The construction and approximation of the optimal feedback strategy for the original problem are discussed in detail. Numerical results are also presented to illustrate our analytical conclusions. Joint work with Lijun Bo (USTC) and Huafu Liao (USTC).
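The abstract does not display the optimization criterion itself; for orientation, a standard risk-sensitive criterion of the exponential-of-integral type (the form below is an assumption, following the common Bielecki–Pliska setup rather than the authors' exact model) is

```latex
J_\theta(\pi) \;=\; -\frac{2}{\theta}\,\log \mathbb{E}\!\left[\exp\!\left(-\frac{\theta}{2}\,\log X_T^{\pi}\right)\right],
```

whose small-$\theta$ expansion $J_\theta(\pi) \approx \mathbb{E}[\log X_T^{\pi}] - \frac{\theta}{4}\,\mathrm{Var}(\log X_T^{\pi})$ makes the risk sensitivity explicit: the parameter $\theta$ penalizes the variance of log-wealth.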
Speaker Bio: Dr. Xiang Yu is currently an Assistant Professor in the Department of Applied Mathematics at The Hong Kong Polytechnic University. He received his bachelor's degree in Information and Computational Science from the Department of Mathematics at Huazhong University of Science and Technology in 2007, and his Ph.D. in Mathematics from the University of Texas at Austin in May 2012. His research has appeared in Annals of Applied Probability, Mathematical Finance, SIAM Journal on Control and Optimization, Mathematics
Talk Title: Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis
Abstract: Aspect-based sentiment analysis aims to determine the sentiment polarity towards a specific aspect in reviews or comments. Recent attempts mostly adopt attention-based mechanisms to link opinion words to their respective aspects in an implicit way. However, when multiple aspects or opinion words occur in one sentence, these models often mix up the linkages. In this paper, we propose to encode sentence syntax explicitly to improve the linkages. We define an aspect-oriented dependency tree structure, reshaped and pruned from an ordinary parse tree, to express useful syntax information. The new tree is then encoded into a multifaceted syntax network, to be used in combination with attention-based models for prediction. Experimental results on three datasets from SemEval 2014 and Twitter show that, with our syntax network, the aspect-sentiment linkages can be better established and the attention-based models are substantially improved as a result.
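A minimal sketch of the reshaping idea (the exact reshaping and pruning rules are the paper's; the toy parse and the re-rooting procedure below are our illustrative assumptions): treat the dependency parse as an undirected tree and re-root it at the aspect word, so that distances to opinion words are measured from the aspect.

```python
from collections import deque

# Ordinary dependency parse of "The food was great but service was slow",
# given as child -> head (the head of the root is None). Toy example.
heads = {"food": "was1", "The": "food", "was1": None, "great": "was1",
         "but": "slow", "service": "slow", "was2": "slow", "slow": "was1"}

def reroot(heads, new_root):
    # Build an undirected adjacency list, then BFS from the new root,
    # recording each word's depth in the aspect-oriented tree.
    adj = {w: set() for w in heads}
    for child, head in heads.items():
        if head is not None:
            adj[child].add(head)
            adj[head].add(child)
    depth, seen, q = {new_root: 0}, {new_root}, deque([new_root])
    while q:
        u = q.popleft()
        for v in adj[u]:
            if v not in seen:
                seen.add(v)
                depth[v] = depth[u] + 1
                q.append(v)
    return depth

depth = reroot(heads, "food")
# Rooted at the aspect "food", the opinion word "great" sits two hops away.
print(depth["great"])
```

The re-rooted tree makes "how far is this word from the aspect" a well-defined notion, which is the kind of syntax signal the attention models can then consume.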
Speaker Bio: Xiaojun Quan is a Professor and doctoral supervisor. He has conducted research on natural language processing, text mining, and machine learning at the Department of Computer Science of the University of Science and Technology of China, the Department of Computer Science of City University of Hong Kong, the Business School of Rutgers University, the Department of Computer Science of Purdue University, the Department of Linguistics and Translation of City University of Hong Kong, and the Institute for Infocomm Research of Singapore's Agency for Science, Technology and Research, publishing more than 30 papers in renowned international journals and conferences such as IEEE T-PAMI, ACM TOIS, ACL, IJCAI, and SIGIR. He received his Ph.D. from City University of Hong Kong in 2012. Before returning to China, he was a Research Scientist at the Institute for Infocomm Research, where, in addition to basic research in these areas, he collaborated closely with industry to explore applications of his research results.
Proximal Policy Optimization (PPO) is a highly popular model-free reinforcement learning (RL) approach. However, with continuous state and action spaces and a Gaussian policy -- common in computer animation and robotics -- PPO is prone to getting stuck in local optima. In this paper, we observe a tendency of PPO to prematurely shrink the exploration variance, which naturally leads to slow progress. Motivated by this, we borrow ideas from CMA-ES, a black-box optimization method designed for intelligent adaptive Gaussian exploration, to derive PPO-CMA, a novel proximal policy optimization approach that can expand the exploration variance on objective function slopes and shrink the variance when close to the optimum. This is implemented by using separate neural networks for policy mean and variance and training the mean and variance in separate passes. Our experiments demonstrate a clear improvement over vanilla PPO in many difficult OpenAI Gym MuJoCo tasks.
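The variance-adaptation idea can be seen in isolation in a tiny derivative-free sketch (this is a plain CEM/CMA-style update on a 1D toy objective, not the paper's neural-network PPO-CMA; sample sizes and the elite fraction are illustrative choices): the variance is updated around the *old* mean, so elite samples far out on a slope inflate it, while samples clustered at an optimum shrink it.

```python
import numpy as np

def objective(x):
    # Toy objective to maximize: a concave quadratic with optimum at x = 3.
    return -(x - 3.0) ** 2

rng = np.random.default_rng(0)
mean, var = 0.0, 4.0               # Gaussian exploration "policy" N(mean, var)
n_samples, elite_frac = 64, 0.5

for _ in range(60):
    x = mean + np.sqrt(var) * rng.standard_normal(n_samples)
    f = objective(x)
    # Rank-based selection: keep the best half, as in CMA-ES recombination.
    elite = x[np.argsort(f)[-int(elite_frac * n_samples):]]
    # Pass 1: update the variance around the OLD mean; elite deviations
    # from the old mean keep the variance large on objective slopes.
    var = np.mean((elite - mean) ** 2)
    # Pass 2: update the mean toward the elite samples.
    mean = elite.mean()

print(mean, var)   # mean approaches 3; variance collapses near the optimum
```

Training the mean and variance in separate passes, as the abstract describes, mirrors this two-step structure.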
Developing classification algorithms that are fair with respect to sensitive attributes of the data has become an important problem due to the growing deployment of classification algorithms in various social contexts. Several recent works have focused on fairness with respect to a specific metric, modeled the corresponding fair classification problem as a constrained optimization problem, and developed tailored algorithms to solve it. Despite this, there still remain important metrics for which we do not have fair classifiers, and many of the aforementioned algorithms do not come with theoretical guarantees, perhaps because the resulting optimization problem is non-convex. The main contribution of this paper is a new meta-algorithm for classification that takes as input a large class of fairness constraints, with respect to multiple non-disjoint sensitive attributes, and which comes with provable guarantees. This is achieved by first developing a meta-algorithm for a large family of classification problems with convex constraints, and then showing that classification problems with general types of fairness constraints can be reduced to those in this family. We present empirical results that show that our algorithm can achieve near-perfect fairness with respect to various fairness metrics, and that the loss in accuracy due to the imposed fairness constraints is often small. Overall, this work unifies several prior works on fair classification, presents a practical algorithm with theoretical guarantees, and can handle fairness metrics that were previously not possible.
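The core reduction -- fairness as a convex constraint on the classifier -- can be illustrated on a toy problem (this is a standard covariance-based convex surrogate for statistical parity handled as a penalty, not the paper's meta-algorithm; the synthetic data and all hyperparameters are our assumptions):

```python
import numpy as np

rng = np.random.default_rng(1)
n = 2000
s = rng.integers(0, 2, n)                        # sensitive attribute
x = rng.standard_normal((n, 2)) + s[:, None]     # features correlated with s
y = (x[:, 0] + 0.3 * rng.standard_normal(n) > 0.5).astype(float)
X = np.hstack([x, np.ones((n, 1))])              # add an intercept column

def train(lam):
    # Logistic regression; lam weighs a squared-covariance fairness penalty.
    w = np.zeros(3)
    for _ in range(500):
        p = 1.0 / (1.0 + np.exp(-X @ w))
        grad = X.T @ (p - y) / n                 # logistic-loss gradient
        # Gradient of the convex surrogate cov(s, Xw)^2.
        cov = np.mean((s - s.mean()) * (X @ w))
        grad += lam * 2.0 * cov * X.T @ (s - s.mean()) / n
        w -= 0.5 * grad
    return w

def parity_gap(w):
    # Statistical-parity gap: difference in positive-prediction rates.
    pred = X @ w > 0
    return abs(pred[s == 1].mean() - pred[s == 0].mean())

gap_unfair = parity_gap(train(0.0))
gap_fair = parity_gap(train(10.0))
print(gap_unfair, gap_fair)    # the penalty shrinks the parity gap
```

The paper's contribution is precisely to handle a much larger family of such constraints, for multiple non-disjoint attributes, with provable guarantees rather than a heuristic penalty.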
We study the problem of learning a latent variable model from a stream of data. Latent variable models are popular in practice because they can explain observed data in terms of unobserved concepts. These models have been traditionally studied in the offline setting. In the online setting, on the other hand, the online EM is arguably the most popular algorithm for learning latent variable models. Although the online EM is computationally efficient, it typically converges to a local optimum. In this work, we develop a new online learning algorithm for latent variable models, which we call SpectralLeader. SpectralLeader always converges to the global optimum, and we derive a sublinear upper bound on its $n$-step regret in the bag-of-words model. In both synthetic and real-world experiments, we show that SpectralLeader performs similarly to or better than the online EM with tuned hyper-parameters.
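For context, the baseline the abstract compares against -- online (stepwise) EM -- can be sketched on a toy latent variable model (a two-component Gaussian mixture with unit variances and equal weights; this illustrative setup is ours, not the paper's SpectralLeader algorithm or its bag-of-words model):

```python
import numpy as np

rng = np.random.default_rng(4)
true_means = np.array([-2.0, 2.0])
mu = np.array([-1.0, 1.0])        # distinct initial mean estimates
s0 = np.array([1.0, 1.0])         # running responsibility mass
s1 = s0 * mu                      # running responsibility-weighted sums

for t in range(1, 20001):
    x = rng.normal(true_means[rng.integers(2)])    # one sample from the stream
    # E-step for this single sample: posterior component responsibilities.
    logw = -0.5 * (x - mu) ** 2
    r = np.exp(logw - logw.max())
    r /= r.sum()
    # Stepwise-EM update of sufficient statistics with a decaying step size.
    eta = (t + 2.0) ** -0.6
    s0 = (1 - eta) * s0 + eta * r
    s1 = (1 - eta) * s1 + eta * r * x
    mu = s1 / s0                                   # M-step
print(np.sort(mu))    # estimates approach the true means -2 and 2
```

On this easy, well-separated instance online EM finds the global optimum; the paper's point is that it need not in general, whereas SpectralLeader always converges to the global optimum with a sublinear regret bound.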
Many problems in signal processing reduce to nonparametric function estimation. We propose a new methodology, piecewise convex fitting (PCF), and give a two-stage adaptive estimate. In the first stage, the number and locations of the change points are estimated using strong smoothing. In the second stage, a constrained smoothing spline fit is performed with the smoothing level chosen to minimize the MSE. The imposed constraint is that a single change point occurs in a region about each empirical change point of the first-stage estimate. This constraint is equivalent to requiring that the third derivative of the second-stage estimate has a single sign in a small neighborhood about each first-stage change point. We sketch how PCF may be applied to signal recovery, instantaneous frequency estimation, surface reconstruction, image segmentation, spectral estimation and multivariate adaptive regression.
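A toy sketch of the first stage (the details below -- kernel width, difference stride, the test signal -- are our illustrative assumptions, not the paper's estimator): smooth strongly, then locate convexity change points as sign changes of a coarse second difference.

```python
import numpy as np

rng = np.random.default_rng(2)
t = np.linspace(0.0, 1.0, 400)
# Convex on [0, 0.5), concave on [0.5, 1]: one convexity change point.
signal = np.where(t < 0.5, (t - 0.25) ** 2, 0.125 - (t - 0.75) ** 2)
noisy = signal + 0.0005 * rng.standard_normal(t.size)

# Stage 1: strong smoothing with a moving-average kernel.
k = 41
smooth = np.convolve(noisy, np.ones(k) / k, mode="same")

# Coarse second difference with stride m to suppress residual noise.
m = 20
d2 = smooth[2 * m:] - 2.0 * smooth[m:-m] + smooth[:-2 * m]

# Look for sign changes away from the convolution's boundary artifacts.
lo, hi = k // 2, d2.size - k // 2
idx = np.flatnonzero(np.diff(np.sign(d2[lo:hi])) != 0)
change_ts = t[idx + lo + m]    # map back to positions on the t grid
print(change_ts)               # detections cluster near t = 0.5
```

The second stage would then refit with a constrained smoothing spline, forcing a single sign of the third derivative in a neighborhood of each detected point.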
This is a full-length article (draft version) discussing the problem of choosing the number of topics in topic modeling. We propose that Renyi and Tsallis entropy can be used to identify the optimal number of topics in large textual collections. We also report numerical experiments on semantic stability for four topic models, which show that semantic stability plays a very important role in the topic-number problem. The calculation of Renyi and Tsallis entropy is based on a thermodynamic approach.
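For reference, the two entropies named above in their generic order-q form for a discrete distribution (these are the textbook definitions, not the article's specific thermodynamic topic-model statistic):

```python
import numpy as np

def renyi_entropy(p, q):
    # Renyi entropy of order q; the limit q -> 1 is Shannon entropy.
    p = np.asarray(p, dtype=float)
    p = p[p > 0]
    if q == 1.0:
        return -np.sum(p * np.log(p))
    return np.log(np.sum(p ** q)) / (1.0 - q)

def tsallis_entropy(p, q):
    # Tsallis entropy of order q; same q -> 1 limit.
    p = np.asarray(p, dtype=float)
    p = p[p > 0]
    if q == 1.0:
        return -np.sum(p * np.log(p))
    return (1.0 - np.sum(p ** q)) / (q - 1.0)

uniform = np.ones(8) / 8
peaked = np.array([0.93] + [0.01] * 7)
print(renyi_entropy(uniform, 2.0))   # ln 8 for the uniform distribution
print(renyi_entropy(peaked, 2.0))    # much smaller for a peaked one
```

Both entropies drop as the distribution concentrates, which is what makes them candidate diagnostics for over- or under-fitting the number of topics.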
This paper presents a safety-aware learning framework that employs an adaptive model learning method together with barrier certificates for systems with possibly nonstationary agent dynamics. To extract the dynamic structure of the model, we use a sparse optimization technique, and the resulting model will be used in combination with control barrier certificates which constrain feedback controllers only when safety is about to be violated. Under some mild assumptions, solutions to the constrained feedback-controller optimization are guaranteed to be globally optimal, and the monotonic improvement of a feedback controller is thus ensured. In addition, we reformulate the (action-)value function approximation to make any kernel-based nonlinear function estimation method applicable. We then employ a state-of-the-art kernel adaptive filtering technique for the (action-)value function approximation. The resulting framework is verified experimentally on a brushbot, whose dynamics is unknown and highly complex.
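The "constrain the controller only when safety is about to be violated" mechanism can be shown in its simplest form (a 1D single integrator with a hand-written barrier, our assumed toy setup rather than the paper's learned-dynamics framework): the QP min_u (u - u_nom)^2 subject to hdot(x, u) >= -gamma * h(x), with safe set h(x) = x_max - x >= 0 and dynamics xdot = u, has the closed form below.

```python
def cbf_filter(u_nom, x, x_max=1.0, gamma=5.0):
    # hdot = -u, so the constraint -u >= -gamma*h is just u <= gamma*h:
    # the nominal control is altered only near the safety boundary.
    h = x_max - x
    return min(u_nom, gamma * h)

# Simulate: the nominal controller pushes toward x = 2, outside the safe set.
x, dt = 0.0, 0.01
for _ in range(1000):
    u = cbf_filter(2.0 * (2.0 - x), x)
    x += dt * u
print(x)    # the state settles at the boundary x = 1 without crossing it
```

In the paper this constraint is imposed on a feedback controller for learned, possibly nonstationary dynamics, but the filtering structure is the same.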
Scientific publications have evolved several features for mitigating vocabulary mismatch when indexing, retrieving, and computing similarity between articles. These mitigation strategies range from simply focusing on high-value article sections, such as titles and abstracts, to assigning keywords, often from controlled vocabularies, either manually or through automatic annotation. Various document representation schemes possess different cost-benefit tradeoffs. In this paper, we propose to model different representations of the same article as translations of each other, all generated from a common latent representation in a multilingual topic model. We start with a methodological overview on latent variable models for parallel document representations that could be used across many information science tasks. We then show how solving the inference problem of mapping diverse representations into a shared topic space allows us to evaluate representations based on how topically similar they are to the original article. In addition, our proposed approach provides means to discover where different concept vocabularies require improvement.
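The evaluation idea -- infer topic mixtures for different representations of the same article and compare them -- can be sketched with a fixed topic-word matrix and simple EM fold-in (the tiny vocabulary, topic matrix, and documents below are our illustrative assumptions, not the paper's multilingual topic model):

```python
import numpy as np

# Vocabulary: word ids 0-2 are "neural" words, 3-5 are "chemistry" words.
phi = np.array([[0.30, 0.30, 0.30, 0.03, 0.04, 0.03],   # topic 0
                [0.03, 0.04, 0.03, 0.30, 0.30, 0.30]])  # topic 1

def infer_theta(word_ids, phi, iters=50):
    # EM fold-in: infer a document's topic mixture with phi held fixed.
    k = phi.shape[0]
    theta = np.full(k, 1.0 / k)
    for _ in range(iters):
        resp = theta[:, None] * phi[:, word_ids]     # k x n_words
        resp /= resp.sum(axis=0, keepdims=True)
        theta = resp.sum(axis=1)
        theta /= theta.sum()
    return theta

abstract = [0, 1, 2, 0, 1, 2, 3]   # mostly "neural" words
title = [0, 2]                     # a shorter view of the same article
theta_a = infer_theta(abstract, phi)
theta_t = infer_theta(title, phi)
sim = theta_a @ theta_t / (np.linalg.norm(theta_a) * np.linalg.norm(theta_t))
print(sim)    # high topical similarity between the two representations
```

Scoring each representation by such similarity to the full article is exactly the kind of comparison the shared topic space makes possible.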
We develop an approach to risk minimization and stochastic optimization that provides a convex surrogate for variance, allowing near-optimal and computationally efficient trading between approximation and estimation error. Our approach builds off of techniques for distributionally robust optimization and Owen's empirical likelihood, and we provide a number of finite-sample and asymptotic results characterizing the theoretical performance of the estimator. In particular, we show that our procedure comes with certificates of optimality, achieving (in some scenarios) faster rates of convergence than empirical risk minimization by virtue of automatically balancing bias and variance. We give corroborating empirical evidence showing that in practice, the estimator indeed trades between variance and absolute performance on a training sample, improving out-of-sample (test) performance over standard empirical risk minimization for a number of classification problems.
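A key identity behind this line of work can be checked numerically (under the assumption that the maximizing weights stay nonnegative; the loss sample and radius below are synthetic): the worst-case empirical loss over a chi-square-divergence ball equals the empirical mean plus a standard-deviation penalty, i.e. a convex surrogate for variance.

```python
import numpy as np

rng = np.random.default_rng(3)
losses = rng.standard_normal(200) * 0.1 + 1.0   # per-example losses
n, rho = losses.size, 0.005

mean, var = losses.mean(), losses.var()
# Interior maximizer p_i = 1/n + c*(l_i - mean) of sum_i p_i * l_i over the
# ball (1/(2n)) * sum_i (n*p_i - 1)^2 <= rho, with c tightening the ball.
c = np.sqrt(2.0 * rho) / (n * np.sqrt(var))
p = 1.0 / n + c * (losses - mean)
robust = p @ losses

print(robust - (mean + np.sqrt(2.0 * rho * var)))   # identity: difference is ~0
```

So optimizing the robust objective implicitly trades mean loss against its variance, which is the balance the abstract describes.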