人人操人人莫人人草_操下面视频在线观看免费欧美_国产精品午夜福利鲁丝片在线_丁香五月激情综合网在线_久久免费精品一区二区_亚洲一区二区无码视频_国产成人精品亚洲日本语言

In this article we consider Bayesian inference associated to deep neural networks (DNNs) and in particular, trace-class neural network (TNN) priors which were proposed by Sell et al. [39]. Such priors were developed as more robust alternatives to classical architectures in the context of inference problems. For this work we develop multilevel Monte Carlo (MLMC) methods for such models. MLMC is a popular variance reduction technique, with particular applications in Bayesian statistics and uncertainty quantification. We show how a particular advanced MLMC method that was introduced in [4] can be applied to Bayesian inference from DNNs and establish mathematically, that the computational cost to achieve a particular mean square error, associated to posterior expectation computation, can be reduced by several orders, versus more conventional techniques. To verify such results we provide numerous numerical experiments on model problems arising in machine learning. These include Bayesian regression, as well as Bayesian classification and reinforcement learning.

相關內容

Neural Networks

關注 1648

神(shen)(shen)(shen)經(jing)(jing)(jing)(jing)網(wang)絡(luo)(luo)（Neural Networks）是世(shi)界(jie)上三個(ge)最古老的(de)(de)(de)(de)(de)神(shen)(shen)(shen)經(jing)(jing)(jing)(jing)建模學(xue)(xue)會(hui)的(de)(de)(de)(de)(de)檔(dang)案期刊:國(guo)(guo)際神(shen)(shen)(shen)經(jing)(jing)(jing)(jing)網(wang)絡(luo)(luo)學(xue)(xue)會(hui)(INNS)、歐洲神(shen)(shen)(shen)經(jing)(jing)(jing)(jing)網(wang)絡(luo)(luo)學(xue)(xue)會(hui)(ENNS)和(he)日本(ben)神(shen)(shen)(shen)經(jing)(jing)(jing)(jing)網(wang)絡(luo)(luo)學(xue)(xue)會(hui)(JNNS)。神(shen)(shen)(shen)經(jing)(jing)(jing)(jing)網(wang)絡(luo)(luo)提供了(le)一(yi)(yi)個(ge)論壇，以(yi)(yi)發展(zhan)(zhan)和(he)培育(yu)一(yi)(yi)個(ge)國(guo)(guo)際社(she)會(hui)的(de)(de)(de)(de)(de)學(xue)(xue)者和(he)實踐(jian)者感(gan)(gan)興趣的(de)(de)(de)(de)(de)所有方(fang)面的(de)(de)(de)(de)(de)神(shen)(shen)(shen)經(jing)(jing)(jing)(jing)網(wang)絡(luo)(luo)和(he)相關方(fang)法的(de)(de)(de)(de)(de)計(ji)(ji)算(suan)(suan)智能。神(shen)(shen)(shen)經(jing)(jing)(jing)(jing)網(wang)絡(luo)(luo)歡(huan)迎高質量論文(wen)的(de)(de)(de)(de)(de)提交，有助于(yu)全(quan)面的(de)(de)(de)(de)(de)神(shen)(shen)(shen)經(jing)(jing)(jing)(jing)網(wang)絡(luo)(luo)研究，從行為和(he)大腦(nao)建模，學(xue)(xue)習算(suan)(suan)法，通過(guo)數學(xue)(xue)和(he)計(ji)(ji)算(suan)(suan)分(fen)析(xi)，系(xi)統的(de)(de)(de)(de)(de)工(gong)程(cheng)和(he)技術(shu)應用，大量使用神(shen)(shen)(shen)經(jing)(jing)(jing)(jing)網(wang)絡(luo)(luo)的(de)(de)(de)(de)(de)概念(nian)和(he)技術(shu)。這一(yi)(yi)獨特(te)而廣泛的(de)(de)(de)(de)(de)范(fan)圍促進了(le)生物和(he)技術(shu)研究之(zhi)間的(de)(de)(de)(de)(de)思想交流(liu)，并有助于(yu)促進對生物啟(qi)發的(de)(de)(de)(de)(de)計(ji)(ji)算(suan)(suan)智能感(gan)(gan)興趣的(de)(de)(de)(de)(de)跨學(xue)(xue)科社(she)區的(de)(de)(de)(de)(de)發展(zhan)(zhan)。因此(ci)，神(shen)(shen)(shen)經(jing)(jing)(jing)(jing)網(wang)絡(luo)(luo)編委會(hui)代表的(de)(de)(de)(de)(de)專家領域包括心理學(xue)(xue)，神(shen)(shen)(shen)經(jing)(jing)(jing)(jing)生物學(xue)(xue)，計(ji)(ji)算(suan)(suan)機科學(xue)(xue)，工(gong)程(cheng)，數學(xue)(xue)，物理。該雜志發表文(wen)章、信(xin)(xin)件和(he)評論以(yi)(yi)及給編輯的(de)(de)(de)(de)(de)信(xin)(xin)件、社(she)論、時事、軟件調查和(he)專利信(xin)(xin)息。文(wen)章發表在(zai)五個(ge)部分(fen)之(zhi)一(yi)(yi):認知科學(xue)(xue)，神(shen)(shen)(shen)經(jing)(jing)(jing)(jing)科學(xue)(xue)，學(xue)(xue)習系(xi)統，數學(xue)(xue)和(he)計(ji)(ji)算(suan)(suan)分(fen)析(xi)、工(gong)程(cheng)和(he)應用。官網(wang)地址：

變分自編碼 · Learning · Processing（編程語言） · 泛函 · MCMC ·

2022 年 9 月 13 日

$π$VAE: a stochastic process prior for Bayesian deep learning with MCMC

Swapnil Mishra,Seth Flaxman,Tresnia Berah,Harrison Zhu,Mikko Pakkanen,Samir Bhatt

Stochastic processes provide a mathematically elegant way model complex data. In theory, they provide flexible priors over function classes that can encode a wide range of interesting assumptions. In practice, however, efficient inference by optimisation or marginalisation is difficult, a problem further exacerbated with big data and high dimensional input spaces. We propose a novel variational autoencoder (VAE) called the prior encoding variational autoencoder ($\pi$VAE). The $\pi$VAE is finitely exchangeable and Kolmogorov consistent, and thus is a continuous stochastic process. We use $\pi$VAE to learn low dimensional embeddings of function classes. We show that our framework can accurately learn expressive function classes such as Gaussian processes, but also properties of functions to enable statistical inference (such as the integral of a log Gaussian process). For popular tasks, such as spatial interpolation, $\pi$VAE achieves state-of-the-art performance both in terms of accuracy and computational efficiency. Perhaps most usefully, we demonstrate that the low dimensional independently distributed latent space representation learnt provides an elegant and scalable means of performing Bayesian inference for stochastic processes within probabilistic programming languages such as Stan.

圖 · 學成 · MoDELS · Extensibility · 深度學習 ·

2022 年 2 月 24 日

Bayesian Deep Learning for Graphs

Federico Errica

from arxiv, PhD Thesis

The adaptive processing of structured data is a long-standing research topic in machine learning that investigates how to automatically learn a mapping from a structured input to outputs of various nature. Recently, there has been an increasing interest in the adaptive processing of graphs, which led to the development of different neural network-based methodologies. In this thesis, we take a different route and develop a Bayesian Deep Learning framework for graph learning. The dissertation begins with a review of the principles over which most of the methods in the field are built, followed by a study on graph classification reproducibility issues. We then proceed to bridge the basic ideas of deep learning for graphs with the Bayesian world, by building our deep architectures in an incremental fashion. This framework allows us to consider graphs with discrete and continuous edge features, producing unsupervised embeddings rich enough to reach the state of the art on several classification tasks. Our approach is also amenable to a Bayesian nonparametric extension that automatizes the choice of almost all model's hyper-parameters. Two real-world applications demonstrate the efficacy of deep learning for graphs. The first concerns the prediction of information-theoretic quantities for molecular simulations with supervised neural models. After that, we exploit our Bayesian models to solve a malware-classification task while being robust to intra-procedural code obfuscation techniques. We conclude the dissertation with an attempt to blend the best of the neural and Bayesian worlds together. The resulting hybrid model is able to predict multimodal distributions conditioned on input graphs, with the consequent ability to model stochasticity and uncertainty better than most works. Overall, we aim to provide a Bayesian perspective into the articulated research field of deep learning for graphs.

MoDELS · 學成 · Networking · 動力系統 · Neural Networks ·

2022 年 2 月 4 日

On Neural Differential Equations

Patrick Kidger

from arxiv, Doctoral thesis, Mathematical Institute, University of Oxford. 231 pages

The conjoining of dynamical systems and deep learning has become a topic of great interest. In particular, neural differential equations (NDEs) demonstrate that neural networks and differential equation are two sides of the same coin. Traditional parameterised differential equations are a special case. Many popular neural network architectures, such as residual networks and recurrent networks, are discretisations. NDEs are suitable for tackling generative problems, dynamical systems, and time series (particularly in physics, finance, ...) and are thus of interest to both modern machine learning and traditional mathematical modelling. NDEs offer high-capacity function approximation, strong priors on model space, the ability to handle irregular data, memory efficiency, and a wealth of available theory on both sides. This doctoral thesis provides an in-depth survey of the field. Topics include: neural ordinary differential equations (e.g. for hybrid neural/mechanistic modelling of physical systems); neural controlled differential equations (e.g. for learning functions of irregular time series); and neural stochastic differential equations (e.g. to produce generative models capable of representing complex stochastic dynamics, or sampling from complex high-dimensional distributions). Further topics include: numerical methods for NDEs (e.g. reversible differential equations solvers, backpropagation through differential equations, Brownian reconstruction); symbolic regression for dynamical systems (e.g. via regularised evolution); and deep implicit models (e.g. deep equilibrium models, differentiable optimisation). We anticipate this thesis will be of interest to anyone interested in the marriage of deep learning with dynamical systems, and hope it will provide a useful reference for the current state of the art.

Microsoft Surface · Neural Networks · Networking · MoDELS · 損失函數（機器學習） ·

2021 年 5 月 28 日

Incorporating prior financial domain knowledge into neural networks for implied volatility surface prediction

Yu Zheng,Yongxin Yang,Bowei Chen

from arxiv, 8 pages, SIGKDD 2021

In this paper we develop a novel neural network model for predicting implied volatility surface. Prior financial domain knowledge is taken into account. A new activation function that incorporates volatility smile is proposed, which is used for the hidden nodes that process the underlying asset price. In addition, financial conditions, such as the absence of arbitrage, the boundaries and the asymptotic slope, are embedded into the loss function. This is one of the very first studies which discuss a methodological framework that incorporates prior financial domain knowledge into neural network architecture design and model training. The proposed model outperforms the benchmarked models with the option data on the S&P 500 index over 20 years. More importantly, the domain knowledge is satisfied empirically, showing the model is consistent with the existing financial theories and conditions related to implied volatility surface.

Weight · 推斷 · 近似 · 貝葉斯推斷 · Performer ·

2021 年 2 月 18 日

Bayesian Deep Learning via Subnetwork Inference

Erik Daxberger,Eric Nalisnick,James Urquhart Allingham,Javier Antorán,José Miguel Hernández-Lobato

from arxiv, 21 pages, extended version with supplementary material

The Bayesian paradigm has the potential to solve core issues of deep neural networks such as poor calibration and data inefficiency. Alas, scaling Bayesian inference to large weight spaces often requires restrictive approximations. In this work, we show that it suffices to perform inference over a small subset of model weights in order to obtain accurate predictive posteriors. The other weights are kept as point estimates. This subnetwork inference framework enables us to use expressive, otherwise intractable, posterior approximations over such subsets. In particular, we implement subnetwork linearized Laplace: We first obtain a MAP estimate of all weights and then infer a full-covariance Gaussian posterior over a subnetwork. We propose a subnetwork selection strategy that aims to maximally preserve the model's predictive uncertainty. Empirically, our approach is effective compared to ensembles and less expressive posterior approximations over full networks.

學成 · 推斷 · 深度學習 · MoDELS · Performer ·

2020 年 7 月 2 日

A Survey on Bayesian Deep Learning

Hao Wang,Dit-Yan Yeung

from arxiv, To appear in ACM Computing Surveys (CSUR) 2020

A comprehensive artificial intelligence system needs to not only perceive the environment with different `senses' (e.g., seeing and hearing) but also infer the world's conditional (or even causal) relations and corresponding uncertainty. The past decade has seen major advances in many perception tasks such as visual object recognition and speech recognition using deep learning models. For higher-level inference, however, probabilistic graphical models with their Bayesian nature are still more powerful and flexible. In recent years, Bayesian deep learning has emerged as a unified probabilistic framework to tightly integrate deep learning and Bayesian models. In this general framework, the perception of text or images using deep learning can boost the performance of higher-level inference and in turn, the feedback from the inference process is able to enhance the perception of text or images. This survey provides a comprehensive introduction to Bayesian deep learning and reviews its recent applications on recommender systems, topic models, control, etc. Besides, we also discuss the relationship and differences between Bayesian deep learning and other related topics such as Bayesian treatment of neural networks.

Performer · Neural Networks · 圖 · Networking · INFORMS ·

2020 年 6 月 19 日

Subgraph Neural Networks

Emily Alsentzer,Samuel G. Finlayson,Michelle M. Li,Marinka Zitnik

from arxiv, E.A. and S.G.F. contributed equally

Deep learning methods for graphs achieve remarkable performance on many node-level and graph-level prediction tasks. However, despite the proliferation of the methods and their success, prevailing Graph Neural Networks (GNNs) neglect subgraphs, rendering subgraph prediction tasks challenging to tackle in many impactful applications. Further, subgraph prediction tasks present several unique challenges, because subgraphs can have non-trivial internal topology, but also carry a notion of position and external connectivity information relative to the underlying graph in which they exist. Here, we introduce SUB-GNN, a subgraph neural network to learn disentangled subgraph representations. In particular, we propose a novel subgraph routing mechanism that propagates neural messages between the subgraph's components and randomly sampled anchor patches from the underlying graph, yielding highly accurate subgraph representations. SUB-GNN specifies three channels, each designed to capture a distinct aspect of subgraph structure, and we provide empirical evidence that the channels encode their intended properties. We design a series of new synthetic and real-world subgraph datasets. Empirical results for subgraph classification on eight datasets show that SUB-GNN achieves considerable performance gains, outperforming strong baseline methods, including node-level and graph-level GNNs, by 12.4% over the strongest baseline. SUB-GNN performs exceptionally well on challenging biomedical datasets when subgraphs have complex topology and even comprise multiple disconnected components.

簇 · 圖 · SC · 圖形處理器 · 匯聚 ·

2020 年 6 月 3 日

Spectral Clustering with Graph Neural Networks for Graph Pooling

Filippo Maria Bianchi,Daniele Grattarola,Cesare Alippi

Spectral clustering (SC) is a popular clustering technique to find strongly connected communities on a graph. SC can be used in Graph Neural Networks (GNNs) to implement pooling operations that aggregate nodes belonging to the same cluster. However, the eigendecomposition of the Laplacian is expensive and, since clustering results are graph-specific, pooling methods based on SC must perform a new optimization for each new sample. In this paper, we propose a graph clustering approach that addresses these limitations of SC. We formulate a continuous relaxation of the normalized minCUT problem and train a GNN to compute cluster assignments that minimize this objective. Our GNN-based implementation is differentiable, does not require to compute the spectral decomposition, and learns a clustering function that can be quickly evaluated on out-of-sample graphs. From the proposed clustering method, we design a graph pooling operator that overcomes some important limitations of state-of-the-art graph pooling techniques and achieves the best performance in several supervised and unsupervised tasks.

圖 · 學成 · Neural Networks · 深度學習 · Networking ·

2018 年 12 月 11 日

Deep Learning on Graphs: A Survey

Ziwei Zhang,Peng Cui,Wenwu Zhu

from arxiv, 15 pages, 10 figures

Deep learning has been shown successful in a number of domains, ranging from acoustics, images to natural language processing. However, applying deep learning to the ubiquitous graph data is non-trivial because of the unique characteristics of graphs. Recently, a significant amount of research efforts have been devoted to this area, greatly advancing graph analyzing techniques. In this survey, we comprehensively review different kinds of deep learning methods applied to graphs. We divide existing methods into three main categories: semi-supervised methods including Graph Neural Networks and Graph Convolutional Networks, unsupervised methods including Graph Autoencoders, and recent advancements including Graph Recurrent Neural Networks and Graph Reinforcement Learning. We then provide a comprehensive overview of these methods in a systematic manner following their history of developments. We also analyze the differences of these methods and how to composite different architectures. Finally, we briefly outline their applications and discuss potential future directions.

Neural Networks · Networking · 卷積 · 卷積神經網絡 · Extensibility ·

2018 年 6 月 27 日

Bayesian Convolutional Neural Networks

Felix Laumann,Kumar Shridhar,Adrian Llopart Maurin

We propose a Bayesian convolutional neural network built upon Bayes by Backprop and elaborate how this known method can serve as the fundamental construct of our novel, reliable variational inference method for convolutional neural networks. First, we show how Bayes by Backprop can be applied to convolutional layers where weights in filters have probability distributions instead of point-estimates; and second, how our proposed framework leads with various network architectures to performances comparable to convolutional neural networks with point-estimates weights. In the past, Bayes by Backprop has been successfully utilised in feedforward and recurrent neural networks, but not in convolutional ones. This work symbolises the extension of the group of Bayesian neural networks which encompasses all three aforementioned types of network architectures now.