亚洲十八禁无码在线免费观看_精品国产一区二区三区日日嗨_永久黄色网站电影免费观看_亚洲中文无码永久免费视频_色欲永久精品无码免费毛片_国产亚洲精久久久久久无码色戒_女同性AV片在线观看免费网站

We consider Bayesian inverse problems wherein the unknown state is assumed to be a function with discontinuous structure a priori. A class of prior distributions based on the output of neural networks with heavy-tailed weights is introduced, motivated by existing results concerning the infinite-width limit of such networks. We show theoretically that samples from such priors have desirable discontinuous-like properties even when the network width is finite, making them appropriate for edge-preserving inversion. Numerically we consider deconvolution problems defined on one- and two-dimensional spatial domains to illustrate the effectiveness of these priors; MAP estimation, dimension-robust MCMC sampling and ensemble-based approximations are utilized to probe the posterior distribution. The accuracy of point estimates is shown to exceed those obtained from non-heavy tailed priors, and uncertainty estimates are shown to provide more useful qualitative information.

相關內容

Neural Networks

關注 1648

神經(jing)網(wang)(wang)絡(luo)（Neural Networks）是世界上(shang)三個(ge)(ge)最(zui)古老的(de)(de)(de)神經(jing)建模學(xue)(xue)會(hui)(hui)(hui)的(de)(de)(de)檔(dang)案期刊:國(guo)際(ji)(ji)神經(jing)網(wang)(wang)絡(luo)學(xue)(xue)會(hui)(hui)(hui)(INNS)、歐(ou)洲(zhou)神經(jing)網(wang)(wang)絡(luo)學(xue)(xue)會(hui)(hui)(hui)(ENNS)和(he)(he)(he)(he)日本神經(jing)網(wang)(wang)絡(luo)學(xue)(xue)會(hui)(hui)(hui)(JNNS)。神經(jing)網(wang)(wang)絡(luo)提供了一個(ge)(ge)論壇，以(yi)發(fa)展和(he)(he)(he)(he)培育一個(ge)(ge)國(guo)際(ji)(ji)社會(hui)(hui)(hui)的(de)(de)(de)學(xue)(xue)者和(he)(he)(he)(he)實踐者感(gan)(gan)興(xing)趣(qu)的(de)(de)(de)所有方面的(de)(de)(de)神經(jing)網(wang)(wang)絡(luo)和(he)(he)(he)(he)相關方法的(de)(de)(de)計(ji)算(suan)智能(neng)。神經(jing)網(wang)(wang)絡(luo)歡迎高質量論文的(de)(de)(de)提交(jiao)，有助于全面的(de)(de)(de)神經(jing)網(wang)(wang)絡(luo)研(yan)究(jiu)，從(cong)行為(wei)和(he)(he)(he)(he)大(da)腦建模，學(xue)(xue)習(xi)算(suan)法，通過數學(xue)(xue)和(he)(he)(he)(he)計(ji)算(suan)分(fen)析，系統的(de)(de)(de)工(gong)程和(he)(he)(he)(he)技術應(ying)用(yong)，大(da)量使用(yong)神經(jing)網(wang)(wang)絡(luo)的(de)(de)(de)概念和(he)(he)(he)(he)技術。這一獨特而廣泛(fan)的(de)(de)(de)范(fan)圍促進了生物和(he)(he)(he)(he)技術研(yan)究(jiu)之間的(de)(de)(de)思想交(jiao)流(liu)，并有助于促進對生物啟發(fa)的(de)(de)(de)計(ji)算(suan)智能(neng)感(gan)(gan)興(xing)趣(qu)的(de)(de)(de)跨(kua)學(xue)(xue)科(ke)(ke)(ke)社區的(de)(de)(de)發(fa)展。因此，神經(jing)網(wang)(wang)絡(luo)編委(wei)會(hui)(hui)(hui)代表的(de)(de)(de)專家領(ling)域包括(kuo)心(xin)理(li)學(xue)(xue)，神經(jing)生物學(xue)(xue)，計(ji)算(suan)機科(ke)(ke)(ke)學(xue)(xue)，工(gong)程，數學(xue)(xue)，物理(li)。該雜志發(fa)表文章、信件和(he)(he)(he)(he)評論以(yi)及給編輯的(de)(de)(de)信件、社論、時(shi)事、軟件調查和(he)(he)(he)(he)專利信息。文章發(fa)表在五個(ge)(ge)部分(fen)之一:認知科(ke)(ke)(ke)學(xue)(xue)，神經(jing)科(ke)(ke)(ke)學(xue)(xue)，學(xue)(xue)習(xi)系統，數學(xue)(xue)和(he)(he)(he)(he)計(ji)算(suan)分(fen)析、工(gong)程和(he)(he)(he)(he)應(ying)用(yong)。官網(wang)(wang)地址：

PDE · Continuity · MoDELS · 噪聲 · 優化器 ·

2022 年 2 月 22 日

Optimal Interpolation Data for PDE-based Compression of Images with Noise

Zakaria Belhachmi,Thomas Jacumin

from arxiv, arXiv admin note: text overlap with arXiv:2011.02363

We introduce and discuss shape-based models for finding the best interpolation data in the compression of images with noise. The aim is to reconstruct missing regions by means of minimizing a data fitting term in the $L^2$-norm between the images and their reconstructed counterparts using time-dependent PDE inpainting. We analyze the proposed models in the framework of the $\Gamma$-convergence from two different points of view. First, we consider a continuous stationary PDE model, obtained by focusing on the first iteration of the discretized time-dependent PDE, and get pointwise information on the "relevance" of each pixel by a topological asymptotic method. Second, we introduce a finite dimensional setting of the continuous model based on "fat pixels" (balls with positive radius), and we study by $\Gamma$-convergence the asymptotics when the radius vanishes. Numerical computations are presented that confirm the usefulness of our theoretical findings for non-stationary PDE-based image compression.

Performer · 推斷 · Extensibility · 馬爾可夫鏈蒙特卡羅 · 重要性采樣 ·

2022 年 2 月 21 日

A Predictive Approach to Bayesian Nonparametric Survival Analysis

Edwin Fong,Brieuc Lehmann

from arxiv, Accepted to AISTATS 2022

Bayesian nonparametric methods are a popular choice for analysing survival data due to their ability to flexibly model the distribution of survival times. These methods typically employ a nonparametric prior on the survival function that is conjugate with respect to right-censored data. Eliciting these priors, particularly in the presence of covariates, can be challenging and inference typically relies on computationally intensive Markov chain Monte Carlo schemes. In this paper, we build on recent work that recasts Bayesian inference as assigning a predictive distribution on the unseen values of a population conditional on the observed samples, thus avoiding the need to specify a complex prior. We describe a copula-based predictive update which admits a scalable sequential importance sampling algorithm to perform inference that properly accounts for right-censoring. We provide theoretical justification through an extension of Doob's consistency theorem and illustrate the method on a number of simulated and real data sets, including an example with covariates. Our approach enables analysts to perform Bayesian nonparametric inference through only the specification of a predictive distribution.

Neural Networks · CASE · 貝葉斯推斷 · Networking · 推斷 ·

2022 年 2 月 21 日

On out-of-distribution detection with Bayesian neural networks

Francesco D'Angelo,Christian Henning

from arxiv, This work is an extension of our previous workshop contribution: arXiv:2107.12248

The question whether inputs are valid for the problem a neural network is trying to solve has sparked interest in out-of-distribution (OOD) detection. It is widely assumed that Bayesian neural networks (BNNs) are well suited for this task, as the endowed epistemic uncertainty should lead to disagreement in predictions on outliers. In this paper, we question this assumption and show that proper Bayesian inference with function space priors induced by neural networks does not necessarily lead to good OOD detection. To circumvent the use of approximate inference, we start by studying the infinite-width case, where Bayesian inference can be exact due to the correspondence with Gaussian processes. Strikingly, the kernels derived from common architectural choices lead to function space priors which induce predictive uncertainties that do not reflect the underlying input data distribution and are therefore unsuited for OOD detection. Importantly, we find the OOD behavior in this limiting case to be consistent with the corresponding finite-width case. To overcome this limitation, useful function space properties can also be encoded in the prior in weight space, however, this can currently only be applied to a specified subset of the domain and thus does not inherently extend to OOD data. Finally, we argue that a trade-off between generalization and OOD capabilities might render the application of BNNs for OOD detection undesirable in practice. Overall, our study discloses fundamental problems when naively using BNNs for OOD detection and opens interesting avenues for future research.

去噪 · Networking · 可約的 · MoDELS · INFORMS ·

2022 年 2 月 20 日

MANet: Improving Video Denoising with a Multi-Alignment Network

Yaping Zhao,Haitian Zheng,Zhongrui Wang,Jiebo Luo,Edmund Y. Lam

from arxiv, 4 pages, 5 figures

In video denoising, the adjacent frames often provide very useful information, but accurate alignment is needed before such information can be harnassed. In this work, we present a multi-alignment network, which generates multiple flow proposals followed by attention-based averaging. It serves to mimics the non-local mechanism, suppressing noise by averaging multiple observations. Our approach can be applied to various state-of-the-art models that are based on flow estimation. Experiments on a large-scale video dataset demonstrate that our method improves the denoising baseline model by 0.2dB, and further reduces the parameters by 47% with model distillation.

估計/估計量 · 統計量 · Neural Networks · Networking · 推斷 ·

2022 年 2 月 18 日

A Free Lunch with Influence Functions? Improving Neural Network Estimates with Concepts from Semiparametric Statistics

Matthew J. Vowels,Sina Akbari,Jalal Etesami,Necati Cihan Camgoz,Richard Bowden

Parameter estimation in the empirical fields is usually undertaken using parametric models, and such models are convenient because they readily facilitate statistical inference. Unfortunately, they are unlikely to have a sufficiently flexible functional form to be able to adequately model real-world phenomena, and their usage may therefore result in biased estimates and invalid inference. Unfortunately, whilst non-parametric machine learning models may provide the needed flexibility to adapt to the complexity of real-world phenomena, they do not readily facilitate statistical inference, and may still exhibit residual bias. We explore the potential for semiparametric theory (in particular, the Influence Function) to be used to improve neural networks and machine learning algorithms in terms of (a) improving initial estimates without needing more data (b) increasing the robustness of our models, and (c) yielding confidence intervals for statistical inference. We propose a new neural network method MultiNet, which seeks the flexibility and diversity of an ensemble using a single architecture. Results on causal inference tasks indicate that MultiNet yields better performance than other approaches, and that all considered methods are amenable to improvement from semiparametric techniques under certain conditions. In other words, with these techniques we show that we can improve existing neural networks for `free', without needing more data, and without needing to retrain them. Finally, we provide the expression for deriving influence functions for estimands from a general graph, and the code to do so automatically.

Weight · 推斷 · 近似 · 貝葉斯推斷 · Performer ·

2021 年 2 月 18 日

Bayesian Deep Learning via Subnetwork Inference

Erik Daxberger,Eric Nalisnick,James Urquhart Allingham,Javier Antorán,José Miguel Hernández-Lobato

from arxiv, 21 pages, extended version with supplementary material

The Bayesian paradigm has the potential to solve core issues of deep neural networks such as poor calibration and data inefficiency. Alas, scaling Bayesian inference to large weight spaces often requires restrictive approximations. In this work, we show that it suffices to perform inference over a small subset of model weights in order to obtain accurate predictive posteriors. The other weights are kept as point estimates. This subnetwork inference framework enables us to use expressive, otherwise intractable, posterior approximations over such subsets. In particular, we implement subnetwork linearized Laplace: We first obtain a MAP estimate of all weights and then infer a full-covariance Gaussian posterior over a subnetwork. We propose a subnetwork selection strategy that aims to maximally preserve the model's predictive uncertainty. Empirically, our approach is effective compared to ensembles and less expressive posterior approximations over full networks.

離散化 · 圖 · 圖形處理器 · Neural Networks · Networking ·

2019 年 3 月 28 日

Learning Discrete Structures for Graph Neural Networks

Luca Franceschi,Mathias Niepert,Massimiliano Pontil,Xiao He

from arxiv, 18 pages

Graph neural networks (GNNs) are a popular class of machine learning models whose major advantage is their ability to incorporate a sparse and discrete dependency structure between data points. Unfortunately, GNNs can only be used when such a graph-structure is available. In practice, however, real-world graphs are often noisy and incomplete or might not be available at all. With this work, we propose to jointly learn the graph structure and the parameters of graph convolutional networks (GCNs) by approximately solving a bilevel program that learns a discrete probability distribution on the edges of the graph. This allows one to apply GCNs not only in scenarios where the given graph is incomplete or corrupted but also in those where a graph is not available. We conduct a series of experiments that analyze the behavior of the proposed method and demonstrate that it outperforms related methods by a significant margin.

Neural Networks · Networking · 卷積 · 卷積神經網絡 · Extensibility ·

2018 年 6 月 27 日

Bayesian Convolutional Neural Networks

Felix Laumann,Kumar Shridhar,Adrian Llopart Maurin

We propose a Bayesian convolutional neural network built upon Bayes by Backprop and elaborate how this known method can serve as the fundamental construct of our novel, reliable variational inference method for convolutional neural networks. First, we show how Bayes by Backprop can be applied to convolutional layers where weights in filters have probability distributions instead of point-estimates; and second, how our proposed framework leads with various network architectures to performances comparable to convolutional neural networks with point-estimates weights. In the past, Bayes by Backprop has been successfully utilised in feedforward and recurrent neural networks, but not in convolutional ones. This work symbolises the extension of the group of Bayesian neural networks which encompasses all three aforementioned types of network architectures now.

磁流變材料 · 變分自編碼 · 學成 · 正則化項 · Neural Networks ·

2018 年 1 月 17 日

MR image reconstruction using deep density priors

Kerem C. Tezcan,Christian F. Baumgartner,Ender Konukoglu

from arxiv, 26 pages, 5 main figures, 7 supplementary figures, 1 table

Purpose: MR image reconstruction exploits regularization to compensate for missing k-space data. In this work, we propose to learn the probability distribution of MR image patches with neural networks and use this distribution as prior information constraining images during reconstruction, effectively employing it as regularization. Methods: We use variational autoencoders (VAE) to learn the distribution of MR image patches, which models the high-dimensional distribution by a latent parameter model of lower dimensions in a non-linear fashion. The proposed algorithm uses the learned prior in a Maximum-A-Posteriori estimation formulation. We evaluate the proposed reconstruction method with T1 weighted images and also apply our method on images with white matter lesions. Results: Visual evaluation of the samples showed that the VAE algorithm can approximate the distribution of MR patches well. The proposed reconstruction algorithm using the VAE prior produced high quality reconstructions. The algorithm achieved normalized RMSE, CNR and CN values of 2.77\%, 0.43, 0.11; 4.29\%, 0.43, 0.11, 6.36\%, 0.47, 0.11 and 10.00\%, 0.42, 0.10 for undersampling ratios of 2, 3, 4 and 5, respectively, where it outperformed most of the alternative methods. In the experiments on images with white matter lesions, the method faithfully reconstructed the lesions. Conclusion: We introduced a novel method for MR reconstruction, which takes a new perspective on regularization by using priors learned by neural networks. Results suggest the method compares favorably against the other evaluated methods and can reconstruct lesions as well. Keywords: Reconstruction, MRI, prior probability, MAP estimation, machine learning, variational inference, deep learning

離散化 · 馬爾可夫鏈蒙特卡羅 · 潛在 · 可交換的 · 話題模型 ·

2018 年 1 月 15 日

Latent nested nonparametric priors

Federico Camerlenghi,David B. Dunson,Antonio Lijoi,Igor Prünster,Abel Rodríguez

Discrete random structures are important tools in Bayesian nonparametrics and the resulting models have proven effective in density estimation, clustering, topic modeling and prediction, among others. In this paper, we consider nested processes and study the dependence structures they induce. Dependence ranges between homogeneity, corresponding to full exchangeability, and maximum heterogeneity, corresponding to (unconditional) independence across samples. The popular nested Dirichlet process is shown to degenerate to the fully exchangeable case when there are ties across samples at the observed or latent level. To overcome this drawback, inherent to nesting general discrete random measures, we introduce a novel class of latent nested processes. These are obtained by adding common and group-specific completely random measures and, then, normalising to yield dependent random probability measures. We provide results on the partition distributions induced by latent nested processes, and develop an Markov Chain Monte Carlo sampler for Bayesian inferences. A test for distributional homogeneity across groups is obtained as a by product. The results and their inferential implications are showcased on synthetic and real data.