黄色视频在线观看男人插女人的视频在线观看_亚洲AV永久无码精品九之_亚洲午夜在线视频_中国电影三级中文字幕HD_在线曰批视频大全免费_黄色视频你懂的列表_精品国内乱码一区二区三区

In this paper, we study random neural networks which are single-hidden-layer feedforward neural networks whose weights and biases are randomly initialized. After this random initialization, only the linear readout needs to be trained, which can be performed efficiently, e.g., by the least squares method. By viewing random neural networks as Banach space-valued random variables, we prove a universal approximation theorem within a large class of Bochner spaces. Hereby, the corresponding Banach space can be significantly more general than the space of continuous functions over a compact subset of a Euclidean space, namely, e.g., an $L^p$-space or a Sobolev space, where the latter includes the approximation of the derivatives. Moreover, we derive approximation rates and an explicit algorithm to learn a deterministic function by a random neural network. In addition, we provide a full error analysis and study when random neural networks overcome the curse of dimensionality in the sense that the training costs scale at most polynomially in the input and output dimension. Furthermore, we show in two numerical examples the empirical advantages of random neural networks compared to fully trained deterministic neural networks.

相關內容

Neural Networks

關注 1648

神(shen)(shen)經(jing)網(wang)絡(luo)(luo)（Neural Networks）是世界上三(san)個(ge)最古(gu)老的(de)(de)(de)(de)(de)神(shen)(shen)經(jing)建模學(xue)(xue)(xue)會(hui)的(de)(de)(de)(de)(de)檔案期刊:國際神(shen)(shen)經(jing)網(wang)絡(luo)(luo)學(xue)(xue)(xue)會(hui)(INNS)、歐洲神(shen)(shen)經(jing)網(wang)絡(luo)(luo)學(xue)(xue)(xue)會(hui)(ENNS)和(he)(he)(he)日本神(shen)(shen)經(jing)網(wang)絡(luo)(luo)學(xue)(xue)(xue)會(hui)(JNNS)。神(shen)(shen)經(jing)網(wang)絡(luo)(luo)提(ti)供了一(yi)(yi)個(ge)論壇(tan)，以發(fa)(fa)展(zhan)(zhan)和(he)(he)(he)培(pei)育一(yi)(yi)個(ge)國際社(she)會(hui)的(de)(de)(de)(de)(de)學(xue)(xue)(xue)者和(he)(he)(he)實(shi)踐者感(gan)興趣的(de)(de)(de)(de)(de)所(suo)有方(fang)面的(de)(de)(de)(de)(de)神(shen)(shen)經(jing)網(wang)絡(luo)(luo)和(he)(he)(he)相關方(fang)法(fa)的(de)(de)(de)(de)(de)計(ji)算(suan)(suan)(suan)智(zhi)能(neng)。神(shen)(shen)經(jing)網(wang)絡(luo)(luo)歡迎(ying)高質量(liang)論文的(de)(de)(de)(de)(de)提(ti)交，有助于全面的(de)(de)(de)(de)(de)神(shen)(shen)經(jing)網(wang)絡(luo)(luo)研究，從行為和(he)(he)(he)大腦建模，學(xue)(xue)(xue)習算(suan)(suan)(suan)法(fa)，通過數學(xue)(xue)(xue)和(he)(he)(he)計(ji)算(suan)(suan)(suan)分(fen)析(xi)，系(xi)統(tong)的(de)(de)(de)(de)(de)工程(cheng)和(he)(he)(he)技術(shu)應(ying)用(yong)，大量(liang)使(shi)用(yong)神(shen)(shen)經(jing)網(wang)絡(luo)(luo)的(de)(de)(de)(de)(de)概念和(he)(he)(he)技術(shu)。這(zhe)一(yi)(yi)獨特而廣泛的(de)(de)(de)(de)(de)范圍促進(jin)了生(sheng)物和(he)(he)(he)技術(shu)研究之間的(de)(de)(de)(de)(de)思(si)想交流，并有助于促進(jin)對生(sheng)物啟發(fa)(fa)的(de)(de)(de)(de)(de)計(ji)算(suan)(suan)(suan)智(zhi)能(neng)感(gan)興趣的(de)(de)(de)(de)(de)跨學(xue)(xue)(xue)科(ke)(ke)社(she)區的(de)(de)(de)(de)(de)發(fa)(fa)展(zhan)(zhan)。因此，神(shen)(shen)經(jing)網(wang)絡(luo)(luo)編(bian)委會(hui)代表(biao)的(de)(de)(de)(de)(de)專(zhuan)家領(ling)域包括(kuo)心理學(xue)(xue)(xue)，神(shen)(shen)經(jing)生(sheng)物學(xue)(xue)(xue)，計(ji)算(suan)(suan)(suan)機科(ke)(ke)學(xue)(xue)(xue)，工程(cheng)，數學(xue)(xue)(xue)，物理。該雜志發(fa)(fa)表(biao)文章、信(xin)件(jian)和(he)(he)(he)評論以及給編(bian)輯(ji)的(de)(de)(de)(de)(de)信(xin)件(jian)、社(she)論、時事(shi)、軟(ruan)件(jian)調查(cha)和(he)(he)(he)專(zhuan)利信(xin)息。文章發(fa)(fa)表(biao)在五個(ge)部分(fen)之一(yi)(yi):認知科(ke)(ke)學(xue)(xue)(xue)，神(shen)(shen)經(jing)科(ke)(ke)學(xue)(xue)(xue)，學(xue)(xue)(xue)習系(xi)統(tong)，數學(xue)(xue)(xue)和(he)(he)(he)計(ji)算(suan)(suan)(suan)分(fen)析(xi)、工程(cheng)和(he)(he)(he)應(ying)用(yong)。官網(wang)地址：

通用近似器 · Networking · 近似 · 控制器 · Neural Networks ·

2024 年 2 月 9 日

Universal Approximation Power of Deep Residual Neural Networks via Nonlinear Control Theory

Paulo Tabuada,Bahman Gharesifard

from arxiv, Sejun Park and Geonho Hwang brought to our atention a mistake in the proof of Theorem 5.1. This mistake is corrected in this version with the consequence of increasing the number of neurons per layer from n+1 to 2n+1

In this paper, we explain the universal approximation capabilities of deep residual neural networks through geometric nonlinear control. Inspired by recent work establishing links between residual networks and control systems, we provide a general sufficient condition for a residual network to have the power of universal approximation by asking the activation function, or one of its derivatives, to satisfy a quadratic differential equation. Many activation functions used in practice satisfy this assumption, exactly or approximately, and we show this property to be sufficient for an adequately deep neural network with $n+1$ neurons per layer to approximate arbitrarily well, on a compact set and with respect to the supremum norm, any continuous function from $\mathbb{R}^n$ to $\mathbb{R}^n$. We further show this result to hold for very simple architectures for which the weights only need to assume two values. The first key technical contribution consists of relating the universal approximation problem to controllability of an ensemble of control systems corresponding to a residual network and to leverage classical Lie algebraic techniques to characterize controllability. The second technical contribution is to identify monotonicity as the bridge between controllability of finite ensembles and uniform approximability on compact sets.

XAI · Facebook AI Research · 評論員 · 可辨認的 · Seven ·

2024 年 2 月 7 日

A Critical Survey on Fairness Benefits of XAI

Luca Deck,Jakob Schoeffer,Maria De-Arteaga,Niklas Kühl

In this critical survey, we analyze typical claims on the relationship between explainable AI (XAI) and fairness to disentangle the multidimensional relationship between these two concepts. Based on a systematic literature review and a subsequent qualitative content analysis, we identify seven archetypal claims from 175 papers on the alleged fairness benefits of XAI. We present crucial caveats with respect to these claims and provide an entry point for future discussions around the potentials and limitations of XAI for specific fairness desiderata. Importantly, we notice that claims are often (i) vague and simplistic, (ii) lacking normative grounding, or (iii) poorly aligned with the actual capabilities of XAI. We encourage to conceive XAI not as an ethical panacea but as one of many tools to approach the multidimensional, sociotechnical challenge of algorithmic fairness. Moreover, when making a claim about XAI and fairness, we emphasize the need to be more specific about what kind of XAI method is used and which fairness desideratum it refers to, how exactly it enables fairness, and who is the stakeholder that benefits from XAI.

Neural Networks · Networking · 早停 · 卷積神經網絡 · 卷積 ·

2024 年 2 月 7 日

Early Stopping of Untrained Convolutional Neural Networks

Tim Jahn,Bangti Jin

In recent years, new regularization methods based on (deep) neural networks have shown very promising empirical performance for the numerical solution of ill-posed problems, such as in medical imaging and imaging science. Due to the nonlinearity of neural networks, these methods often lack satisfactory theoretical justification. In this work, we rigorously discuss the convergence of a successful unsupervised approach that utilizes untrained convolutional neural networks to represent solutions to linear ill-posed problems. Untrained neural networks have particular appeal for many applications because they do not require paired training data. The regularization property of the approach relies solely on the architecture of the neural network instead. Due to the vast over-parameterization of the employed neural network, suitable early stopping is essential for the success of the method. We establish that the classical discrepancy principle is an adequate method for early stopping of two-layer untrained convolutional neural networks learned by gradient descent, and furthermore, it yields an approximation with minimax optimal convergence rates. Numerical results are also presented to illustrate the theoretical findings.

秩 · 情景 · search engine · Principle · 成比例 ·

2024 年 2 月 7 日

Recency Ranking by Diversification of Result Set

Andrey Styskin,Fedor Romanenko,Fedor Vorobyev,Pavel Serdyukov

In this paper, we propose a web search retrieval approach which automatically detects recency sensitive queries and increases the freshness of the ordinary document ranking by a degree proportional to the probability of the need in recent content. We propose to solve the recency ranking problem by using result diversification principles and deal with the query's non-topical ambiguity appearing when the need in recent content can be detected only with uncertainty. Our offline and online experiments with millions of queries from real search engine users demonstrate the significant increase in satisfaction of users presented with a search result generated by our approach.

Networking · 學成 · Principle · MoDELS · Networks ·

2021 年 6 月 18 日

The Principles of Deep Learning Theory

Daniel A. Roberts,Sho Yaida,Boris Hanin

from arxiv, 451 pages, to be published by Cambridge University Press

This book develops an effective theory approach to understanding deep neural networks of practical relevance. Beginning from a first-principles component-level picture of networks, we explain how to determine an accurate description of the output of trained networks by solving layer-to-layer iteration equations and nonlinear learning dynamics. A main result is that the predictions of networks are described by nearly-Gaussian distributions, with the depth-to-width aspect ratio of the network controlling the deviations from the infinite-width Gaussian description. We explain how these effectively-deep networks learn nontrivial representations from training and more broadly analyze the mechanism of representation learning for nonlinear models. From a nearly-kernel-methods perspective, we find that the dependence of such models' predictions on the underlying learning algorithm can be expressed in a simple and universal way. To obtain these results, we develop the notion of representation group flow (RG flow) to characterize the propagation of signals through the network. By tuning networks to criticality, we give a practical solution to the exploding and vanishing gradient problem. We further explain how RG flow leads to near-universal behavior and lets us categorize networks built from different activation functions into universality classes. Altogether, we show that the depth-to-width ratio governs the effective model complexity of the ensemble of trained networks. By using information-theoretic techniques, we estimate the optimal aspect ratio at which we expect the network to be practically most useful and show how residual connections can be used to push this scale to arbitrary depths. With these tools, we can learn in detail about the inductive bias of architectures, hyperparameters, and optimizers.

圖形處理器 · 圖 · 可辨認的 · Neural Networks · Networking ·

2021 年 5 月 31 日

On Explainability of Graph Neural Networks via Subgraph Explorations

Hao Yuan,Haiyang Yu,Jie Wang,Kang Li,Shuiwang Ji

from arxiv, Accepted by ICML 2021

We consider the problem of explaining the predictions of graph neural networks (GNNs), which otherwise are considered as black boxes. Existing methods invariably focus on explaining the importance of graph nodes or edges but ignore the substructures of graphs, which are more intuitive and human-intelligible. In this work, we propose a novel method, known as SubgraphX, to explain GNNs by identifying important subgraphs. Given a trained GNN model and an input graph, our SubgraphX explains its predictions by efficiently exploring different subgraphs with Monte Carlo tree search. To make the tree search more effective, we propose to use Shapley values as a measure of subgraph importance, which can also capture the interactions among different subgraphs. To expedite computations, we propose efficient approximation schemes to compute Shapley values for graph data. Our work represents the first attempt to explain GNNs via identifying subgraphs explicitly and directly. Experimental results show that our SubgraphX achieves significantly improved explanations, while keeping computations at a reasonable level.

注意力機制 · 注意力模型 · MoDELS · Neural Networks · Taxonomy ·

2020 年 12 月 15 日

An Attentive Survey of Attention Models

Sneha Chaudhari,Varun Mithal,Gungor Polatkan,Rohan Ramanath

from arxiv, submitted to Transactions on Intelligent Systems and Technology(TIST); 20 pages

Attention Model has now become an important concept in neural networks that has been researched within diverse application domains. This survey provides a structured and comprehensive overview of the developments in modeling attention. In particular, we propose a taxonomy which groups existing techniques into coherent categories. We review salient neural architectures in which attention has been incorporated, and discuss applications in which modeling attention has shown a significant impact. Finally, we also describe how attention has been used to improve the interpretability of neural networks. We hope this survey will provide a succinct introduction to attention models and guide practitioners while developing approaches for their applications.

圖 · 學成 · state-of-the-art · GNN · 表示學習 ·

2018 年 6 月 26 日

Hierarchical Graph Representation Learning with Differentiable Pooling

Rex Ying,Jiaxuan You,Christopher Morris,Xiang Ren,William L. Hamilton,Jure Leskovec

Recently, graph neural networks (GNNs) have revolutionized the field of graph representation learning through effectively learned node embeddings, and achieved state-of-the-art results in tasks such as node classification and link prediction. However, current GNN methods are inherently flat and do not learn hierarchical representations of graphs---a limitation that is especially problematic for the task of graph classification, where the goal is to predict the label associated with an entire graph. Here we propose DiffPool, a differentiable graph pooling module that can generate hierarchical representations of graphs and can be combined with various graph neural network architectures in an end-to-end fashion. DiffPool learns a differentiable soft cluster assignment for nodes at each layer of a deep GNN, mapping nodes to a set of clusters, which then form the coarsened input for the next GNN layer. Our experimental results show that combining existing GNN methods with DiffPool yields an average improvement of 5-10% accuracy on graph classification benchmarks, compared to all existing pooling approaches, achieving a new state-of-the-art on four out of five benchmark data sets.

注意力機制 · 機器閱讀理解 · Extensibility · state-of-the-art · MoDELS ·

2018 年 4 月 25 日

Reinforced Mnemonic Reader for Machine Reading Comprehension

Minghao Hu,Yuxing Peng,Zhen Huang,Xipeng Qiu,Furu Wei,Ming Zhou

from arxiv, Published in 26th International Joint Conference on Artificial Intelligence (IJCAI), 2018

In this paper, we introduce the Reinforced Mnemonic Reader for machine reading comprehension tasks, which enhances previous attentive readers in two aspects. First, a reattention mechanism is proposed to refine current attentions by directly accessing to past attentions that are temporally memorized in a multi-round alignment architecture, so as to avoid the problems of attention redundancy and attention deficiency. Second, a new optimization approach, called dynamic-critical reinforcement learning, is introduced to extend the standard supervised method. It always encourages to predict a more acceptable answer so as to address the convergence suppression problem occurred in traditional reinforcement learning algorithms. Extensive experiments on the Stanford Question Answering Dataset (SQuAD) show that our model achieves state-of-the-art results. Meanwhile, our model outperforms previous systems by over 6% in terms of both Exact Match and F1 metrics on two adversarial SQuAD datasets.

卷積神經網絡 · Neural Networks · 知識表示 · Networking · 卷積 ·

2018 年 2 月 14 日

Interpretable Convolutional Neural Networks

Quanshi Zhang,Ying Nian Wu,Song-Chun Zhu

from arxiv, In this version, we release the website of the code. Compared to the previous version, we have corrected all values of location instability in Table 3--6 by dividing the values by sqrt(2), i.e., a=a/sqrt(2). Such revisions do NOT decrease the significance of the superior performance of our method, because we make the same correction to location-instability values of all baselines

This paper proposes a method to modify traditional convolutional neural networks (CNNs) into interpretable CNNs, in order to clarify knowledge representations in high conv-layers of CNNs. In an interpretable CNN, each filter in a high conv-layer represents a certain object part. We do not need any annotations of object parts or textures to supervise the learning process. Instead, the interpretable CNN automatically assigns each filter in a high conv-layer with an object part during the learning process. Our method can be applied to different types of CNNs with different structures. The clear knowledge representation in an interpretable CNN can help people understand the logics inside a CNN, i.e., based on which patterns the CNN makes the decision. Experiments showed that filters in an interpretable CNN were more semantically meaningful than those in traditional CNNs.