
This note addresses the Kolmogorov-Arnold Representation Theorem (KART) and the Universal Approximation Theorem (UAT), focusing on their common misinterpretations in some papers related to neural network approximation. Our remarks aim to support a more accurate understanding of KART and UAT among neural network specialists.
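For reference, KART is usually stated as follows (a standard textbook form, not quoted from the note itself): every continuous function $f$ on the unit cube $[0,1]^n$ can be written exactly as a finite superposition of continuous univariate functions. The symbols $\Phi_q$ (outer functions) and $\phi_{q,p}$ (inner functions) are the conventional names and are assumed here.

```latex
f(x_1, \dots, x_n) \;=\; \sum_{q=0}^{2n} \Phi_q\!\left( \sum_{p=1}^{n} \phi_{q,p}(x_p) \right),
\qquad f \in C\big([0,1]^n\big).
```

In Kolmogorov's construction the inner functions $\phi_{q,p}$ do not depend on $f$, while the outer functions $\Phi_q$ do. KART is an exact representation, whereas UAT is a statement about approximation to arbitrary accuracy.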

Related content

Neural Networks


Neural Networks is the archival journal of the world's three oldest neural modeling societies: the International Neural Network Society (INNS), the European Neural Network Society (ENNS), and the Japanese Neural Network Society (JNNS). The journal provides a forum for developing and nurturing an international community of scholars and practitioners who are interested in all aspects of neural networks and related approaches to computational intelligence. It welcomes high-quality submissions that contribute to the full range of neural networks research, from behavioral and brain modeling and learning algorithms, through mathematical and computational analysis, to engineering and technological applications of systems that make significant use of neural network concepts and techniques. This uniquely broad scope facilitates the exchange of ideas between biological and technological research and helps foster the interdisciplinary community interested in biologically inspired computational intelligence. Accordingly, the editorial board represents expertise in fields including psychology, neurobiology, computer science, engineering, mathematics, and physics. The journal publishes articles, letters, and reviews, as well as letters to the editor, editorials, current events, software surveys, and patent information. Articles are published in one of five sections: Cognitive Science, Neuroscience, Learning Systems, Mathematical and Computational Analysis, and Engineering and Applications. Official website:

Graph · Outlier · Networking · Performer · Learning ·

August 13, 2023

Learning on Graphs with Out-of-Distribution Nodes

Yu Song,Donglin Wang

from arxiv, Accepted by KDD'22

Graph Neural Networks (GNNs) are state-of-the-art models for performing prediction tasks on graphs. While existing GNNs have shown great performance on various tasks related to graphs, little attention has been paid to the scenario where out-of-distribution (OOD) nodes exist in the graph during training and inference. Borrowing the concept from CV and NLP, we define OOD nodes as nodes with labels unseen from the training set. Since a lot of networks are automatically constructed by programs, real-world graphs are often noisy and may contain nodes from unknown distributions. In this work, we define the problem of graph learning with out-of-distribution nodes. Specifically, we aim to accomplish two tasks: 1) detect nodes which do not belong to the known distribution and 2) classify the remaining nodes into one of the known classes. We demonstrate that the connection patterns in graphs are informative for outlier detection, and propose Out-of-Distribution Graph Attention Network (OODGAT), a novel GNN model which explicitly models the interaction between different kinds of nodes and separates inliers from outliers during feature propagation. Extensive experiments show that OODGAT outperforms existing outlier detection methods by a large margin, while being better or comparable in terms of in-distribution classification.

Image classification · Feedforward network · INTERACT · Networking · Feedforward ·

May 7, 2021

ResMLP: Feedforward networks for image classification with data-efficient training

Hugo Touvron,Piotr Bojanowski,Mathilde Caron,Matthieu Cord,Alaaeldin El-Nouby,Edouard Grave,Armand Joulin,Gabriel Synnaeve,Jakob Verbeek,Hervé Jégou

We present ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification. It is a simple residual network that alternates (i) a linear layer in which image patches interact, independently and identically across channels, and (ii) a two-layer feed-forward network in which channels interact independently per patch. When trained with a modern training strategy using heavy data-augmentation and optionally distillation, it attains surprisingly good accuracy/complexity trade-offs on ImageNet. We will share our code based on the Timm library and pre-trained models.
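The two alternating sublayers described in the abstract can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the parameter dictionary, the GELU activation, and the omission of any normalization or scaling layers are all choices made here for brevity.

```python
import numpy as np

def gelu(x):
    # Tanh approximation of GELU; the activation choice is an assumption of this sketch.
    return 0.5 * x * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * x**3)))

def cross_patch(x, w_patch, b_patch):
    # x has shape (num_patches, channels).
    # A single linear map mixes the patches; the same weights are used for every channel.
    return x + (w_patch @ x + b_patch[:, None])

def cross_channel(x, w1, b1, w2, b2):
    # Two-layer feed-forward network applied to each patch (row) independently.
    return x + (gelu(x @ w1 + b1) @ w2 + b2)

def resmlp_block(x, params):
    # One residual block: patch mixing followed by channel mixing.
    x = cross_patch(x, params["w_patch"], params["b_patch"])
    return cross_channel(x, params["w1"], params["b1"], params["w2"], params["b2"])

def init_params(num_patches, channels, hidden, seed=0):
    # Hypothetical initializer, used only to make the sketch runnable.
    rng = np.random.default_rng(seed)
    return {
        "w_patch": rng.normal(scale=0.1, size=(num_patches, num_patches)),
        "b_patch": np.zeros(num_patches),
        "w1": rng.normal(scale=0.1, size=(channels, hidden)),
        "b1": np.zeros(hidden),
        "w2": rng.normal(scale=0.1, size=(hidden, channels)),
        "b2": np.zeros(channels),
    }
```

Because each sublayer adds its output back to its input, the block preserves the `(num_patches, channels)` shape, so blocks of this kind can be stacked.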

Neural Networks · Parse · Networking · Guangdong-Hong Kong-Macao Greater Bay Area Digital Economy Research Institute · Parse tree ·

February 25, 2021

How to represent part-whole hierarchies in a neural network

Geoffrey Hinton

from arxiv, 43 pages, 5 figures

This paper does not describe a working system. Instead, it presents a single idea about representation which allows advances made by several different groups to be combined into an imaginary system called GLOM. The advances include transformers, neural fields, contrastive representation learning, distillation and capsules. GLOM answers the question: How can a neural network with a fixed architecture parse an image into a part-whole hierarchy which has a different structure for each image? The idea is simply to use islands of identical vectors to represent the nodes in the parse tree. If GLOM can be made to work, it should significantly improve the interpretability of the representations produced by transformer-like systems when applied to vision or language.

Facebook · Social Graph · Inversion · tuning · MoDELS ·

June 20, 2020

Embedding-based Retrieval in Facebook Search

Jui-Ting Huang,Ashish Sharma,Shuying Sun,Li Xia,David Zhang,Philip Pronin,Janani Padmanabhan,Giuseppe Ottaviano,Linjun Yang

from arxiv, 9 pages, 3 figures, 3 tables, to be published in KDD '20

Search in social networks such as Facebook poses different challenges than in classical web search: besides the query text, it is important to take into account the searcher's context to provide relevant results. Their social graph is an integral part of this context and is a unique aspect of Facebook search. While embedding-based retrieval (EBR) has been applied in web search engines for years, Facebook search was still mainly based on a Boolean matching model. In this paper, we discuss the techniques for applying EBR to a Facebook Search system. We introduce the unified embedding framework developed to model semantic embeddings for personalized search, and the system to serve embedding-based retrieval in a typical search system based on an inverted index. We discuss various tricks and experiences on end-to-end optimization of the whole system, including ANN parameter tuning and full-stack optimization. Finally, we present our progress on two selected advanced topics about modeling. We evaluated EBR on verticals for Facebook Search with significant metrics gains observed in online A/B experiments. We believe this paper will provide useful insights and experiences to help people develop embedding-based retrieval systems in search engines.
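The core retrieval step behind EBR can be illustrated with a brute-force cosine-similarity search. This is a toy sketch, not the system described in the paper: a production system would use an approximate nearest neighbor (ANN) index rather than an exhaustive scan, and the function name and the 2-D embeddings in the usage note are hypothetical.

```python
import numpy as np

def top_k_by_cosine(query_vec, doc_matrix, k=3):
    # Normalize the query and every document embedding to unit length,
    # so that a dot product equals cosine similarity.
    q = query_vec / np.linalg.norm(query_vec)
    d = doc_matrix / np.linalg.norm(doc_matrix, axis=1, keepdims=True)
    sims = d @ q
    # Indices of the k most similar documents, best first.
    order = np.argsort(-sims)[:k]
    return order, sims[order]
```

For example, with document embeddings `[[1, 0], [0, 1], [1, 1]]` and query `[1, 0.1]`, the documents are ranked in the order 0, 2, 1.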

Generative adversarial networks · Support vector machines ·

October 17, 2019

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

A Jolicoeur-Martineau, I Mitliagkas [Mila] (2019)

BERT · Language representation · state-of-the-art · Comprehensibility · MoDELS ·

May 24, 2019

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin,Ming-Wei Chang,Kenton Lee,Kristina Toutanova

We introduce a new language representation model called BERT, which stands for Bidirectional Encoder Representations from Transformers. Unlike recent language representation models, BERT is designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers. As a result, the pre-trained BERT model can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks, such as question answering and language inference, without substantial task-specific architecture modifications. BERT is conceptually simple and empirically powerful. It obtains new state-of-the-art results on eleven natural language processing tasks, including pushing the GLUE score to 80.5% (7.7% point absolute improvement), MultiNLI accuracy to 86.7% (4.6% absolute improvement), SQuAD v1.1 question answering Test F1 to 93.2 (1.5 point absolute improvement) and SQuAD v2.0 Test F1 to 83.1 (5.1 point absolute improvement).

Knowledge representation · Things · Recommender systems · MoDELS · Edge ·

May 10, 2018

A Unified Knowledge Representation and Context-aware Recommender System in Internet of Things

Yinhao Li,Awa Alqahtani,Ellis Solaiman,Charith Perera,Prem Prakash Jayaraman,Boualem Benatallah,Rajiv Ranjan

Within the rapidly developing Internet of Things (IoT), numerous and diverse physical devices, Edge devices, Cloud infrastructure, and their quality of service requirements (QoS), need to be represented within a unified specification in order to enable rapid IoT application development, monitoring, and dynamic reconfiguration. But heterogeneities among different configuration knowledge representation models pose limitations for acquisition, discovery and curation of configuration knowledge for coordinated IoT applications. This paper proposes a unified data model to represent IoT resource configuration knowledge artifacts. It also proposes IoT-CANE (Context-Aware recommendatioN systEm) to facilitate incremental knowledge acquisition and declarative context driven knowledge recommendation.

Networking · Neural Networks · Convolutional neural networks · Convolution · Network Dissection ·

April 30, 2018

How convolutional neural network see the world - A survey of convolutional neural network visualization methods

Zhuwei Qin,Fuxun Yu,Chenchen Liu,Xiang Chen

from arxiv, 32 pages, 21 figures. Mathematical Foundations of Computing

Convolutional Neural Networks (CNNs) have achieved impressive performance on many computer vision tasks, such as object detection, image recognition, and image retrieval. These achievements stem from the CNNs' outstanding capability to learn input features through deep layers of neurons and an iterative training process. However, these learned features are hard to identify and interpret from a human vision perspective, causing a lack of understanding of the CNNs' internal working mechanism. To improve CNN interpretability, CNN visualization is widely used as a qualitative analysis method, which translates the internal features into visually perceptible patterns. Many CNN visualization works have been proposed in the literature to interpret CNNs from the perspectives of network structure, operation, and semantic concept. In this paper, we aim to provide a comprehensive survey of several representative CNN visualization methods, including Activation Maximization, Network Inversion, Deconvolutional Neural Networks (DeconvNet), and Network Dissection based visualization. These methods are presented in terms of motivations, algorithms, and experiment results. Based on these visualization methods, we also discuss their practical applications to demonstrate the significance of CNN interpretability in areas such as network design, optimization, and security enhancement.

GROUP · INFORMS · Weight · Extensibility · Learned ·

April 18, 2018

Attention-based Group Recommendation

Tran Dang Quang Vinh,Tuan-Anh Nguyen Pham,Gao Cong,Xiao-Li Li

Recommender systems are widely used in big information-based companies such as Google, Twitter, LinkedIn, and Netflix. A recommender system deals with the problem of information overload by filtering important information fragments according to users' preferences. In light of the increasing success of deep learning, recent studies have proved the benefits of using deep learning in various recommendation tasks. However, most proposed techniques only aim to target individuals, which cannot be efficiently applied in group recommendation. In this paper, we propose a deep learning architecture to solve the group recommendation problem. On the one hand, as different individual preferences in a group necessitate preference trade-offs in making group recommendations, it is essential that the recommendation model can discover substitutes among user behaviors. On the other hand, it has been observed that a user as an individual and as a group member behaves differently. To tackle such problems, we propose using an attention mechanism to capture the impact of each user in a group. Specifically, our model automatically learns the influence weight of each user in a group and recommends items to the group based on its members' weighted preferences. We conduct extensive experiments on four datasets. Our model significantly outperforms baseline methods and shows promising results in applying deep learning to the group recommendation problem.
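The idea of recommending to a group from its members' attention-weighted preferences can be sketched as follows. This is an illustration under assumptions, not the paper's model: here the attention logits are supplied as plain numbers, whereas in the paper the influence weights are learned, and the function names and array shapes are hypothetical.

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over a 1-D array.
    z = z - np.max(z)
    e = np.exp(z)
    return e / e.sum()

def group_scores(member_prefs, attention_logits):
    # member_prefs: (num_members, num_items) preference scores per member.
    # attention_logits: (num_members,) unnormalized influence of each member.
    weights = softmax(attention_logits)
    # Group score for each item is the weighted sum of member preferences.
    return weights @ member_prefs, weights
```

With equal logits every member has the same influence and the group score reduces to the plain average of member preferences; a larger logit shifts the group score toward that member's preferences.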

FCN · Fully convolutional network · 3D · Cascade · MoDELS ·

March 20, 2018

An application of cascaded 3D fully convolutional networks for medical image segmentation

Holger R. Roth,Hirohisa Oda,Xiangrong Zhou,Natsuki Shimizu,Ying Yang,Yuichiro Hayashi,Masahiro Oda,Michitaka Fujiwara,Kazunari Misawa,Kensaku Mori

from arxiv, Preprint accepted for publication in Computerized Medical Imaging and Graphics. Substantial extension of arXiv:1704.06382; Corrected references to figure numbers in this version

Recent advances in 3D fully convolutional networks (FCN) have made it feasible to produce dense voxel-wise predictions of volumetric images. In this work, we show that a multi-class 3D FCN trained on manually labeled CT scans of several anatomical structures (ranging from the large organs to thin vessels) can achieve competitive segmentation results, while avoiding the need for handcrafting features or training class-specific models. To this end, we propose a two-stage, coarse-to-fine approach that will first use a 3D FCN to roughly define a candidate region, which will then be used as input to a second 3D FCN. This reduces the number of voxels the second FCN has to classify to ~10% and allows it to focus on more detailed segmentation of the organs and vessels. We utilize training and validation sets consisting of 331 clinical CT images and test our models on a completely unseen data collection acquired at a different hospital that includes 150 CT scans, targeting three anatomical organs (liver, spleen, and pancreas). In challenging organs such as the pancreas, our cascaded approach improves the mean Dice score from 68.5 to 82.2%, achieving the highest reported average score on this dataset. We compare with a 2D FCN method on a separate dataset of 240 CT scans with 18 classes and achieve a significantly higher performance in small organs and vessels. Furthermore, we explore fine-tuning our models to different datasets. Our experiments illustrate the promise and robustness of current 3D FCN based semantic segmentation of medical images, achieving state-of-the-art results. Our code and trained models are available for download: https://github.com/holgerroth/3Dunet_abdomen_cascade.
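The Dice score quoted in the abstract measures the overlap between a predicted and a reference segmentation mask: twice the size of the intersection divided by the sum of the two mask sizes. A minimal sketch for binary masks follows; the function name and the convention of returning 1.0 when both masks are empty are assumptions made here, not taken from the paper.

```python
import numpy as np

def dice_score(pred, target):
    # Treat any nonzero voxel as foreground.
    pred = np.asarray(pred, dtype=bool)
    target = np.asarray(target, dtype=bool)
    denom = pred.sum() + target.sum()
    if denom == 0:
        # Both masks empty: treated as perfect agreement (a convention chosen here).
        return 1.0
    return 2.0 * np.logical_and(pred, target).sum() / denom
```

For example, a prediction `[1, 1, 0, 0]` against a reference `[1, 0, 0, 0]` shares one foreground voxel out of three in total, giving a Dice score of 2/3.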