国产精品亚洲综合久久_亚洲专区中文字幕专区_最近中文字幕无码版免费视频_色翁荡息又大又硬又粗又爽电影_国产自愉自愉免费精品七区_黄色小视频免费看_99久久精品欧美日韩精品

A longstanding question in cognitive science concerns the learning mechanisms underlying compositionality in human cognition. Humans can infer the structured relationships (e.g., grammatical rules) implicit in their sensory observations (e.g., auditory speech), and use this knowledge to guide the composition of simpler meanings into complex wholes. Recent progress in artificial neural networks has shown that when large models are trained on enough linguistic data, grammatical structure emerges in their representations. We extend this work to the domain of mathematical reasoning, where it is possible to formulate precise hypotheses about how meanings (e.g., the quantities corresponding to numerals) should be composed according to structured rules (e.g., order of operations). Our work shows that neural networks are not only able to infer something about the structured relationships implicit in their training data, but can also deploy this knowledge to guide the composition of individual meanings into composite wholes.

相關內容

Neural Networks

關注 1648

神(shen)(shen)(shen)(shen)(shen)經網(wang)(wang)(wang)絡(luo)(luo)（Neural Networks）是世界上三(san)個(ge)(ge)最(zui)古老的(de)神(shen)(shen)(shen)(shen)(shen)經建模(mo)學(xue)(xue)會(hui)(hui)(hui)的(de)檔案期刊:國際神(shen)(shen)(shen)(shen)(shen)經網(wang)(wang)(wang)絡(luo)(luo)學(xue)(xue)會(hui)(hui)(hui)(INNS)、歐(ou)洲神(shen)(shen)(shen)(shen)(shen)經網(wang)(wang)(wang)絡(luo)(luo)學(xue)(xue)會(hui)(hui)(hui)(ENNS)和(he)(he)日(ri)本神(shen)(shen)(shen)(shen)(shen)經網(wang)(wang)(wang)絡(luo)(luo)學(xue)(xue)會(hui)(hui)(hui)(JNNS)。神(shen)(shen)(shen)(shen)(shen)經網(wang)(wang)(wang)絡(luo)(luo)提(ti)(ti)供(gong)了一(yi)個(ge)(ge)論(lun)壇，以發(fa)展(zhan)和(he)(he)培(pei)育一(yi)個(ge)(ge)國際社(she)(she)會(hui)(hui)(hui)的(de)學(xue)(xue)者和(he)(he)實踐(jian)者感興趣的(de)所有方(fang)面的(de)神(shen)(shen)(shen)(shen)(shen)經網(wang)(wang)(wang)絡(luo)(luo)和(he)(he)相關方(fang)法(fa)的(de)計(ji)算智(zhi)能。神(shen)(shen)(shen)(shen)(shen)經網(wang)(wang)(wang)絡(luo)(luo)歡迎高質量論(lun)文(wen)的(de)提(ti)(ti)交，有助(zhu)于全面的(de)神(shen)(shen)(shen)(shen)(shen)經網(wang)(wang)(wang)絡(luo)(luo)研究，從行(xing)為和(he)(he)大腦建模(mo)，學(xue)(xue)習(xi)算法(fa)，通過數學(xue)(xue)和(he)(he)計(ji)算分析(xi)，系(xi)統的(de)工(gong)(gong)程和(he)(he)技術應(ying)用(yong)，大量使(shi)用(yong)神(shen)(shen)(shen)(shen)(shen)經網(wang)(wang)(wang)絡(luo)(luo)的(de)概(gai)念和(he)(he)技術。這(zhe)一(yi)獨(du)特而廣泛的(de)范圍促進了生物(wu)和(he)(he)技術研究之間的(de)思(si)想交流，并有助(zhu)于促進對生物(wu)啟發(fa)的(de)計(ji)算智(zhi)能感興趣的(de)跨學(xue)(xue)科(ke)社(she)(she)區的(de)發(fa)展(zhan)。因(yin)此(ci)，神(shen)(shen)(shen)(shen)(shen)經網(wang)(wang)(wang)絡(luo)(luo)編(bian)委會(hui)(hui)(hui)代表的(de)專家(jia)領域包括心理學(xue)(xue)，神(shen)(shen)(shen)(shen)(shen)經生物(wu)學(xue)(xue)，計(ji)算機科(ke)學(xue)(xue)，工(gong)(gong)程，數學(xue)(xue)，物(wu)理。該雜(za)志發(fa)表文(wen)章、信件和(he)(he)評(ping)論(lun)以及(ji)給編(bian)輯的(de)信件、社(she)(she)論(lun)、時事、軟件調查和(he)(he)專利(li)信息。文(wen)章發(fa)表在五(wu)個(ge)(ge)部(bu)分之一(yi):認知科(ke)學(xue)(xue)，神(shen)(shen)(shen)(shen)(shen)經科(ke)學(xue)(xue)，學(xue)(xue)習(xi)系(xi)統，數學(xue)(xue)和(he)(he)計(ji)算分析(xi)、工(gong)(gong)程和(he)(he)應(ying)用(yong)。官網(wang)(wang)(wang)地址：

圖 · Processing（編程語言） · NLP · Neural Networks · 圖形處理器 ·

2021 年 6 月 10 日

Graph Neural Networks for Natural Language Processing: A Survey

Lingfei Wu,Yu Chen,Kai Shen,Xiaojie Guo,Hanning Gao,Shucheng Li,Jian Pei,Bo Long

from arxiv, 127 pages

Deep learning has become the dominant approach in coping with various tasks in Natural LanguageProcessing (NLP). Although text inputs are typically represented as a sequence of tokens, there isa rich variety of NLP problems that can be best expressed with a graph structure. As a result, thereis a surge of interests in developing new deep learning techniques on graphs for a large numberof NLP tasks. In this survey, we present a comprehensive overview onGraph Neural Networks(GNNs) for Natural Language Processing. We propose a new taxonomy of GNNs for NLP, whichsystematically organizes existing research of GNNs for NLP along three axes: graph construction,graph representation learning, and graph based encoder-decoder models. We further introducea large number of NLP applications that are exploiting the power of GNNs and summarize thecorresponding benchmark datasets, evaluation metrics, and open-source codes. Finally, we discussvarious outstanding challenges for making the full use of GNNs for NLP as well as future researchdirections. To the best of our knowledge, this is the first comprehensive overview of Graph NeuralNetworks for Natural Language Processing.

注意力機制 · 注意力模型 · 可辨認的 · MoDELS · Automator ·

2021 年 3 月 31 日

Attention, please! A survey of Neural Attention Models in Deep Learning

Alana de Santana Correia,Esther Luna Colombini

from arxiv, 66 pages, 24 figures

In humans, Attention is a core property of all perceptual and cognitive operations. Given our limited ability to process competing sources, attention mechanisms select, modulate, and focus on the information most relevant to behavior. For decades, concepts and functions of attention have been studied in philosophy, psychology, neuroscience, and computing. For the last six years, this property has been widely explored in deep neural networks. Currently, the state-of-the-art in Deep Learning is represented by neural attention models in several application domains. This survey provides a comprehensive overview and analysis of developments in neural attention models. We systematically reviewed hundreds of architectures in the area, identifying and discussing those in which attention has shown a significant impact. We also developed and made public an automated methodology to facilitate the development of reviews in the area. By critically analyzing 650 works, we describe the primary uses of attention in convolutional, recurrent networks and generative models, identifying common subgroups of uses and applications. Furthermore, we describe the impact of attention in different application domains and their impact on neural networks' interpretability. Finally, we list possible trends and opportunities for further research, hoping that this review will provide a succinct overview of the main attentional models in the area and guide researchers in developing future approaches that will drive further improvements.

學成 · 大數據 · 相同 · 人工智能 · 統計方法 ·

2020 年 5 月 5 日

A Survey of Learning Causality with Data: Problems and Methods

Ruocheng Guo,Lu Cheng,Jundong Li,P. Richard Hahn,Huan Liu

from arxiv, 35 pages, accepted by ACM CSUR

This work considers the question of how convenient access to copious data impacts our ability to learn causal effects and relations. In what ways is learning causality in the era of big data different from -- or the same as -- the traditional one? To answer this question, this survey provides a comprehensive and structured review of both traditional and frontier methods in learning causality and relations along with the connections between causality and machine learning. This work points out on a case-by-case basis how big data facilitates, complicates, or motivates each approach.

圖 · Neural Networks · 圖形處理器 · Networking · 圖注意力網絡 ·

2019 年 7 月 10 日

Graph Neural Networks: A Review of Methods and Applications

Jie Zhou,Ganqu Cui,Zhengyan Zhang,Cheng Yang,Zhiyuan Liu,Lifeng Wang,Changcheng Li,Maosong Sun

Lots of learning tasks require dealing with graph data which contains rich relation information among elements. Modeling physics system, learning molecular fingerprints, predicting protein interface, and classifying diseases require a model to learn from graph inputs. In other domains such as learning from non-structural data like texts and images, reasoning on extracted structures, like the dependency tree of sentences and the scene graph of images, is an important research topic which also needs graph reasoning models. Graph neural networks (GNNs) are connectionist models that capture the dependence of graphs via message passing between the nodes of graphs. Unlike standard neural networks, graph neural networks retain a state that can represent information from its neighborhood with arbitrary depth. Although the primitive GNNs have been found difficult to train for a fixed point, recent advances in network architectures, optimization techniques, and parallel computation have enabled successful learning with them. In recent years, systems based on variants of graph neural networks such as graph convolutional network (GCN), graph attention network (GAT), gated graph neural network (GGNN) have demonstrated ground-breaking performance on many tasks mentioned above. In this survey, we provide a detailed review over existing graph neural network models, systematically categorize the applications, and propose four open problems for future research.

文本分類 · 可理解性 · Machine Learning · 學成 · 降維 ·

2019 年 4 月 17 日

Text Classification Algorithms: A Survey

Kamran Kowsari,Kiana Jafari Meimandi,Mojtaba Heidarysafa,Sanjana Mendu,Laura E. Barnes,Donald E. Brown

In recent years, there has been an exponential growth in the number of complex documents and texts that require a deeper understanding of machine learning methods to be able to accurately classify texts in many applications. Many machine learning approaches have achieved surpassing results in natural language processing. The success of these learning algorithms relies on their capacity to understand complex models and non-linear relationships within data. However, finding suitable structures, architectures, and techniques for text classification is a challenge for researchers. In this paper, a brief overview of text classification algorithms is discussed. This overview covers different text feature extractions, dimensionality reduction methods, existing algorithms and techniques, and evaluations methods. Finally, the limitations of each technique and their application in the real-world problem are discussed.

注意力機制 · 注意力模型 · Processing（編程語言） · 評論員 · MoDELS ·

2019 年 2 月 4 日

Attention, please! A Critical Review of Neural Attention Models in Natural Language Processing

Andrea Galassi,Marco Lippi,Paolo Torroni

Attention is an increasingly popular mechanism used in a wide range of neural architectures. Because of the fast-paced advances in this domain, a systematic overview of attention is still missing. In this article, we define a unified model for attention architectures for natural language processing, with a focus on architectures designed to work with vector representation of the textual data. We discuss the dimensions along which proposals differ, the possible uses of attention, and chart the major research activities and open challenges in the area.

Processing（編程語言） · Neural Networks · Networking · MoDELS · 有向 ·

2019 年 1 月 14 日

Analysis Methods in Neural Language Processing: A Survey

Yonatan Belinkov,James Glass

from arxiv, Version including the supplementary materials (3 tables), also available at //boknilev.github.io/nlp-analysis-methods

The field of natural language processing has seen impressive progress in recent years, with neural network models replacing many of the traditional systems. A plethora of new models have been proposed, many of which are thought to be opaque compared to their feature-rich counterparts. This has led researchers to analyze, interpret, and evaluate neural networks in novel and more fine-grained ways. In this survey paper, we review analysis methods in neural language processing, categorize them according to prominent research trends, highlight existing limitations, and point to potential directions for future work.

概率圖模型 · 圖 · 推斷 · GM · 信念傳播 ·

2018 年 5 月 25 日

Inference in Probabilistic Graphical Models by Graph Neural Networks

KiJung Yoon,Renjie Liao,Yuwen Xiong,Lisa Zhang,Ethan Fetaya,Raquel Urtasun,Richard Zemel,Xaq Pitkow

A fundamental computation for statistical inference and accurate decision-making is to compute the marginal probabilities or most probable states of task-relevant variables. Probabilistic graphical models can efficiently represent the structure of such complex data, but performing these inferences is generally difficult. Message-passing algorithms, such as belief propagation, are a natural way to disseminate evidence amongst correlated variables while exploiting the graph structure, but these algorithms can struggle when the conditional dependency graphs contain loops. Here we use Graph Neural Networks (GNNs) to learn a message-passing algorithm that solves these inference tasks. We first show that the architecture of GNNs is well-matched to inference tasks. We then demonstrate the efficacy of this inference approach by training GNNs on a collection of graphical models and showing that they substantially outperform belief propagation on loopy graphs. Our message-passing algorithms generalize out of the training set to larger graphs and graphs with different structure.

學成 · 圖 · Processing（編程語言） · 知識圖譜 · MoDELS ·

2018 年 2 月 16 日

Learning beyond datasets: Knowledge Graph Augmented Neural Networks for Natural language Processing

K M Annervaz,Somnath Basu Roy Chowdhury,Ambedkar Dukkipati

Machine Learning has been the quintessential solution for many AI problems, but learning is still heavily dependent on the specific training data. Some learning models can be incorporated with a prior knowledge in the Bayesian set up, but these learning models do not have the ability to access any organised world knowledge on demand. In this work, we propose to enhance learning models with world knowledge in the form of Knowledge Graph (KG) fact triples for Natural Language Processing (NLP) tasks. Our aim is to develop a deep learning model that can extract relevant prior support facts from knowledge graphs depending on the task using attention mechanism. We introduce a convolution-based model for learning representations of knowledge graph entity and relation clusters in order to reduce the attention space. We show that the proposed method is highly scalable to the amount of prior information that has to be processed and can be applied to any generic NLP task. Using this method we show significant improvement in performance for text classification with News20, DBPedia datasets and natural language inference with Stanford Natural Language Inference (SNLI) dataset. We also demonstrate that a deep learning model can be trained well with substantially less amount of labeled training data, when it has access to organised world knowledge in the form of knowledge graph.

自動問答 · Networking · MoDELS · 學成 · 可理解性 ·

2015 年 11 月 29 日

Memory Networks

Jason Weston,Sumit Chopra,Antoine Bordes

We describe a new class of learning models called memory networks. Memory networks reason with inference components combined with a long-term memory component; they learn how to use these jointly. The long-term memory can be read and written to, with the goal of using it for prediction. We investigate these models in the context of question answering (QA) where the long-term memory effectively acts as a (dynamic) knowledge base, and the output is a textual response. We evaluate them on a large-scale QA task, and a smaller, but more complex, toy task generated from a simulated world. In the latter, we show the reasoning power of such models by chaining multiple supporting sentences to answer questions that require understanding the intension of verbs.