
As more scientific fields rely on neural networks (NNs) to process data arriving at extreme throughputs and under extreme latency constraints, it is crucial to develop NNs with all their parameters stored on-chip. In many of these applications, there is not enough time to go off-chip and retrieve weights. Moreover, off-chip memory such as DRAM does not have the bandwidth required to process these NNs as fast as the data is being produced (e.g., every 25 ns). These extreme latency and bandwidth requirements have architectural implications for the hardware intended to run these NNs: 1) all NN parameters must fit on-chip, and 2) codesigning custom/reconfigurable logic is often required to meet the latency and bandwidth constraints. In our work, we show that many scientific NN applications must run fully on-chip, in the extreme case requiring a custom chip to meet such stringent constraints.

Related content

Neural Networks


Neural Networks is the archival journal of the world's three oldest neural modeling societies: the International Neural Network Society (INNS), the European Neural Network Society (ENNS), and the Japanese Neural Network Society (JNNS). Neural Networks provides a forum for developing and nurturing an international community of scholars and practitioners who are interested in all aspects of neural networks and related approaches to computational intelligence. Neural Networks welcomes submissions of high-quality papers that contribute to the full range of neural networks research, from behavioral and brain modeling and learning algorithms, through mathematical and computational analyses, to engineering and technological applications of systems that make significant use of neural network concepts and techniques. This unique and broad scope facilitates the exchange of ideas between biological and technological studies and helps foster the development of an interdisciplinary community interested in biologically inspired computational intelligence. Accordingly, the Neural Networks editorial board represents experts in fields including psychology, neurobiology, computer science, engineering, mathematics, and physics. The journal publishes articles, letters, and reviews, as well as letters to the editor, editorials, current events, software surveys, and patent information. Articles are published in one of five sections: Cognitive Science, Neuroscience, Learning Systems, Mathematical and Computational Analysis, and Engineering and Applications.

Scenario · Random search · Dataset · All · Analysis

April 26, 2024

The Inefficiency of Genetic Programming for Symbolic Regression -- Extended Version

Gabriel Kronberger, Fabricio Olivetti de Franca, Harry Desmond, Deaglan J. Bartlett, Lukas Kammerer

from arxiv, This is an extended version of the article submitted to Parallel Problem Solving from Nature (PPSN) Conference 2024

We analyse the search behaviour of genetic programming for symbolic regression in practically relevant but limited settings, allowing exhaustive enumeration of all solutions. This enables us to quantify the success probability of finding the best possible expressions, and to compare the search efficiency of genetic programming to random search in the space of semantically unique expressions. This analysis is made possible by improved algorithms for equality saturation, which we use to improve the Exhaustive Symbolic Regression algorithm; this produces the set of semantically unique expression structures, orders of magnitude smaller than the full symbolic regression search space. We compare the efficiency of random search in the set of unique expressions and genetic programming. For our experiments we use two real-world datasets where symbolic regression has been used to produce well-fitting univariate expressions: the Nikuradse dataset of flow in rough pipes and the Radial Acceleration Relation of galaxy dynamics. The results show that genetic programming in such limited settings explores only a small fraction of all unique expressions, and evaluates expressions repeatedly that are congruent to already visited expressions.

Partitioning · Networking · Federated learning · Networks · Learning

April 25, 2024

Cross-Silo Federated Learning for Multi-Tier Networks with Vertical and Horizontal Data Partitioning

Anirban Das, Timothy Castiglia, Shiqiang Wang, Stacy Patterson

from arxiv, Published in ACM Transactions on Intelligent Systems and Technology (ACM TIST), 2022. Updated minor typos in the proof

We consider federated learning in tiered communication networks. Our network model consists of a set of silos, each holding a vertical partition of the data. Each silo contains a hub and a set of clients, with the silo's vertical data shard partitioned horizontally across its clients. We propose Tiered Decentralized Coordinate Descent (TDCD), a communication-efficient decentralized training algorithm for such two-tiered networks. The clients in each silo perform multiple local gradient steps before sharing updates with their hub to reduce communication overhead. Each hub adjusts its coordinates by averaging its workers' updates, and then hubs exchange intermediate updates with one another. We present a theoretical analysis of our algorithm and show the dependence of the convergence rate on the number of vertical partitions and the number of local updates. We further validate our approach empirically via simulation-based experiments using a variety of datasets and objectives.

Generative AI · AI · Principle · Specialization · Optimization

December 13, 2023

A Survey of Generative AI for Intelligent Transportation Systems

Huan Yan, Yong Li

Intelligent transportation systems play a crucial role in modern traffic management and optimization, greatly improving traffic efficiency and safety. With the rapid development of generative artificial intelligence (Generative AI) technologies in the fields of image generation and natural language processing, generative AI has also played a crucial role in addressing key issues in intelligent transportation systems, such as data sparsity, the difficulty of observing abnormal scenarios, and the modeling of data uncertainty. In this review, we systematically investigate the relevant literature on generative AI techniques for addressing key issues in different types of tasks in intelligent transportation systems. First, we introduce the principles of different generative AI techniques and their potential applications. Then, we classify tasks in intelligent transportation systems into four types: traffic perception, traffic prediction, traffic simulation, and traffic decision-making. We systematically illustrate how generative AI techniques address key issues in these four types of tasks. Finally, we summarize the challenges faced in applying generative AI to intelligent transportation systems, and discuss future research directions based on different application scenarios.

Performer · Neural Networks · Graph · GPU · Dynamical systems

November 10, 2022

Unravelling the Performance of Physics-informed Graph Neural Networks for Dynamical Systems

Abishek Thangamuthu, Gunjan Kumar, Suresh Bishnoi, Ravinder Bhattoo, N M Anoop Krishnan, Sayan Ranu

from arxiv, Accepted at 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

Recently, graph neural networks have been gaining a lot of attention to simulate dynamical systems due to their inductive nature leading to zero-shot generalizability. Similarly, physics-informed inductive biases in deep-learning frameworks have been shown to give superior performance in learning the dynamics of physical systems. There is a growing volume of literature that attempts to combine these two approaches. Here, we evaluate the performance of thirteen different graph neural networks, namely, Hamiltonian and Lagrangian graph neural networks, graph neural ODE, and their variants with explicit constraints and different architectures. We briefly explain the theoretical formulation highlighting the similarities and differences in the inductive biases and graph architecture of these systems. We evaluate these models on spring, pendulum, gravitational, and 3D deformable solid systems to compare the performance in terms of rollout error, conserved quantities such as energy and momentum, and generalizability to unseen system sizes. Our study demonstrates that GNNs with additional inductive biases, such as explicit constraints and decoupling of kinetic and potential energies, exhibit significantly enhanced performance. Further, all the physics-informed GNNs exhibit zero-shot generalizability to system sizes an order of magnitude larger than the training system, thus providing a promising route to simulate large-scale realistic systems.

Knowledge · Processing (programming language) · Graph · NLP · Knowledge graph

September 30, 2022

A Decade of Knowledge Graphs in Natural Language Processing: A Survey

Phillip Schneider, Tim Schopf, Juraj Vladika, Mikhail Galkin, Elena Simperl, Florian Matthes

from arxiv, Accepted to AACL-IJCNLP 2022

In pace with developments in the research field of artificial intelligence, knowledge graphs (KGs) have attracted a surge of interest from both academia and industry. As a representation of semantic relations between entities, KGs have proven to be particularly relevant for natural language processing (NLP), experiencing a rapid spread and wide adoption within recent years. Given the increasing amount of research work in this area, several KG-related approaches have been surveyed in the NLP research community. However, a comprehensive study that categorizes established topics and reviews the maturity of individual research streams remains absent to this day. Contributing to closing this gap, we systematically analyzed 507 papers from the literature on KGs in NLP. Our survey encompasses a multifaceted review of tasks, research types, and contributions. As a result, we present a structured overview of the research landscape, provide a taxonomy of tasks, summarize our findings, and highlight directions for future work.

Neural Networks · Graph · Taxonomy · Performer · GPU

July 26, 2022

A Survey of Explainable Graph Neural Networks: Taxonomy and Evaluation Metrics

Yiqiao Li, Jianlong Zhou, Sunny Verma, Fang Chen

Graph neural networks (GNNs) have demonstrated a significant boost in prediction performance on graph data. At the same time, the predictions made by these models are often hard to interpret. In that regard, many efforts have been made to explain the prediction mechanisms of these models, with methods such as GNNExplainer, XGNN and PGExplainer. Although such works present systematic frameworks to interpret GNNs, a holistic review of explainable GNNs is unavailable. In this survey, we present a comprehensive review of explainability techniques developed for GNNs. We focus on explainable graph neural networks and categorize them based on the explanation methods they use. We further provide the common performance metrics for GNN explanations and point out several future research directions.

Neural Networks · Networking · Reducible · Continuity · Inference

June 21, 2021

A Survey of Quantization Methods for Efficient Neural Network Inference

Amir Gholami, Sehoon Kim, Zhen Dong, Zhewei Yao, Michael W. Mahoney, Kurt Keutzer

from arxiv, Book Chapter: Low-Power Computer Vision: Improving the Efficiency of Artificial Intelligence

As soon as abstract mathematical computations were adapted to computation on digital computers, the problem of efficient representation, manipulation, and communication of the numerical values in those computations arose. Strongly related to the problem of numerical representation is the problem of quantization: in what manner should a set of continuous real-valued numbers be distributed over a fixed discrete set of numbers to minimize the number of bits required and also to maximize the accuracy of the attendant computations? This perennial problem of quantization is particularly relevant whenever memory and/or computational resources are severely restricted, and it has come to the forefront in recent years due to the remarkable performance of Neural Network models in computer vision, natural language processing, and related areas. Moving from floating-point representations to low-precision fixed integer values represented in four bits or less holds the potential to reduce the memory footprint and latency by a factor of 16x; and, in fact, reductions of 4x to 8x are often realized in practice in these applications. Thus, it is not surprising that quantization has emerged recently as an important and very active sub-area of research in the efficient implementation of computations associated with Neural Networks. In this article, we survey approaches to the problem of quantizing the numerical values in deep Neural Network computations, covering the advantages/disadvantages of current methods. With this survey and its organization, we hope to have presented a useful snapshot of the current research in quantization for Neural Networks and to have given an intelligent organization to ease the evaluation of future research in this area.
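To make the quantization question above concrete, here is a minimal sketch of symmetric uniform quantization, one common scheme among the many such surveys cover. It is not taken from the survey itself: the function names `quantize_uniform` and `dequantize`, the 4-bit setting, and the sample weights are all illustrative assumptions.

```python
def quantize_uniform(values, num_bits=4):
    """Map floats to signed integers in [-qmax, qmax] using one shared scale.

    Hypothetical helper for illustration: symmetric, per-tensor, uniform.
    """
    qmax = 2 ** (num_bits - 1) - 1              # 7 for 4 bits
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Round to the nearest integer level, then clamp to the representable range.
    q = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return q, scale


def dequantize(q, scale):
    """Recover approximate real values from integer levels."""
    return [qi * scale for qi in q]


weights = [0.8, -0.35, 0.02, -1.4, 0.61]        # made-up example weights
q, scale = quantize_uniform(weights, num_bits=4)
recovered = dequantize(q, scale)
max_error = max(abs(w - r) for w, r in zip(weights, recovered))
print(q, scale, max_error)
```

With a single shared scale, the rounding error of any value inside the clamping range is at most half a quantization step, which is why the choice of range (here, the largest absolute weight) matters so much at low bit widths.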

Neural Networks · Graph · Networks · GPU · Networking

January 25, 2021

A Review of Graph Neural Networks and Their Applications in Power Systems

Wenlong Liao, Birgitte Bak-Jensen, Jayakrishnan Radhakrishna Pillai, Yuelong Wang, Yusen Wang

Deep neural networks have revolutionized many machine learning tasks in power systems, ranging from pattern recognition to signal processing. The data in these tasks is typically represented in Euclidean domains. Nevertheless, there is an increasing number of applications in power systems where data are collected from non-Euclidean domains and represented as graph-structured data with high-dimensional features and interdependency among nodes. The complexity of graph-structured data poses significant challenges to existing deep neural networks defined in Euclidean domains. Recently, many studies on extending deep neural networks to graph-structured data in power systems have emerged. In this paper, a comprehensive overview of graph neural networks (GNNs) in power systems is presented. Specifically, several classical paradigms of GNN structures (e.g., graph convolutional networks, graph recurrent neural networks, graph attention networks, graph generative networks, spatial-temporal graph convolutional networks, and hybrid forms of GNNs) are summarized, and key applications in power systems such as fault diagnosis, power prediction, power flow calculation, and data generation are reviewed in detail. Furthermore, the main issues and some research trends concerning the applications of GNNs in power systems are discussed.

Vision · Model evaluation · Reducible · Computer vision · DNN

March 24, 2020

A Survey of Methods for Low-Power Deep Learning and Computer Vision

Abhinav Goel, Caleb Tung, Yung-Hsiang Lu, George K. Thiruvathukal

from arxiv, Accepted for publication at 2020 IEEE 6th World Forum on Internet of Things (WF-IoT), New Orleans, LA, USA 2020

Deep neural networks (DNNs) are successful in many computer vision tasks. However, the most accurate DNNs require millions of parameters and operations, making them energy, computation and memory intensive. This impedes the deployment of large DNNs in low-power devices with limited compute resources. Recent research improves DNN models by reducing the memory requirement, energy consumption, and number of operations without significantly decreasing the accuracy. This paper surveys the progress of low-power deep learning and computer vision, specifically with regard to inference, and discusses the methods for compacting and accelerating DNN models. The techniques can be divided into four major categories: (1) parameter quantization and pruning, (2) compressed convolutional filters and matrix factorization, (3) network architecture search, and (4) knowledge distillation. We analyze the accuracy, advantages, disadvantages, and potential solutions to the problems with the techniques in each category. We also discuss new evaluation metrics as a guideline for future research.
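As a rough illustration of the pruning half of category (1), the sketch below zeroes out the smallest-magnitude weights of a flat list. This is a generic magnitude-pruning example under stated assumptions, not the method of any specific paper in the survey; the function name `magnitude_prune`, the 50% sparsity, and the sample weights are hypothetical.

```python
def magnitude_prune(weights, sparsity):
    """Zero out the fraction `sparsity` of weights with the smallest magnitude.

    Hypothetical helper for illustration; real pipelines typically prune
    per layer and fine-tune afterwards to recover accuracy.
    """
    n_prune = int(len(weights) * sparsity)
    # Indices ordered from smallest to largest absolute value.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned_idx = set(order[:n_prune])
    return [0.0 if i in pruned_idx else w for i, w in enumerate(weights)]


weights = [0.9, -0.05, 0.4, 0.01, -0.7, 0.2]    # made-up example weights
pruned = magnitude_prune(weights, sparsity=0.5)
print(pruned)
```

The zeros only save memory or operations if the storage format or hardware can exploit sparsity, which is one reason pruning is often evaluated alongside quantization rather than in isolation.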

Networking · Neural Networks · MoDELS · Performer · Model performance

September 8, 2019

A Survey of Model Compression and Acceleration for Deep Neural Networks

Yu Cheng, Duo Wang, Pan Zhou, Tao Zhang

from arxiv, Published in IEEE Signal Processing Magazine, arXiv version including some recent works

Deep convolutional neural networks (CNNs) have recently achieved great success in many visual recognition tasks. However, existing deep neural network models are computationally expensive and memory intensive, hindering their deployment in devices with low memory resources or in applications with strict latency requirements. Therefore, a natural approach is to perform model compression and acceleration in deep networks without significantly decreasing the model performance. During the past few years, tremendous progress has been made in this area. In this paper, we survey recent advanced techniques developed for compacting and accelerating CNN models. These techniques are roughly categorized into four schemes: parameter pruning and sharing, low-rank factorization, transferred/compact convolutional filters, and knowledge distillation. Methods of parameter pruning and sharing are described first, followed by the other techniques. For each scheme, we provide insightful analysis regarding the performance, related applications, advantages, and drawbacks. Then we go through a few very recent additional successful methods, for example, dynamic capacity networks and stochastic depth networks. After that, we survey the evaluation metrics, the main datasets used for evaluating the model performance, and recent benchmarking efforts. Finally, we conclude this paper and discuss remaining challenges and possible directions on this topic.