国产高清一区二区在线影院_久久久久精品波多野吉衣无码AV_久久精品国产亚洲a∨麻豆_中文字幕在线第一页_小视频在线观看免费_亚洲AV无码国产综合专区四季_精品国产美女福到在线不卡蜜芽

Simon F Muller-Cleve,Vittorio Fra,Lyes Khacef,Alejandro Pequeno-Zurro,Daniel Klepatsch,Evelina Forno,Diego G Ivanovich,Shavika Rastogi,Gianvito Urgese,Friedemann Zenke,Chiara Bartolozzi

Spatio-temporal pattern recognition is a fundamental ability of the brain which is required for numerous real-world activities. Recent deep learning approaches have reached outstanding accuracies in such tasks, but their implementation on conventional embedded solutions is still very computationally and energy expensive. Tactile sensing in robotic applications is a representative example where real-time processing and energy efficiency are required. Following a brain-inspired computing approach, we propose a new benchmark for spatio-temporal tactile pattern recognition at the edge through Braille letter reading. We recorded a new Braille letters dataset based on the capacitive tactile sensors of the iCub robot's fingertip. We then investigated the importance of spatial and temporal information as well as the impact of event-based encoding on spike-based computation. Afterward, we trained and compared feedforward and recurrent Spiking Neural Networks (SNNs) offline using Backpropagation Through Time (BPTT) with surrogate gradients, then we deployed them on the Intel Loihi neuromorphic chip for fast and efficient inference. We compared our approach to standard classifiers, in particular to the Long Short-Term Memory (LSTM) deployed on the embedded NVIDIA Jetson GPU, in terms of classification accuracy, power, energy consumption, and delay. Our results show that the LSTM reaches ~97% of accuracy, outperforming the recurrent SNN by ~17% when using continuous frame-based data instead of event-based inputs. However, the recurrent SNN on Loihi with event-based inputs is ~500 times more energy-efficient than the LSTM on Jetson, requiring a total power of only ~30 mW. This work proposes a new benchmark for tactile sensing and highlights the challenges and opportunities of event-based encoding, neuromorphic hardware, and spike-based computing for spatio-temporal pattern recognition at the edge.

相關內容

Pattern Recognition

關注 986

模式(shi)(shi)識(shi)別(bie)是一個成熟的(de)、令人(ren)興(xing)奮的(de)、快(kuai)速發展的(de)領域(yu)，它支撐著計算機(ji)視覺、圖像(xiang)處理、文本和文檔(dang)分析以及神經網(wang)絡等相(xiang)關領域(yu)的(de)發展。它與(yu)機(ji)器學習非常相(xiang)似(si)，在(zai)(zai)生物(wu)識(shi)別(bie)、生物(wu)信(xin)息學、多(duo)媒體數(shu)據分析和最新(xin)(xin)的(de)數(shu)據科學等新(xin)(xin)興(xing)領域(yu)也有應(ying)用。模式(shi)(shi)識(shi)別(bie)（Pattern Recognition）雜志成立于大(da)約50年前，當(dang)時(shi)該領域(yu)剛剛出(chu)現計算機(ji)科學的(de)早期(qi)。在(zai)(zai)這(zhe)期(qi)間，它已大(da)大(da)擴大(da)。只(zhi)要這(zhe)些(xie)論(lun)(lun)文的(de)背(bei)景得到(dao)了清晰的(de)解釋并以模式(shi)(shi)識(shi)別(bie)文獻為基礎(chu)，該雜志接受那些(xie)對模式(shi)(shi)識(shi)別(bie)理論(lun)(lun)、方法和在(zai)(zai)任(ren)何領域(yu)的(de)應(ying)用做出(chu)原創貢獻的(de)論(lun)(lun)文。官網(wang)地(di)址(zhi)：

Learning · 多任務學習 · Wireless Networks · Networking · MAML ·

2022 年 12 月 2 日

On the Energy and Communication Efficiency Tradeoffs in Federated and Multi-Task Learning

Stefano Savazzi,Vittorio Rampa,Sanaz Kianoush,Mehdi Bennis

from arxiv, Proceedings of IEEE International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC), 2022

Recent advances in Federated Learning (FL) have paved the way towards the design of novel strategies for solving multiple learning tasks simultaneously, by leveraging cooperation among networked devices. Multi-Task Learning (MTL) exploits relevant commonalities across tasks to improve efficiency compared with traditional transfer learning approaches. By learning multiple tasks jointly, significant reduction in terms of energy footprints can be obtained. This article provides a first look into the energy costs of MTL processes driven by the Model-Agnostic Meta-Learning (MAML) paradigm and implemented in distributed wireless networks. The paper targets a clustered multi-task network setup where autonomous agents learn different but related tasks. The MTL process is carried out in two stages: the optimization of a meta-model that can be quickly adapted to learn new tasks, and a task-specific model adaptation stage where the learned meta-model is transferred to agents and tailored for a specific task. This work analyzes the main factors that influence the MTL energy balance by considering a multi-task Reinforcement Learning (RL) setup in a robotized environment. Results show that the MAML method can reduce the energy bill by at least 2 times compared with traditional approaches without inductive transfer. Moreover, it is shown that the optimal energy balance in wireless networks depends on uplink/downlink and sidelink communication efficiencies.

分解的 · 回合 · INFORMS · Performer · 標準差 ·

2022 年 12 月 2 日

The effect of speech pathology on automatic speaker verification -- a large-scale study

Soroosh Tayebi Arasteh,Tobias Weise,Maria Schuster,Elmar Noeth,Andreas Maier,Seung Hee Yang

With the advancements in deep learning (DL) and an increasing interest in data-driven speech processing methods, there is a major challenge in accessing pathological speech data. Public challenge data offers a potential remedy for this but may expose patient health information by re-identification attacks. Therefore, we investigate in this study whether or not pathological speech is more vulnerable to such re-identification than healthy speech. Our study is the first large-scale investigation on the effects of different speech pathology on automatic speaker verification (ASV) using a real-world pathological speech corpus of more than 2,000 test subjects with various speech and voice disorders from different ages. Utilizing a DL-based ASV method, we obtained a mean equal error rate (EER) of 0.89% with a standard deviation of 0.06%, which is a factor of three lower than comparable healthy speech databases. We further perform detailed analyses of external influencing factors on ASV such as age, pathology, recording environment, utterance length, and intelligibility, to explore their respective effect. Our experiments indicate that some types of speech pathology, in particular dysphonia, regardless of speech intelligibility, are more vulnerable to a breach of privacy compared to healthy speech. We also observe that the effect of pathology lies in the range of other factors, such as age, microphone, and recording environment.

邊 · 邊緣計算 · Continuity · INTERACT · 評論員 ·

2022 年 12 月 1 日

A Survey of Mobile Edge Computing for the Metaverse: Architectures, Applications, and Challenges

Yitong Wang,Jun Zhao

from arxiv, This paper appears in the 2022 IEEE 8th International Conference on Collaboration and Internet Computing (CIC)

Metaverse is an emerging virtual universe where humans can have real-time interactions and solid social links like in the physical world, and it opens up a new era of Internet and interactions. In Metaverse, an immersive and photorealistic environment promotes social activities, including education, meetings, and shopping of digital avatars based on critical technologies, including 3D rendering, extended reality, digital twins, artificial intelligence, and Blockchain. However, the limitations of computation, storage, and energy resources restrict the development of Metaverse, and a series of system issues (e.g., latency, security, and battery-life) continue to arise. As a result, how to find corresponding measurements to mitigate unsatisfactory influences becomes the focus. Mobile edge computing (MEC) as a distributed computing paradigm offloads computation-intensive tasks to the edge of the network. It brings the resources as close as possible to the end devices, addressing the shortcomings mentioned above. In this paper, we propose a comprehensive survey of the MEC-based Metaverse. Particular emphasis is given to the technologies convergence, architectures, and application scenarios, e.g., BoundlessXR and CloudXR. Significantly, we introduce the potential future directions for developing Metaverse systems.

總體代價 · 代價 · 優化器 · MoDELS · 可辨認的 ·

2022 年 12 月 1 日

Dial-a-ride problem with modular platooning and en-route transfers

Zhexi Fu,Joseph Y. J. Chow

The emerging modular vehicle (MV) technology possesses the ability to physically connect/disconnect with each other and thus travel in platoon for less energy consumption. Moreover, a platoon of MVs can be regarded as a new bus-like platform with expanded on-board carrying capacity and provide larger service throughput according to the demand density. This innovation concept might solve the mismatch problems between the fixed vehicle capacity and the temporal-spatial variations of demand in current transportation system. To obtain the optimal assignments and routes for the operation of MVs, a mixed integer linear programming (MILP) model is formulated to minimize the weighted total cost of vehicle travel cost and passenger service time. The temporal and spatial synchronization of vehicle platoons and passenger en-route transfers are determined and optimized by the MILP model while constructing the paths. Heuristic algorithms based on large neighborhood search are developed to solve the modular dial-a-ride problem (MDARP) for practical scenarios. A set of small-scale synthetic numerical experiments are tested to evaluate the optimality gap and computation time between our proposed MILP model and heuristic algorithms. Large-scale experiments are conducted on the Anaheim network with 378 candidate join/split nodes to further explore the potentials and identify the ideal operation scenarios of MVs. The results show that the innovative MV technology can save up to 52.0% in vehicle travel cost, 35.6% in passenger service time, and 29.4% in total cost against existing on-demand mobility services. Results suggest that MVs best benefit from platooning by serving enclave pairs as a hub-and-spoke service.

端到端 · 數據集 · Extensibility · 實體解析 · Continuity ·

2022 年 12 月 1 日

PIZZA: A new benchmark for complex end-to-end task-oriented parsing

Konstantine Arkoudas,Nicolas Guenon des Mesnards,Melanie Rubino,Sandesh Swamy,Saarthak Khanna,Weiqi Sun,Khan Haidar

from arxiv, Accepted for publication at AMLC 2022

Much recent work in task-oriented parsing has focused on finding a middle ground between flat slots and intents, which are inexpressive but easy to annotate, and powerful representations such as the lambda calculus, which are expressive but costly to annotate. This paper continues the exploration of task-oriented parsing by introducing a new dataset for parsing pizza and drink orders, whose semantics cannot be captured by flat slots and intents. We perform an extensive evaluation of deep-learning techniques for task-oriented parsing on this dataset, including different flavors of seq2seq systems and RNNGs. The dataset comes in two main versions, one in a recently introduced utterance-level hierarchical notation that we call TOP, and one whose targets are executable representations (EXR). We demonstrate empirically that training the parser to directly generate EXR notation not only solves the problem of entity resolution in one fell swoop and overcomes a number of expressive limitations of TOP notation, but also results in significantly greater parsing accuracy.

IR · 設計 · Performer · Integration · TOOLS ·

2022 年 11 月 30 日

Canal: A Flexible Interconnect Generator for Coarse-Grained Reconfigurable Arrays

Jackson Melchert,Keyi Zhang,Yuchen Mei,Mark Horowitz,Christopher Torng,Priyanka Raina

from arxiv, Preprint version

The architecture of a coarse-grained reconfigurable array (CGRA) interconnect has a significant effect on not only the flexibility of the resulting accelerator, but also its power, performance, and area. Design decisions that have complex trade-offs need to be explored to maintain efficiency and performance across a variety of evolving applications. This paper presents Canal, a Python-embedded domain-specific language (eDSL) and compiler for specifying and generating reconfigurable interconnects for CGRAs. Canal uses a graph-based intermediate representation (IR) that allows for easy hardware generation and tight integration with place and route tools. We evaluate Canal by constructing both a fully static interconnect and a hybrid interconnect with ready-valid signaling, and by conducting design space exploration of the interconnect architecture by modifying the switch box topology, the number of routing tracks, and the interconnect tile connections. Through the use of a graph-based IR for CGRA interconnects, the eDSL, and the interconnect generation system, Canal enables fast design space exploration and creation of CGRA interconnects.

圖 · Taxonomy · Performer · Learning · 表示學習 ·

2022 年 11 月 30 日

Taxonomy of Benchmarks in Graph Representation Learning

Renming Liu,Semih Cantürk,Frederik Wenkel,Sarah McGuire,Xinyi Wang,Anna Little,Leslie O'Bray,Michael Perlmutter,Bastian Rieck,Matthew Hirn,Guy Wolf,Ladislav Rampá?ek

from arxiv, In Proceedings of the First Learning on Graphs Conference (LoG 2022)

Graph Neural Networks (GNNs) extend the success of neural networks to graph-structured data by accounting for their intrinsic geometry. While extensive research has been done on developing GNN models with superior performance according to a collection of graph representation learning benchmarks, it is currently not well understood what aspects of a given model are probed by them. For example, to what extent do they test the ability of a model to leverage graph structure vs. node features? Here, we develop a principled approach to taxonomize benchmarking datasets according to a $\textit{sensitivity profile}$ that is based on how much GNN performance changes due to a collection of graph perturbations. Our data-driven analysis provides a deeper understanding of which benchmarking data characteristics are leveraged by GNNs. Consequently, our taxonomy can aid in selection and development of adequate graph benchmarks, and better informed evaluation of future GNN methods. Finally, our approach and implementation in $\texttt{GTaxoGym}$ package are extendable to multiple graph prediction task types and future datasets.

貪心 · 模態 · MoDELS · 學成 · 泛化理論 ·

2022 年 2 月 10 日

Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks

Nan Wu,Stanis?aw Jastrz?bski,Kyunghyun Cho,Krzysztof J. Geras

We hypothesize that due to the greedy nature of learning in multi-modal deep neural networks, these models tend to rely on just one modality while under-fitting the other modalities. Such behavior is counter-intuitive and hurts the models' generalization, as we observe empirically. To estimate the model's dependence on each modality, we compute the gain on the accuracy when the model has access to it in addition to another modality. We refer to this gain as the conditional utilization rate. In the experiments, we consistently observe an imbalance in conditional utilization rates between modalities, across multiple tasks and architectures. Since conditional utilization rate cannot be computed efficiently during training, we introduce a proxy for it based on the pace at which the model learns from each modality, which we refer to as the conditional learning speed. We propose an algorithm to balance the conditional learning speeds between modalities during training and demonstrate that it indeed addresses the issue of greedy learning. The proposed algorithm improves the model's generalization on three datasets: Colored MNIST, Princeton ModelNet40, and NVIDIA Dynamic Hand Gesture.

Extensibility · 鏈路預測 · Performer · 任務對話系統 · MoDELS ·

2019 年 12 月 17 日

Differentiable Reasoning on Large Knowledge Bases and Natural Language

Pasquale Minervini,Matko Bo?njak,Tim Rockt?schel,Sebastian Riedel,Edward Grefenstette

from arxiv, Accepted at the 34th AAAI Conference on Artificial Intelligence (AAAI-20)

Reasoning with knowledge expressed in natural language and Knowledge Bases (KBs) is a major challenge for Artificial Intelligence, with applications in machine reading, dialogue, and question answering. General neural architectures that jointly learn representations and transformations of text are very data-inefficient, and it is hard to analyse their reasoning process. These issues are addressed by end-to-end differentiable reasoning systems such as Neural Theorem Provers (NTPs), although they can only be used with small-scale symbolic KBs. In this paper we first propose Greedy NTPs (GNTPs), an extension to NTPs addressing their complexity and scalability limitations, thus making them applicable to real-world datasets. This result is achieved by dynamically constructing the computation graph of NTPs and including only the most promising proof paths during inference, thus obtaining orders of magnitude more efficient models. Then, we propose a novel approach for jointly reasoning over KBs and textual mentions, by embedding logic facts and natural language sentences in a shared embedding space. We show that GNTPs perform on par with NTPs at a fraction of their cost while achieving competitive link prediction results on large datasets, providing explanations for predictions, and inducing interpretable models. Source code, datasets, and supplementary material are available online at //github.com/uclnlp/gntp.

圖卷積神經網絡/圖卷積網絡 · 文本分類 · 圖卷積網絡 · 圖卷積 · 圖 ·

2018 年 9 月 15 日

Graph Convolutional Networks for Text Classification

Liang Yao,Chengsheng Mao,Yuan Luo

Text Classification is an important and classical problem in natural language processing. There have been a number of studies that applied convolutional neural networks (convolution on regular grid, e.g., sequence) to classification. However, only a limited number of studies have explored the more flexible graph convolutional neural networks (e.g., convolution on non-grid, e.g., arbitrary graph) for the task. In this work, we propose to use graph convolutional networks for text classification. We build a single text graph for a corpus based on word co-occurrence and document word relations, then learn a Text Graph Convolutional Network (Text GCN) for the corpus. Our Text GCN is initialized with one-hot representation for word and document, it then jointly learns the embeddings for both words and documents, as supervised by the known class labels for documents. Our experimental results on multiple benchmark datasets demonstrate that a vanilla Text GCN without any external word embeddings or knowledge outperforms state-of-the-art methods for text classification. On the other hand, Text GCN also learns predictive word and document embeddings. In addition, experimental results show that the improvement of Text GCN over state-of-the-art comparison methods become more prominent as we lower the percentage of training data, suggesting the robustness of Text GCN to less training data in text classification.