
As deep learning models become widespread, there is a growing need to deploy them across diverse device environments. Because it is costly to develop and optimize a neural network for every single environment, a line of research has sought to efficiently search for neural networks for multiple target environments. However, existing works for this setting still require many GPUs and incur high costs. Motivated by this, we propose a novel neural network optimization framework named Bespoke for low-cost deployment. Our framework searches for a lightweight model by replacing parts of an original model with randomly selected alternatives, each of which comes from a pretrained neural network or the original model itself. In practice, Bespoke has two significant merits. First, it requires near-zero cost for designing the search space of neural networks. Second, it exploits the sub-networks of public pretrained neural networks, so the total cost is minimal compared to existing works. We conduct experiments exploring Bespoke's merits, and the results show that it finds efficient models for multiple targets at meager cost.
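To make the replace-and-search idea concrete, here is a minimal sketch (assuming PyTorch; the candidate pool, compatibility test, and `score_fn` are illustrative assumptions, not Bespoke's actual implementation) of swapping randomly chosen parts of a model with shape-compatible sub-networks harvested from pretrained donors, keeping a swap only when it improves a cost-aware score.

```python
import copy
import random
import torch.nn as nn

def random_block_replacement(model: nn.Sequential, candidate_pool, score_fn, trials=10):
    """Randomly replace one block of `model` with a shape-compatible
    alternative from `candidate_pool`, keeping the best-scoring variant.

    candidate_pool: list of (in_ch, out_ch, module) tuples harvested from
    pretrained networks; score_fn: higher is better (e.g. accuracy / latency).
    """
    best, best_score = model, score_fn(model)
    for _ in range(trials):
        variant = copy.deepcopy(best)
        idx = random.randrange(len(variant))
        old = variant[idx]
        # keep only candidates whose channel signature matches the slot
        compatible = [m for (cin, cout, m) in candidate_pool
                      if getattr(old, "in_channels", cin) == cin
                      and getattr(old, "out_channels", cout) == cout]
        if not compatible:
            continue
        variant[idx] = copy.deepcopy(random.choice(compatible))
        score = score_fn(variant)
        if score > best_score:
            best, best_score = variant, score
    return best
```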

Related Content

Neural Networks is the archival journal of the world's three oldest neural modeling societies: the International Neural Network Society (INNS), the European Neural Network Society (ENNS), and the Japanese Neural Network Society (JNNS). Neural Networks provides a forum for developing and nurturing an international community of scholars and practitioners interested in all aspects of neural networks and related approaches to computational intelligence. Neural Networks welcomes high-quality submissions that contribute to the full range of neural network research, from behavioral and brain modeling and learning algorithms, through mathematical and computational analysis, to systems engineering and technological applications that make substantial use of neural network concepts and techniques. This uniquely broad scope promotes the exchange of ideas between biological and technological research and helps foster the development of an interdisciplinary community interested in biologically inspired computational intelligence. Accordingly, the fields of expertise represented on the Neural Networks editorial board include psychology, neurobiology, computer science, engineering, mathematics, and physics. The journal publishes articles, letters, and reviews, as well as letters to the editor, editorials, current events, software surveys, and patent information. Articles appear in one of five sections: cognitive science, neuroscience, learning systems, mathematical and computational analysis, and engineering and applications. Official website:

As LLMs have become capable of processing more complex types of inputs, researchers have recently studied how to efficiently and affordably process possibly arbitrarily long sequences. One effective approach is to use a FIFO memory that stores the keys and values of an attention sublayer from past chunks, allowing subsequent queries to attend to them. However, this approach requires a large memory and/or must be tailored to the specific LM architecture. Moreover, due to the causal relationship between the key-values in prior context and the present queries, this approach cannot be extended to bidirectional attention, such as in an encoder-decoder or PrefixLM decoder-only architecture. In this paper, we propose to use eviction policies, such as LRA and LFA, to reduce the memory size and adapt to various architectures, and we also propose the Attendre layer, a wait-to-attend mechanism that retrieves the key-value memory (K/V memory) with evicted queries from the query memory (Q memory). As a first step, we evaluate this method in the context-length extension setup using the TriviaQA reading comprehension task and show the effectiveness of the approach.
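As a rough illustration of the eviction idea, here is a minimal sketch of a bounded K/V memory that evicts the least-recently-attended entries when full; the "recently attended" heuristic and interfaces below are assumptions for illustration, not the paper's exact LRA policy or the Attendre layer itself.

```python
import torch

class LRAKeyValueMemory:
    """Bounded K/V memory for chunked attention. When full, evicts the
    entries that were least recently attended -- a sketch of the idea,
    not the paper's exact policy. Assumes each write fits in capacity."""

    def __init__(self, capacity: int, dim: int):
        self.capacity, self.dim = capacity, dim
        self.keys = torch.empty(0, dim)
        self.values = torch.empty(0, dim)
        self.last_attended = torch.empty(0)  # step of last strong attention
        self.step = 0

    def attend(self, queries: torch.Tensor) -> torch.Tensor:
        self.step += 1
        if self.keys.shape[0] == 0:
            return torch.zeros_like(queries)
        weights = (queries @ self.keys.T / self.dim ** 0.5).softmax(dim=-1)
        # mark entries that received above-uniform attention as recently used
        used = weights.max(dim=0).values > 1.0 / self.keys.shape[0]
        self.last_attended[used] = float(self.step)
        return weights @ self.values

    def write(self, keys: torch.Tensor, values: torch.Tensor):
        n_new = keys.shape[0]
        if self.keys.shape[0] + n_new > self.capacity:
            # evict least-recently-attended entries first
            keep = self.last_attended.argsort(descending=True)[:self.capacity - n_new]
            self.keys, self.values = self.keys[keep], self.values[keep]
            self.last_attended = self.last_attended[keep]
        self.keys = torch.cat([self.keys, keys])
        self.values = torch.cat([self.values, values])
        self.last_attended = torch.cat(
            [self.last_attended, torch.full((n_new,), float(self.step))])
```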

Generating time series data is a promising approach to address data deficiency problems. However, it is also challenging due to the complex temporal properties of time series data, including local correlations as well as global dependencies. Most existing generative models have failed to effectively learn both the local and global properties of time series data. To address this open problem, we propose a novel time series generative model named 'Time-Transformer AAE', which consists of an adversarial autoencoder (AAE) and a newly designed architecture named 'Time-Transformer' within the decoder. The Time-Transformer first learns local and global features simultaneously in a layer-wise parallel design, combining the abilities of Temporal Convolutional Networks and the Transformer to extract local features and global dependencies, respectively. Second, a bidirectional cross attention is proposed to provide complementary guidance across the two branches and achieve proper fusion between local and global features. Experimental results demonstrate that our model can outperform existing state-of-the-art models on 5 of 6 datasets, specifically on those containing both global and local properties. Furthermore, we highlight our model's advantage in handling this kind of data via an artificial dataset. Finally, we show our model's ability to address a real-world problem: data augmentation to support learning with small and imbalanced datasets.
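The fusion step can be illustrated with a short sketch: each branch attends to the other, so local (TCN) features receive global context and vice versa, and the two enriched streams are then merged. Layer sizes and the fusion head below are illustrative assumptions, not the paper's exact design.

```python
import torch
import torch.nn as nn

class BidirectionalCrossAttention(nn.Module):
    """Sketch of bidirectional cross attention between a local (TCN)
    branch and a global (Transformer) branch, both shaped (B, T, D)."""

    def __init__(self, dim: int, heads: int = 4):
        super().__init__()
        self.local_to_global = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.global_to_local = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.fuse = nn.Linear(2 * dim, dim)

    def forward(self, local_feat, global_feat):
        # the local branch queries the global branch, and vice versa
        l2g, _ = self.local_to_global(local_feat, global_feat, global_feat)
        g2l, _ = self.global_to_local(global_feat, local_feat, local_feat)
        # residual enrichment on each branch, then a learned fusion
        return self.fuse(torch.cat([local_feat + l2g, global_feat + g2l], dim=-1))
```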

Recently, the speech separation (SS) task has achieved remarkable progress driven by deep learning techniques. However, it is still challenging to separate target speech from a noisy mixture, as the neural model is prone to assigning background noise to each speaker. In this paper, we propose a noise-aware SS (NASS) method, which aims to improve the speech quality of separated signals under noisy conditions. Specifically, NASS views background noise as an additional output and predicts it along with the other speakers in a mask-based manner. To denoise effectively, we introduce patch-wise contrastive learning (PCL) between the noise and speaker representations from the decoder input and encoder output. The PCL loss aims to minimize the mutual information between the predicted noise and the other speakers at the patch level, suppressing noise information in the separated signals. Experimental results show that NASS achieves 1 to 2 dB of SI-SNRi or SDRi improvement over DPRNN and Sepformer on the WHAM! and LibriMix noisy datasets, with less than a 0.1M parameter increase.
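As a simplified stand-in for the PCL idea (not the paper's exact loss), the sketch below cuts the noise and speaker representations into patches and penalizes cross-similarity between them, pushing noise information out of the separated signals; the patching scheme and similarity measure are assumptions.

```python
import torch
import torch.nn.functional as F

def patchwise_contrastive_loss(noise_repr, speaker_repr, patch_size=16):
    """Penalize similarity between noise patches and speaker patches.
    Both inputs are (B, T, D); minimizing this loss discourages shared
    information between the predicted noise and the separated speakers."""
    def to_patches(x):
        b, t, d = x.shape
        t = t - t % patch_size                       # drop the ragged tail
        return x[:, :t].reshape(b, t // patch_size, patch_size * d)

    n = F.normalize(to_patches(noise_repr), dim=-1)
    s = F.normalize(to_patches(speaker_repr), dim=-1)
    # cosine similarity between every noise patch and every speaker patch
    sim = torch.einsum('bnd,bmd->bnm', n, s)
    return sim.abs().mean()
```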

Deep reinforcement learning (DRL) methods have recently shown promise in path planning tasks. However, when dealing with global planning tasks, these methods face serious challenges such as poor convergence and generalization. To this end, we propose an attention-enhanced DRL method called LOPA (Learn Once Plan Arbitrarily) in this paper. Firstly, we analyze the causes of these problems from the perspective of the DRL agent's observation, revealing that the traditional design allows DRL to be distracted by irrelevant map information. Secondly, we develop LOPA, which utilizes a novel attention-enhanced mechanism to attain improved attention to the key information in the observation. This mechanism is realized in two steps: (1) an attention model is built to transform the DRL observation into two dynamic views, local and global, guiding LOPA to focus on the key information on the given maps; (2) a dual-channel network is constructed to process these two views and integrate them for improved reasoning capability. LOPA is validated via multi-objective global path planning experiments. The results suggest that LOPA achieves improved convergence and generalization performance as well as high path-planning efficiency.
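The dual-channel step can be sketched as follows: a local crop around the agent and a downsampled global map are encoded by separate convolutional channels and merged before the policy head. All sizes and the policy-head design below are illustrative assumptions, not LOPA's exact architecture.

```python
import torch
import torch.nn as nn

class DualChannelPlanner(nn.Module):
    """Sketch of a dual-channel network: separate encoders for the local
    and global map views, concatenated into a single policy head."""

    def __init__(self, n_actions: int = 8):
        super().__init__()
        def encoder():
            return nn.Sequential(
                nn.Conv2d(1, 16, 3, stride=2, padding=1), nn.ReLU(),
                nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
                nn.AdaptiveAvgPool2d(4), nn.Flatten())  # -> 32 * 4 * 4 = 512
        self.local_enc, self.global_enc = encoder(), encoder()
        self.policy = nn.Sequential(
            nn.Linear(2 * 512, 128), nn.ReLU(), nn.Linear(128, n_actions))

    def forward(self, local_view, global_view):
        # both views are (B, 1, H, W) occupancy-style maps
        z = torch.cat([self.local_enc(local_view),
                       self.global_enc(global_view)], dim=-1)
        return self.policy(z)  # action logits
```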

Many deep learning models have achieved dominant performance on the offline beat tracking task. However, online beat tracking, in which only past and present input features are available, remains challenging. In this paper, we propose the BEAt tracking Streaming Transformer (BEAST), an online joint beat and downbeat tracking system based on the streaming Transformer. To handle online scenarios, BEAST applies contextual block processing in the Transformer encoder. Moreover, we adopt relative positional encoding in the attention layers of the streaming Transformer encoder to capture relative timing, which is critically important information in music. In beat and downbeat experiments on benchmark datasets in a low-latency scenario with a maximum latency under 50 ms, BEAST achieves an F1-measure of 80.04% on beat and 52.73% on downbeat, a substantial improvement of about 5 and 13 percentage points, respectively, over the state-of-the-art online beat and downbeat tracking model.
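The relative-positional-encoding component can be illustrated with a small sketch: a learned bias indexed by the clipped query-key distance is added to the attention logits, so timing is encoded relative to the current frame. This is a generic relative-position-bias formulation, not necessarily BEAST's exact one.

```python
import torch
import torch.nn as nn

class RelPosSelfAttention(nn.Module):
    """Self-attention with a learned relative-position bias shared across
    the sequence: the bias depends only on the key-query distance."""

    def __init__(self, dim: int, heads: int = 4, max_dist: int = 128):
        super().__init__()
        self.h, self.dk = heads, dim // heads
        self.qkv = nn.Linear(dim, 3 * dim)
        self.out = nn.Linear(dim, dim)
        self.max_dist = max_dist
        self.rel_bias = nn.Parameter(torch.zeros(heads, 2 * max_dist + 1))

    def forward(self, x):                                  # x: (B, T, D)
        b, t, _ = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        q, k, v = (z.view(b, t, self.h, self.dk).transpose(1, 2) for z in (q, k, v))
        logits = q @ k.transpose(-2, -1) / self.dk ** 0.5  # (B, H, T, T)
        dist = torch.arange(t)[None, :] - torch.arange(t)[:, None]  # key - query
        dist = dist.clamp(-self.max_dist, self.max_dist) + self.max_dist
        logits = logits + self.rel_bias[:, dist]           # broadcast over batch
        attn = logits.softmax(dim=-1) @ v
        return self.out(attn.transpose(1, 2).reshape(b, t, -1))
```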

Recent years have witnessed an upsurge in research interest and applications of machine learning on graphs. However, manually designing the optimal machine learning algorithms for different graph datasets and tasks is inflexible, labor-intensive, and requires expert knowledge, limiting its adaptivity and applicability. Automated machine learning (AutoML) on graphs, which aims to automatically design the optimal machine learning algorithm for a given graph dataset and task, has received considerable attention. However, none of the existing libraries can fully support AutoML on graphs. To fill this gap, we present Automated Graph Learning (AutoGL), the first dedicated library for automated machine learning on graphs. AutoGL is open source, easy to use, and flexible to extend. Specifically, we propose a three-layer architecture, consisting of backends to interface with devices, a complete automated graph learning pipeline, and supported graph applications. The automated machine learning pipeline further contains five functional modules: auto feature engineering, neural architecture search, hyper-parameter optimization, model training, and auto ensemble, covering the majority of existing AutoML methods on graphs. For each module, we provide numerous state-of-the-art methods and flexible base classes and APIs, which allow easy usage and customization. We further provide experimental results to showcase the usage of our AutoGL library. We also present AutoGL-light, a lightweight version of AutoGL to facilitate customizing pipelines and enriching applications, as well as benchmarks for graph neural architecture search. The code of AutoGL is publicly available at https://github.com/THUMNLab/AutoGL.
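To illustrate how the five functional modules might be composed into one pipeline, here is a hypothetical sketch; the class and function names below are illustrative assumptions, not the AutoGL API, which should be taken from the library's documentation.

```python
# Hypothetical orchestration of the five modules described above:
# auto feature engineering -> NAS -> HPO -> training -> auto ensemble.
from dataclasses import dataclass, field
from typing import Callable, List, Optional

@dataclass
class GraphAutoMLPipeline:
    feature_engineers: List[Callable] = field(default_factory=list)
    search_architecture: Optional[Callable] = None  # neural architecture search
    tune_hyperparams: Optional[Callable] = None     # hyper-parameter optimization
    train: Optional[Callable] = None                # model training
    ensemble: Optional[Callable] = None             # auto ensemble

    def run(self, dataset):
        for fe in self.feature_engineers:           # auto feature engineering
            dataset = fe(dataset)
        arch = self.search_architecture(dataset)
        config = self.tune_hyperparams(arch, dataset)
        models = [self.train(arch, config, dataset)]
        return self.ensemble(models) if self.ensemble else models[0]
```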

Diffusion models (DMs) have shown great potential for high-quality image synthesis. However, when it comes to producing images with complex scenes, how to properly describe both image global structures and object details remains a challenging task. In this paper, we present Frido, a Feature Pyramid Diffusion model that performs a multi-scale coarse-to-fine denoising process for image synthesis. Our model decomposes an input image into scale-dependent vector-quantized features, followed by coarse-to-fine gating to produce the image output. During this multi-scale representation learning stage, additional input conditions such as text, scene graphs, or image layouts can be further exploited. Thus, Frido can also be applied to conditional or cross-modality image synthesis. We conduct extensive experiments over various unconditional and conditional image generation tasks, ranging from text-to-image synthesis and layout-to-image to scene-graph-to-image and label-to-image. More specifically, we achieve state-of-the-art FID scores on five benchmarks, namely layout-to-image on COCO and OpenImages, scene-graph-to-image on COCO and Visual Genome, and label-to-image on COCO. Code is available at https://github.com/davidhalladay/Frido.
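The coarse-to-fine denoising loop can be sketched as follows: the coarsest feature map is denoised first, and each finer scale is then denoised conditioned on the upsampled coarser result. The `denoisers` interface and upsampling choice are assumptions for illustration, not Frido's actual implementation.

```python
import torch

def coarse_to_fine_sample(denoisers, shapes, n_steps=50):
    """Multi-scale sampling sketch. `denoisers` holds one denoising
    network per scale (coarsest first), each called as
    denoise(x, t, coarser); `shapes` gives the latent shape per scale."""
    coarser = None
    x = None
    for denoise, shape in zip(denoisers, shapes):   # coarsest -> finest
        x = torch.randn(shape)
        for t in reversed(range(n_steps)):
            x = denoise(x, t, coarser)              # conditioned on coarser scale
        # the finished scale conditions the next, finer one
        coarser = torch.nn.functional.interpolate(x, scale_factor=2)
    return x
```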

The Transformer is a promising neural network learner that has achieved great success in various machine learning tasks. Thanks to the recent prevalence of multimodal applications and big data, Transformer-based multimodal learning has become a hot topic in AI research. This paper presents a comprehensive survey of Transformer techniques oriented toward multimodal data. The main contents of this survey include: (1) a background of multimodal learning, the Transformer ecosystem, and the multimodal big data era, (2) a theoretical review of the Vanilla Transformer, the Vision Transformer, and multimodal Transformers from a geometrically topological perspective, (3) a review of multimodal Transformer applications via two important paradigms, i.e., multimodal pretraining and specific multimodal tasks, (4) a summary of the common challenges and designs shared by multimodal Transformer models and applications, and (5) a discussion of open problems and potential research directions for the community.

Deep learning has been the mainstream technique in the natural language processing (NLP) area. However, these techniques require large amounts of labeled data and generalize poorly across domains. Meta-learning is an emerging field in machine learning that studies approaches to learning better learning algorithms, aiming to improve algorithms in various respects, including data efficiency and generalizability. The efficacy of these approaches has been shown in many NLP tasks, yet there is no systematic survey of them in NLP, which hinders more researchers from joining the field. Our goal with this survey paper is to offer researchers pointers to relevant meta-learning works in NLP and to attract more attention from the NLP community to drive future innovation. This paper first introduces the general concepts of meta-learning and the common approaches. It then summarizes task construction settings and applications of meta-learning for various NLP problems and reviews the development of meta-learning in the NLP community.
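As one concrete example of a common approach such surveys cover, here is a minimal MAML-style meta-learning step (a generic sketch, not tied to any particular NLP system): adapt a copy of the parameters on each task's support set, evaluate on its query set, and update the shared initialization from the summed query losses.

```python
import torch

def maml_step(model, tasks, loss_fn, inner_lr=0.01, meta_opt=None):
    """One MAML-style meta-update. `tasks` is a list of
    (support, query) pairs, each a (inputs, targets) batch."""
    meta_loss = 0.0
    for support, query in tasks:
        params = {n: p for n, p in model.named_parameters()}
        # inner loop: one gradient step on the task's support set
        loss = loss_fn(
            torch.func.functional_call(model, params, support[0]), support[1])
        grads = torch.autograd.grad(loss, list(params.values()), create_graph=True)
        adapted = {n: p - inner_lr * g
                   for (n, p), g in zip(params.items(), grads)}
        # outer objective: adapted parameters evaluated on the query set
        meta_loss = meta_loss + loss_fn(
            torch.func.functional_call(model, adapted, query[0]), query[1])
    meta_opt.zero_grad()
    meta_loss.backward()
    meta_opt.step()
    return meta_loss.item()
```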

Convolutional Neural Networks (CNNs) have gained significant traction in the field of machine learning, particularly due to their high accuracy in visual recognition. Recent works have pushed the performance of GPU implementations of CNNs to significantly improve their classification and training times. With these improvements, many frameworks have become available for implementing CNNs on both CPUs and GPUs, but with no support for FPGA implementations. In this work, we present a modified version of the popular CNN framework Caffe with FPGA support. This allows classification using CNN models and specialized FPGA implementations, with the flexibility of reprogramming the device when necessary, seamless memory transactions between host and device, simple-to-use test benches, and the ability to create pipelined layer implementations. To validate the framework, we use the Xilinx SDAccel environment to implement an FPGA-based Winograd convolution engine and show that the FPGA layer can be used alongside other layers running on a host processor to run several popular CNNs (AlexNet, GoogLeNet, VGG A, Overfeat). The results show that our framework achieves 50 GFLOPS across 3x3 convolutions in the benchmarks. This is achieved within a practical framework that will aid future development of FPGA-based CNNs.
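To illustrate why a Winograd engine saves work, the sketch below implements the 1-D F(2,3) building block of the F(2x2, 3x3) algorithm: two outputs of a 3-tap convolution using 4 multiplications instead of 6 (a textbook formulation, independent of the FPGA implementation above).

```python
import numpy as np

def winograd_f23(d, g):
    """Winograd F(2,3): two outputs of a 3-tap convolution with 4
    multiplications. d: 4 input samples, g: 3 filter taps."""
    m1 = (d[0] - d[2]) * g[0]
    m2 = (d[1] + d[2]) * (g[0] + g[1] + g[2]) / 2
    m3 = (d[2] - d[1]) * (g[0] - g[1] + g[2]) / 2
    m4 = (d[1] - d[3]) * g[2]
    return np.array([m1 + m2 + m3, m2 - m3 - m4])

# sanity check against direct (sliding-window) convolution
d, g = np.random.rand(4), np.random.rand(3)
direct = np.array([d[0]*g[0] + d[1]*g[1] + d[2]*g[2],
                   d[1]*g[0] + d[2]*g[1] + d[3]*g[2]])
assert np.allclose(winograd_f23(d, g), direct)
```

The filter-side transforms depend only on g, so they can be precomputed once per layer; only the 4 element-wise products are paid per output tile, which is what makes the scheme attractive for fixed-function FPGA datapaths.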
