亚洲黄色网站不卡免费_在线观看亚洲国产成人精品_久久综合中文久久一本_一级AA免费毛片高潮视频_在线播放一区二区三区日韩免费视频_欧美亚洲国产一区二区三_亚洲国产日韩欧美永久在线观看

from arxiv, 8 pages, 4 figures, 2 tables, 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 1: GRAPP, ISBN 978-989-758-555-5, ISSN 2184-4321, pages 286-293

In the past few years, neural character animation has emerged and offered an automatic method for animating virtual characters. Their motion is synthesized by a neural network. Controlling this movement in real time with a user-defined control signal is also an important task in video games for example. Solutions based on fully-connected layers (MLPs) and Mixture-of-Experts (MoE) have given impressive results in generating and controlling various movements with close-range interactions between the environment and the virtual character. However, a major shortcoming of fully-connected layers is their computational and memory cost which may lead to sub-optimized solution. In this work, we apply pruning algorithms to compress an MLP- MoE neural network in the context of interactive character animation, which reduces its number of parameters and accelerates its computation time with a trade-off between this acceleration and the synthesized motion quality. This work demonstrates that, with the same number of experts and parameters, the pruned model produces less motion artifacts than the dense model and the learned high-level motion features are similar for both

相關內容

Neural Networks

關注 1648

神(shen)(shen)(shen)(shen)經(jing)網(wang)(wang)(wang)(wang)絡（Neural Networks）是世界上三(san)個(ge)(ge)最古老的(de)(de)(de)神(shen)(shen)(shen)(shen)經(jing)建模學(xue)(xue)(xue)(xue)(xue)(xue)(xue)會(hui)(hui)的(de)(de)(de)檔案期刊:國際神(shen)(shen)(shen)(shen)經(jing)網(wang)(wang)(wang)(wang)絡學(xue)(xue)(xue)(xue)(xue)(xue)(xue)會(hui)(hui)(INNS)、歐洲神(shen)(shen)(shen)(shen)經(jing)網(wang)(wang)(wang)(wang)絡學(xue)(xue)(xue)(xue)(xue)(xue)(xue)會(hui)(hui)(ENNS)和(he)日本神(shen)(shen)(shen)(shen)經(jing)網(wang)(wang)(wang)(wang)絡學(xue)(xue)(xue)(xue)(xue)(xue)(xue)會(hui)(hui)(JNNS)。神(shen)(shen)(shen)(shen)經(jing)網(wang)(wang)(wang)(wang)絡提供了一(yi)個(ge)(ge)論壇(tan)，以(yi)發(fa)(fa)展和(he)培育一(yi)個(ge)(ge)國際社會(hui)(hui)的(de)(de)(de)學(xue)(xue)(xue)(xue)(xue)(xue)(xue)者和(he)實踐(jian)者感興趣(qu)的(de)(de)(de)所有(you)方面的(de)(de)(de)神(shen)(shen)(shen)(shen)經(jing)網(wang)(wang)(wang)(wang)絡和(he)相(xiang)關方法的(de)(de)(de)計(ji)算智能。神(shen)(shen)(shen)(shen)經(jing)網(wang)(wang)(wang)(wang)絡歡(huan)迎高質(zhi)量(liang)論文的(de)(de)(de)提交，有(you)助(zhu)于(yu)全面的(de)(de)(de)神(shen)(shen)(shen)(shen)經(jing)網(wang)(wang)(wang)(wang)絡研(yan)究，從(cong)行為和(he)大腦建模，學(xue)(xue)(xue)(xue)(xue)(xue)(xue)習算法，通過數(shu)(shu)學(xue)(xue)(xue)(xue)(xue)(xue)(xue)和(he)計(ji)算分(fen)析(xi)，系(xi)統的(de)(de)(de)工(gong)程和(he)技(ji)術應(ying)用，大量(liang)使用神(shen)(shen)(shen)(shen)經(jing)網(wang)(wang)(wang)(wang)絡的(de)(de)(de)概念(nian)和(he)技(ji)術。這一(yi)獨(du)特而廣泛的(de)(de)(de)范圍促進了生(sheng)物(wu)和(he)技(ji)術研(yan)究之(zhi)間的(de)(de)(de)思想交流，并有(you)助(zhu)于(yu)促進對生(sheng)物(wu)啟發(fa)(fa)的(de)(de)(de)計(ji)算智能感興趣(qu)的(de)(de)(de)跨(kua)學(xue)(xue)(xue)(xue)(xue)(xue)(xue)科社區(qu)的(de)(de)(de)發(fa)(fa)展。因此，神(shen)(shen)(shen)(shen)經(jing)網(wang)(wang)(wang)(wang)絡編委會(hui)(hui)代表(biao)的(de)(de)(de)專家領(ling)域包括心(xin)理(li)學(xue)(xue)(xue)(xue)(xue)(xue)(xue)，神(shen)(shen)(shen)(shen)經(jing)生(sheng)物(wu)學(xue)(xue)(xue)(xue)(xue)(xue)(xue)，計(ji)算機科學(xue)(xue)(xue)(xue)(xue)(xue)(xue)，工(gong)程，數(shu)(shu)學(xue)(xue)(xue)(xue)(xue)(xue)(xue)，物(wu)理(li)。該雜(za)志(zhi)發(fa)(fa)表(biao)文章、信件(jian)和(he)評論以(yi)及給編輯的(de)(de)(de)信件(jian)、社論、時事、軟(ruan)件(jian)調查和(he)專利信息(xi)。文章發(fa)(fa)表(biao)在(zai)五個(ge)(ge)部分(fen)之(zhi)一(yi):認知科學(xue)(xue)(xue)(xue)(xue)(xue)(xue)，神(shen)(shen)(shen)(shen)經(jing)科學(xue)(xue)(xue)(xue)(xue)(xue)(xue)，學(xue)(xue)(xue)(xue)(xue)(xue)(xue)習系(xi)統，數(shu)(shu)學(xue)(xue)(xue)(xue)(xue)(xue)(xue)和(he)計(ji)算分(fen)析(xi)、工(gong)程和(he)應(ying)用。官網(wang)(wang)(wang)(wang)地(di)址(zhi)：

混合專家模型 · Weight · 變換 · Less · 代價 ·

2022 年 4 月 20 日

Residual Mixture of Experts

Lemeng Wu,Mengchen Liu,Yinpeng Chen,Dongdong Chen,Xiyang Dai,Lu Yuan

Mixture of Experts (MoE) is able to scale up vision transformers effectively. However, it requires prohibiting computation resources to train a large MoE transformer. In this paper, we propose Residual Mixture of Experts (RMoE), an efficient training pipeline for MoE vision transformers on downstream tasks, such as segmentation and detection. RMoE achieves comparable results with the upper-bound MoE training, while only introducing minor additional training cost than the lower-bound non-MoE training pipelines. The efficiency is supported by our key observation: the weights of an MoE transformer can be factored into an input-independent core and an input-dependent residual. Compared with the weight core, the weight residual can be efficiently trained with much less computation resource, e.g., finetuning on the downstream data. We show that, compared with the current MoE training pipeline, we get comparable results while saving over 30% training cost. When compared with state-of-the-art non- MoE transformers, such as Swin-T / CvT-13 / Swin-L, we get +1.1 / 0.9 / 1.0 mIoU gain on ADE20K segmentation and +1.4 / 1.6 / 0.6 AP gain on MS-COCO object detection task with less than 3% additional training cost.

馬爾可夫鏈 · 近似 · 圖 · 混合 · 均勻采樣 ·

2022 年 4 月 20 日

Approximate Sampling and Counting of Graphs with Near-$P$-stable Degree Intervals

Péter L. Erd?s,Tamás Róbert Mezei,István Miklós

from arxiv, 23 pages

The approximate uniform sampling of graph realizations with a given degree sequence is an everyday task in several social science, computer science, engineering etc. projects. One approach is using Markov chains. The best available current result about the well-studied switch Markov chain is that it is rapidly mixing on P-stable degree sequences (see DOI:10.1016/j.ejc.2021.103421). The switch Markov chain does not change any degree sequence. However, there are cases where degree intervals are specified rather than a single degree sequence. (A natural scenario where this problem arises is in hypothesis testing on social networks that are only partially observed.) Rechner, Strowick, and M\"uller-Hannemann introduced in 2018 the notion of degree interval Markov chain which uses three (separately well-studied) local operations (switch, hinge-flip and toggle), and employing on degree sequence realizations where any two sequences under scrutiny have very small coordinate-wise distance. Recently Amanatidis and Kleer published a beautiful paper (arXiv:2110.09068), showing that the degree interval Markov chain is rapidly mixing if the sequences are coming from a system of very thin intervals which are centered not far from a regular degree sequence. In this paper we extend substantially their result, showing that the degree interval Markov chain is rapidly mixing if the intervals are centred at P-stable degree sequences.

潛變量/隱變量 · 生成模型 · MoDELS · Performer · 學成 ·

2022 年 4 月 18 日

Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models

Ali Ghadirzadeh,Petra Poklukar,Karol Arndt,Chelsea Finn,Ville Kyrki,Danica Kragic,M?rten Bj?rkman

from arxiv, arXiv admin note: substantial text overlap with arXiv:2007.13134

We present a data-efficient framework for solving sequential decision-making problems which exploits the combination of reinforcement learning (RL) and latent variable generative models. The framework, called GenRL, trains deep policies by introducing an action latent variable such that the feed-forward policy search can be divided into two parts: (i) training a sub-policy that outputs a distribution over the action latent variable given a state of the system, and (ii) unsupervised training of a generative model that outputs a sequence of motor actions conditioned on the latent action variable. GenRL enables safe exploration and alleviates the data-inefficiency problem as it exploits prior knowledge about valid sequences of motor actions. Moreover, we provide a set of measures for evaluation of generative models such that we are able to predict the performance of the RL policy training prior to the actual training on a physical robot. We experimentally determine the characteristics of generative models that have most influence on the performance of the final policy training on two robotics tasks: shooting a hockey puck and throwing a basketball. Furthermore, we empirically demonstrate that GenRL is the only method which can safely and efficiently solve the robotics tasks compared to two state-of-the-art RL methods.

可約的 · Networking · 殘差塊 · FAST · 塊 ·

2022 年 4 月 18 日

Fast and Memory-Efficient Network Towards Efficient Image Super-Resolution

Zongcai Du,Ding Liu,Jie Liu,Jie Tang,Gangshan Wu,Lean Fu

from arxiv, Accepted by NTIRE 2022 (CVPR Workshop)

Runtime and memory consumption are two important aspects for efficient image super-resolution (EISR) models to be deployed on resource-constrained devices. Recent advances in EISR exploit distillation and aggregation strategies with plenty of channel split and concatenation operations to make full use of limited hierarchical features. In contrast, sequential network operations avoid frequently accessing preceding states and extra nodes, and thus are beneficial to reducing the memory consumption and runtime overhead. Following this idea, we design our lightweight network backbone by mainly stacking multiple highly optimized convolution and activation layers and decreasing the usage of feature fusion. We propose a novel sequential attention branch, where every pixel is assigned an important factor according to local and global contexts, to enhance high-frequency details. In addition, we tailor the residual block for EISR and propose an enhanced residual block (ERB) to further accelerate the network inference. Finally, combining all the above techniques, we construct a fast and memory-efficient network (FMEN) and its small version FMEN-S, which runs 33% faster and reduces 74% memory consumption compared with the state-of-the-art EISR model: E-RFDN, the champion in AIM 2020 efficient super-resolution challenge. Besides, FMEN-S achieves the lowest memory consumption and the second shortest runtime in NTIRE 2022 challenge on efficient super-resolution. Code is available at //github.com/NJU-Jet/FMEN.

穩健性 · 正交 · 多樣性 · Networking · DNN ·

2022 年 4 月 18 日

Towards Robust Neural Networks via Orthogonal Diversity

Kun Fang,Qinghua Tao,Yingwen Wu,Tao Li,Jia Cai,Feipeng Cai,Xiaolin Huang,Jie Yang

Deep Neural Networks (DNNs) are vulnerable to invisible perturbations on the images generated by adversarial attacks, which raises researches on the adversarial robustness of DNNs. A series of methods represented by the adversarial training and its variants have proven as one of the most effective techniques in enhancing the DNN robustness. Generally, adversarial training focuses on enriching the training data by involving perturbed data. Despite of the efficiency in defending specific attacks, adversarial training is benefited from the data augmentation, which does not contribute to the robustness of DNN itself and usually suffers from accuracy drop on clean data as well as inefficiency in unknown attacks. Towards the robustness of DNN itself, we propose a novel defense that aims at augmenting the model in order to learn features adaptive to diverse inputs, including adversarial examples. Specifically, we introduce multiple paths to augment the network, and impose orthogonality constraints on these paths. In addition, a margin-maximization loss is designed to further boost DIversity via Orthogonality (DIO). Extensive empirical results on various data sets, architectures, and attacks demonstrate the adversarial robustness of the proposed DIO.

可約的 · 估計/估計量 · 狀態估計 · Extensibility · DNN ·

2022 年 4 月 16 日

Nonlinear Reduced DNN Models for State Estimation

Wolfgang Dahmen,Min Wang,Zhu Wang

We propose in this paper a data driven state estimation scheme for generating nonlinear reduced models for parametric families of PDEs, directly providing data-to-state maps, represented in terms of Deep Neural Networks. A major constituent is a sensor-induced decomposition of a model-compliant Hilbert space warranting approximation in problem relevant metrics. It plays a similar role as in a Parametric Background Data Weak framework for state estimators based on Reduced Basis concepts. Extensive numerical tests shed light on several optimization strategies that are to improve robustness and performance of such estimators.

剪枝 · 層 · Networking · state-of-the-art · 端到端 ·

2022 年 4 月 15 日

End-to-End Sensitivity-Based Filter Pruning

Zahra Babaiee,Lucas Liebenwein,Ramin Hasani,Daniela Rus,Radu Grosu

In this paper, we present a novel sensitivity-based filter pruning algorithm (SbF-Pruner) to learn the importance scores of filters of each layer end-to-end. Our method learns the scores from the filter weights, enabling it to account for the correlations between the filters of each layer. Moreover, by training the pruning scores of all layers simultaneously our method can account for layer interdependencies, which is essential to find a performant sparse sub-network. Our proposed method can train and generate a pruned network from scratch in a straightforward, one-stage training process without requiring a pretrained network. Ultimately, we do not need layer-specific hyperparameters and pre-defined layer budgets, since SbF-Pruner can implicitly determine the appropriate number of channels in each layer. Our experimental results on different network architectures suggest that SbF-Pruner outperforms advanced pruning methods. Notably, on CIFAR-10, without requiring a pretrained baseline network, we obtain 1.02% and 1.19% accuracy gain on ResNet56 and ResNet110, compared to the baseline reported for state-of-the-art pruning algorithms. This is while SbF-Pruner reduces parameter-count by 52.3% (for ResNet56) and 54% (for ResNet101), which is better than the state-of-the-art pruning algorithms with a high margin of 9.5% and 6.6%.

Neural Networks · Networking · 可約的 · 估計/估計量 · 可辨認的 ·

2021 年 7 月 7 日

A Survey of Uncertainty in Deep Neural Networks

Jakob Gawlikowski,Cedrique Rovile Njieutcheu Tassi,Mohsin Ali,Jongseok Lee,Matthias Humt,Jianxiang Feng,Anna Kruspe,Rudolph Triebel,Peter Jung,Ribana Roscher,Muhammad Shahzad,Wen Yang,Richard Bamler,Xiao Xiang Zhu

Due to their increasing spread, confidence in neural network predictions became more and more important. However, basic neural networks do not deliver certainty estimates or suffer from over or under confidence. Many researchers have been working on understanding and quantifying uncertainty in a neural network's prediction. As a result, different types and sources of uncertainty have been identified and a variety of approaches to measure and quantify uncertainty in neural networks have been proposed. This work gives a comprehensive overview of uncertainty estimation in neural networks, reviews recent advances in the field, highlights current challenges, and identifies potential research opportunities. It is intended to give anyone interested in uncertainty estimation in neural networks a broad overview and introduction, without presupposing prior knowledge in this field. A comprehensive introduction to the most crucial sources of uncertainty is given and their separation into reducible model uncertainty and not reducible data uncertainty is presented. The modeling of these uncertainties based on deterministic neural networks, Bayesian neural networks, ensemble of neural networks, and test-time data augmentation approaches is introduced and different branches of these fields as well as the latest developments are discussed. For a practical application, we discuss different measures of uncertainty, approaches for the calibration of neural networks and give an overview of existing baselines and implementations. Different examples from the wide spectrum of challenges in different fields give an idea of the needs and challenges regarding uncertainties in practical applications. Additionally, the practical limitations of current methods for mission- and safety-critical real world applications are discussed and an outlook on the next steps towards a broader usage of such methods is given.

數據增強 · 泛化理論 · 矩 · 規范化的 · surge ·

2020 年 2 月 25 日

On Feature Normalization and Data Augmentation

Boyi Li,Felix Wu,Ser-Nam Lim,Serge Belongie,Kilian Q. Weinberger

Modern neural network training relies heavily on data augmentation for improved generalization. After the initial success of label-preserving augmentations, there has been a recent surge of interest in label-perturbing approaches, which combine features and labels across training samples to smooth the learned decision surface. In this paper, we propose a new augmentation method that leverages the first and second moments extracted and re-injected by feature normalization. We replace the moments of the learned features of one training image by those of another, and also interpolate the target labels. As our approach is fast, operates entirely in feature space, and mixes different signals than prior methods, one can effectively combine it with existing augmentation methods. We demonstrate its efficacy across benchmark data sets in computer vision, speech, and natural language processing, where it consistently improves the generalization performance of highly competitive baseline networks.

Networking · Neural Networks · MoDELS · Performer · 模型性能 ·

2019 年 9 月 8 日

A Survey of Model Compression and Acceleration for Deep Neural Networks

Yu Cheng,Duo Wang,Pan Zhou,Tao Zhang

from arxiv, Published in IEEE Signal Processing Magazine, arXiv version including some recent works

Deep convolutional neural networks (CNNs) have recently achieved great success in many visual recognition tasks. However, existing deep neural network models are computationally expensive and memory intensive, hindering their deployment in devices with low memory resources or in applications with strict latency requirements. Therefore, a natural thought is to perform model compression and acceleration in deep networks without significantly decreasing the model performance. During the past few years, tremendous progress has been made in this area. In this paper, we survey the recent advanced techniques for compacting and accelerating CNNs model developed. These techniques are roughly categorized into four schemes: parameter pruning and sharing, low-rank factorization, transferred/compact convolutional filters, and knowledge distillation. Methods of parameter pruning and sharing will be described at the beginning, after that the other techniques will be introduced. For each scheme, we provide insightful analysis regarding the performance, related applications, advantages, and drawbacks etc. Then we will go through a few very recent additional successful methods, for example, dynamic capacity networks and stochastic depths networks. After that, we survey the evaluation matrix, the main datasets used for evaluating the model performance and recent benchmarking efforts. Finally, we conclude this paper, discuss remaining challenges and possible directions on this topic.