亚洲男人的天堂2018av,欧美草比,久久久久久免费视频精选,国色天香在线看免费,久久久久亚洲av成人片仓井空

<tr id='Ah6Z4'><strong id='RRH30'></strong><small id='JQw58'></small><button id='R4zXt'></button><li id='aoYsH'><noscript id='82jD2'><big id='v1EXG'></big><dt id='cQ6MO'></dt></noscript></li></tr><ol id='69bBn'><option id='mDjup'><table id='HnOO5'><blockquote id='mtMs7'><tbody id='1HdbO'></tbody></blockquote></table></option></ol><u id='TmIy6'></u><kbd id='a8Xjx'><kbd id='cmxfu'></kbd></kbd>

<code id='Z0mzA'><strong id='fxvms'></strong></code>

<fieldset id='F0Igi'></fieldset>

<span id='pZjho'></span>

<ins id='E6cdt'></ins>

<acronym id='AumYd'><em id='UQSgT'></em><td id='x7HR9'><div id='TpDdZ'></div></td></acronym><address id='33ehT'><big id='fEaR3'><big id='ToWpo'></big><legend id='RVp2k'></legend></big></address>

<i id='zXfby'><div id='ayJ28'><ins id='DT0Qe'></ins></div></i>

<i id='3xzli'></i>

·

判別器 · 圖卷積 · 注意力機制 · 長短期記憶網絡 · 圖 ·

2019 年 3 月 29 日

An Attention Enhanced Graph Convolutional LSTM Network for Skeleton-Based Action Recognition

Chenyang Si,Wentao Chen,Wei Wang,Liang Wang,Tieniu Tan

from arxiv, Accepted by CVPR2019

Skeleton-based action recognition is an important task that requires the adequate understanding of movement characteristics of a human action from the given skeleton sequence. Recent studies have shown that exploring spatial and temporal features of the skeleton sequence is vital for this task. Nevertheless, how to effectively extract discriminative spatial and temporal features is still a challenging problem. In this paper, we propose a novel Attention Enhanced Graph Convolutional LSTM Network (AGC-LSTM) for human action recognition from skeleton data. The proposed AGC-LSTM can not only capture discriminative features in spatial configuration and temporal dynamics but also explore the co-occurrence relationship between spatial and temporal domains. We also present a temporal hierarchical architecture to increases temporal receptive fields of the top AGC-LSTM layer, which boosts the ability to learn the high-level semantic representation and significantly reduces the computation cost. Furthermore, to select discriminative spatial information, the attention mechanism is employed to enhance information of key joints in each AGC-LSTM layer. Experimental results on two datasets are provided: NTU RGB+D dataset and Northwestern-UCLA dataset. The comparison results demonstrate the effectiveness of our approach and show that our approach outperforms the state-of-the-art methods on both datasets.

相關內容

判別器

圖注意力網絡 · 文本分類 · 圖 · 注意力機制 · Networking ·

2020 年 3 月 22 日

Multi-Label Text Classification using Attention-based Graph Neural Network

Ankit Pal,Muru Selvakumar,Malaikannan Sankarasubbu

In Multi-Label Text Classification (MLTC), one sample can belong to more than one class. It is observed that most MLTC tasks, there are dependencies or correlations among labels. Existing methods tend to ignore the relationship among labels. In this paper, a graph attention network-based model is proposed to capture the attentive dependency structure among the labels. The graph attention network uses a feature matrix and a correlation matrix to capture and explore the crucial dependencies between the labels and generate classifiers for the task. The generated classifiers are applied to sentence feature vectors obtained from the text feature extraction network (BiLSTM) to enable end-to-end training. Attention allows the system to assign different weights to neighbor nodes per label, thus allowing it to learn the dependencies among labels implicitly. The results of the proposed model are validated on five real-world MLTC datasets. The proposed model achieves similar or better performance compared to the previous state-of-the-art models.

圖卷積神經網絡/圖卷積網絡 · 圖 · 圖卷積 · state-of-the-art · Networking ·

2019 年 9 月 7 日

Graph Convolutional Networks for Temporal Action Localization

Runhao Zeng,Wenbing Huang,Mingkui Tan,Yu Rong,Peilin Zhao,Junzhou Huang,Chuang Gan

from arxiv, ICCV 2019

Most state-of-the-art action localization systems process each action proposal individually, without explicitly exploiting their relations during learning. However, the relations between proposals actually play an important role in action localization, since a meaningful action always consists of multiple proposals in a video. In this paper, we propose to exploit the proposal-proposal relations using Graph Convolutional Networks (GCNs). First, we construct an action proposal graph, where each proposal is represented as a node and their relations between two proposals as an edge. Here, we use two types of relations, one for capturing the context information for each proposal and the other one for characterizing the correlations between distinct actions. Then we apply the GCNs over the graph to model the relations among different proposals and learn powerful representations for the action classification and localization. Experimental results show that our approach significantly outperforms the state-of-the-art on THUMOS14 (49.1% versus 42.8%). Moreover, augmentation experiments on ActivityNet also verify the efficacy of modeling action proposal relationships. Codes are available at //github.com/Alvin-Zeng/PGCN.

entity · 命名實體識別 · Networking · 卷積 · Extensibility ·

2019 年 4 月 30 日

CAN-NER: Convolutional Attention Network for Chinese Named Entity Recognition

Yuying Zhu,Guoxin Wang,B?rje F. Karlsson

from arxiv, This paper is accepted by NAACL-HLT 2019

Named entity recognition (NER) in Chinese is essential but difficult because of the lack of natural delimiters. Therefore, Chinese Word Segmentation (CWS) is usually considered as the first step for Chinese NER. However, models based on word-level embeddings and lexicon features often suffer from segmentation errors and out-of-vocabulary (OOV) words. In this paper, we investigate a Convolutional Attention Network called CAN for Chinese NER, which consists of a character-based convolutional neural network (CNN) with local-attention layer and a gated recurrent unit (GRU) with global self-attention layer to capture the information from adjacent characters and sentence contexts. Also, compared to other models, not depending on any external resources like lexicons and employing small size of char embeddings make our model more practical. Extensive experimental results show that our approach outperforms state-of-the-art methods without word embedding and external lexicon resources on different domain datasets including Weibo, MSRA and Chinese Resume NER dataset.

Networking · MoDELS · PARCO · 卷積 · INTERACT ·

2019 年 4 月 8 日

Convolutional Self-Attention Network

Baosong Yang,Longyue Wang,Derek F. Wong,Lidia S. Chao,Zhaopeng Tu

from arxiv, The least version of this paper has been uploaded to another link: arXiv:1904.03107

Self-attention network (SAN) has recently attracted increasing interest due to its fully parallelized computation and flexibility in modeling dependencies. It can be further enhanced with multi-headed attention mechanism by allowing the model to jointly attend to information from different representation subspaces at different positions (Vaswani et al., 2017). In this work, we propose a novel convolutional self-attention network (CSAN), which offers SAN the abilities to 1) capture neighboring dependencies, and 2) model the interaction between multiple attention heads. Experimental results on WMT14 English-to-German translation task demonstrate that the proposed approach outperforms both the strong Transformer baseline and other existing works on enhancing the locality of SAN. Comparing with previous work, our model does not introduce any new parameters.

視頻描述生成（Video Caption） · 端到端 · 解碼 · state-of-the-art · Networking ·

2019 年 4 月 4 日

An End-to-End Baseline for Video Captioning

Silvio Olivastri,Gurkirt Singh,Fabio Cuzzolin

from arxiv, 8-main-pages and 2-pages references

Building correspondences across different modalities, such as video and language, has recently become critical in many visual recognition applications, such as video captioning. Inspired by machine translation, recent models tackle this task using an encoder-decoder strategy. The (video) encoder is traditionally a Convolutional Neural Network (CNN), while the decoding (for language generation) is done using a Recurrent Neural Network (RNN). Current state-of-the-art methods, however, train encoder and decoder separately. CNNs are pretrained on object and/or action recognition tasks and used to encode video-level features. The decoder is then optimised on such static features to generate the video's description. This disjoint setup is arguably sub-optimal for input (video) to output (description) mapping. In this work, we propose to optimise both encoder and decoder simultaneously in an end-to-end fashion. In a two-stage training setting, we first initialise our architecture using pre-trained encoders and decoders -- then, the entire network is trained end-to-end in a fine-tuning stage to learn the most relevant features for video caption generation. In our experiments, we use GoogLeNet and Inception-ResNet-v2 as encoders and an original Soft-Attention (SA-) LSTM as a decoder. Analogously to gains observed in other computer vision problems, we show that end-to-end training significantly improves over the traditional, disjoint training process. We evaluate our End-to-End (EtENet) Networks on the Microsoft Research Video Description (MSVD) and the MSR Video to Text (MSR-VTT) benchmark datasets, showing how EtENet achieves state-of-the-art performance across the board.

entity · 命名實體識別 · Networking · 卷積 · Extensibility ·

2019 年 4 月 3 日

CAN-NER: Convolutional Attention Network forChinese Named Entity Recognition

Yuying Zhu,Guoxin Wang,B?rje F. Karlsson

Named entity recognition (NER) in Chinese is essential but difficult because of the lack of natural delimiters. Therefore, Chinese Word Segmentation (CWS) is usually considered as the first step for Chinese NER. However, models based on word-level embeddings and lexicon features often suffer from segmentation errors and out-of-vocabulary (OOV) words. In this paper, we investigate a Convolutional Attention Network called CAN for Chinese NER, which consists of a character-based convolutional neural network (CNN) with local-attention layer and a gated recurrent unit (GRU) with global self-attention layer to capture the information from adjacent characters and sentence contexts. Also, compared to other models, not depending on any external resources like lexicons and employing small size of char embeddings make our model more practical. Extensive experimental results show that our approach outperforms state-of-the-art methods without word embedding and external lexicon resources on different domain datasets including Weibo, MSRA and Chinese Resume NER dataset.

視覺問答 · 圖注意力網絡 · 圖 · INTERACT · 注意力機制 ·

2019 年 3 月 29 日

Relation-aware Graph Attention Network for Visual Question Answering

Linjie Li,Zhe Gan,Yu Cheng,Jingjing Liu

In order to answer semantically-complicated questions about an image, a Visual Question Answering (VQA) model needs to fully understand the visual scene in the image, especially the interactive dynamics between different objects. We propose a Relation-aware Graph Attention Network (ReGAT), which encodes each image into a graph and models multi-type inter-object relations via a graph attention mechanism, to learn question-adaptive relation representations. Two types of visual object relations are explored: (i) Explicit Relations that represent geometric positions and semantic interactions between objects; and (ii) Implicit Relations that capture the hidden dynamics between image regions. Experiments demonstrate that ReGAT outperforms prior state-of-the-art approaches on both VQA 2.0 and VQA-CP v2 datasets. We further show that ReGAT is compatible to existing VQA architectures, and can be used as a generic relation encoder to boost the model performance for VQA.

注意力機制 · Networking · state-of-the-art · 模型評估 · 可辨認的 ·

2018 年 9 月 6 日

Global-and-local attention networks for visual recognition

Drew Linsley,Dan Shiebler,Sven Eberhardt,Thomas Serre

State-of-the-art deep convolutional networks (DCNs) such as squeeze-and- excitation (SE) residual networks implement a form of attention, also known as contextual guidance, which is derived from global image features. Here, we explore a complementary form of attention, known as visual saliency, which is derived from local image features. We extend the SE module with a novel global-and-local attention (GALA) module which combines both forms of attention -- resulting in state-of-the-art accuracy on ILSVRC. We further describe ClickMe.ai, a large-scale online experiment designed for human participants to identify diagnostic image regions to co-train a GALA network. Adding humans-in-the-loop is shown to significantly improve network accuracy, while also yielding visual features that are more interpretable and more similar to those used by human observers.

視頻描述生成（Video Caption） · Networking · 后向 · 前向 · 狀態序列 ·

2018 年 3 月 30 日

Reconstruction Network for Video Captioning

Bairui Wang,Lin Ma,Wei Zhang,Wei Liu

from arxiv, Accepted by CVPR 2018

In this paper, the problem of describing visual contents of a video sequence with natural language is addressed. Unlike previous video captioning work mainly exploiting the cues of video contents to make a language description, we propose a reconstruction network (RecNet) with a novel encoder-decoder-reconstructor architecture, which leverages both the forward (video to sentence) and backward (sentence to video) flows for video captioning. Specifically, the encoder-decoder makes use of the forward flow to produce the sentence description based on the encoded video semantic features. Two types of reconstructors are customized to employ the backward flow and reproduce the video features based on the hidden state sequence generated by the decoder. The generation loss yielded by the encoder-decoder and the reconstruction loss introduced by the reconstructor are jointly drawn into training the proposed RecNet in an end-to-end fashion. Experimental results on benchmark datasets demonstrate that the proposed reconstructor can boost the encoder-decoder models and leads to significant gains in video caption accuracy.

entity · 圖卷積神經網絡/圖卷積網絡 · Performer · 命名實體識別 · 圖卷積 ·

2018 年 2 月 14 日

Graph Convolutional Networks for Named Entity Recognition

A. Cetoli,S. Bragaglia,A. D. O'Harney,M. Sloan

from arxiv, Accepted at the 16th International Workshop on Treebanks and Linguistic Theories

In this paper we investigate the role of the dependency tree in a named entity recognizer upon using a set of GCN. We perform a comparison among different NER architectures and show that the grammar of a sentence positively influences the results. Experiments on the ontonotes dataset demonstrate consistent performance improvements, without requiring heavy feature engineering nor additional language-specific knowledge.

閱讀: 0 點贊: 0

小貼士

登錄享

相關主題

注意力機制

長短期記憶網絡

北京阿比特科技有限公司

注冊地址：北京市海淀區羊坊店路18號2幢3層301-191

<tr id='eeilp'><strong id='eeilp'></strong><small id='eeilp'></small><button id='eeilp'></button><li id='eeilp'><noscript id='eeilp'><big id='eeilp'></big><dt id='eeilp'></dt></noscript></li></tr><ol id='eeilp'><option id='eeilp'><table id='eeilp'><blockquote id='eeilp'><tbody id='eeilp'></tbody></blockquote></table></option></ol><u id='eeilp'></u><kbd id='eeilp'><kbd id='eeilp'></kbd></kbd>

<code id='eeilp'><strong id='eeilp'></strong></code>

<fieldset id='eeilp'></fieldset>

<span id='eeilp'></span>

<ins id='eeilp'></ins>

<acronym id='eeilp'><em id='eeilp'></em><td id='eeilp'><div id='eeilp'></div></td></acronym><address id='eeilp'><big id='eeilp'><big id='eeilp'></big><legend id='eeilp'></legend></big></address>

<i id='eeilp'><div id='eeilp'><ins id='eeilp'></ins></div></i>

<i id='eeilp'></i>