久久久久精品电影-欧美精品一区二区视频在线观看

We present Emu, a system that semantically enhances multilingual sentence embeddings. Our framework fine-tunes pre-trained multilingual sentence embeddings using two main components: a semantic classifier and a language discriminator. The semantic classifier improves the semantic similarity of related sentences, whereas the language discriminator enhances the multilinguality of the embeddings via multilingual adversarial training. Our experimental results based on several language pairs show that our specialized embeddings outperform the state-of-the-art multilingual sentence embedding model on the task of cross-lingual intent classification using only monolingual labeled data.

相關內容

判別器

關注 0

語義分析 · 大阪大學 · 人工智能 · 自然語言處理 · AAAI2020 ·

2019 年 11 月 11 日

[付費5元查看完整內容]【AAAI2020接受論文】Emu:使用語義專門化增強多語言句子嵌入，Emu: Enhancing Multilingual Sentence Embeddings with Semantic Specialization

專知會員服務

專知，提供專業可信的知識分發服務，讓認知協作更快更好！

摘要：我們提出了EMU，一個從語義上增強多語言句子嵌入系統。我們的框架使用兩個主要組件(語義分類器和語言鑒別器)對預先訓練好的多語言句子嵌入進行了微調。語義分類器提高了相關句子的語義相似度，而語言鑒別器通過多語言對抗訓練增強了嵌入語的多語言性。我們的實驗結果基于幾個語言對表明，我們的專門嵌入優于最先進的多語言句子嵌入模型的任務，跨語言意圖分類僅使用單語標記的數據。

付費5元查看完整內容

BERT · 相似度 · 詞元分析器 · 圖像字幕 · Better ·

2019 年 4 月 21 日

BERTScore: Evaluating Text Generation with BERT

Tianyi Zhang,Varsha Kishore,Felix Wu,Kilian Q. Weinberger,Yoav Artzi

from arxiv, Code available at //github.com/Tiiiger/bert_score

We propose BERTScore, an automatic evaluation metric for text generation. Analogous to common metrics, \method computes a similarity score for each token in the candidate sentence with each token in the reference. However, instead of looking for exact matches, we compute similarity using contextualized BERT embeddings. We evaluate on several machine translation and image captioning benchmarks, and show that BERTScore correlates better with human judgments than existing metrics, often significantly outperforming even task-specific supervised metrics.

詞向量表示 · 無監督 · MoDELS · 學成 · 相互獨立的 ·

2018 年 9 月 6 日

Unsupervised Multilingual Word Embeddings

Xilun Chen,Claire Cardie

from arxiv, EMNLP 2018

Multilingual Word Embeddings (MWEs) represent words from multiple languages in a single distributional vector space. Unsupervised MWE (UMWE) methods acquire multilingual embeddings without cross-lingual supervision, which is a significant advantage over traditional supervised approaches and opens many new possibilities for low-resource languages. Prior art for learning UMWEs, however, merely relies on a number of independently trained Unsupervised Bilingual Word Embeddings (UBWEs) to obtain multilingual embeddings. These methods fail to leverage the interdependencies that exist among many languages. To address this shortcoming, we propose a fully unsupervised framework for learning MWEs that directly exploits the relations between all language pairs. Our model substantially outperforms previous approaches in the experiments on multilingual word translation and cross-lingual word similarity. In addition, our model even beats supervised approaches trained with cross-lingual resources.

損失函數（機器學習） · 判別器 · 情感分析 · 泛函 · state-of-the-art ·

2018 年 6 月 15 日

Learning Semantic Sentence Embeddings using Pair-wise Discriminator

Badri N. Patro,Vinod K. Kurmi,Sandeep Kumar,Vinay P. Namboodiri

from arxiv, COLING 2018 (accepted). arXiv admin note: text overlap with arXiv:1405.4053, arXiv:1709.05074 by other authors

In this paper, we propose a method for obtaining sentence-level embeddings. While the problem of securing word-level embeddings is very well studied, we propose a novel method for obtaining sentence-level embeddings. This is obtained by a simple method in the context of solving the paraphrase generation task. If we use a sequential encoder-decoder model for generating paraphrase, we would like the generated paraphrase to be semantically close to the original sentence. One way to ensure this is by adding constraints for true paraphrase embeddings to be close and unrelated paraphrase candidate sentence embeddings to be far. This is ensured by using a sequential pair-wise discriminator that shares weights with the encoder that is trained with a suitable loss function. Our loss function penalizes paraphrase sentence embedding distances from being too large. This loss is used in combination with a sequential encoder-decoder network. We also validated our method by evaluating the obtained embeddings for a sentiment analysis task. The proposed method results in semantic embeddings and outperforms the state-of-the-art on the paraphrase generation and sentiment analysis task on standard datasets. These results are also shown to be statistically significant.

視覺問答 · entity · 注意力機制 · 多峰值 · 學成 ·

2018 年 5 月 24 日

R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering

Pan Lu,Lei Ji,Wei Zhang,Nan Duan,Ming Zhou,Jianyong Wang

from arxiv, 10 pages, 5 figures, accepted in SIGKDD 2018

Recently, Visual Question Answering (VQA) has emerged as one of the most significant tasks in multimodal learning as it requires understanding both visual and textual modalities. Existing methods mainly rely on extracting image and question features to learn their joint feature embedding via multimodal fusion or attention mechanism. Some recent studies utilize external VQA-independent models to detect candidate entities or attributes in images, which serve as semantic knowledge complementary to the VQA task. However, these candidate entities or attributes might be unrelated to the VQA task and have limited semantic capacities. To better utilize semantic knowledge in images, we propose a novel framework to learn visual relation facts for VQA. Specifically, we build up a Relation-VQA (R-VQA) dataset based on the Visual Genome dataset via a semantic similarity module, in which each data consists of an image, a corresponding question, a correct answer and a supporting relation fact. A well-defined relation detector is then adopted to predict visual question-related relation facts. We further propose a multi-step attention model composed of visual attention and semantic attention sequentially to extract related visual knowledge and semantic knowledge. We conduct comprehensive experiments on the two benchmark datasets, demonstrating that our model achieves state-of-the-art performance and verifying the benefit of considering visual relation facts.

Machine Translation · INFORMS · Performer · state-of-the-art · 數據集 ·

2018 年 5 月 18 日

Metric for Automatic Machine Translation Evaluation based on Universal Sentence Representations

Hiroki Shimanaka,Tomoyuki Kajiwara,Mamoru Komachi

from arxiv, NAACL 2018 Student Research Workshop; 6 pages

Sentence representations can capture a wide range of information that cannot be captured by local features based on character or word N-grams. This paper examines the usefulness of universal sentence representations for evaluating the quality of machine translation. Although it is difficult to train sentence representations using small-scale translation datasets with manual evaluation, sentence representations trained from large-scale data in other tasks can improve the automatic evaluation of machine translation. Experimental results of the WMT-2016 dataset show that the proposed method achieves state-of-the-art performance with sentence representation features only.

NMT · Machine Translation · MoDELS · Better · 訓練數據 ·

2018 年 3 月 1 日

Joint Training for Neural Machine Translation Models with Monolingual Data

Zhirui Zhang,Shujie Liu,Mu Li,Ming Zhou,Enhong Chen

from arxiv, Accepted by AAAI 2018

Monolingual data have been demonstrated to be helpful in improving translation quality of both statistical machine translation (SMT) systems and neural machine translation (NMT) systems, especially in resource-poor or domain adaptation tasks where parallel data are not rich enough. In this paper, we propose a novel approach to better leveraging monolingual data for neural machine translation by jointly learning source-to-target and target-to-source NMT models for a language pair with a joint EM optimization method. The training process starts with two initial NMT models pre-trained on parallel data for each direction, and these two models are iteratively updated by incrementally decreasing translation losses on training data. In each iteration step, both NMT models are first used to translate monolingual data from one language to the other, forming pseudo-training data of the other NMT model. Then two new NMT models are learnt from parallel data together with the pseudo training data. Both NMT models are expected to be improved and better pseudo-training data can be generated in next step. Experiment results on Chinese-English and English-German translation tasks show that our approach can simultaneously improve translation quality of source-to-target and target-to-source models, significantly outperforming strong baseline systems which are enhanced with monolingual data for model training including back-translation.

詞向量表示 · 無監督 · 監督 · state-of-the-art · Pair ·

2018 年 1 月 30 日

Word Translation Without Parallel Data

Alexis Conneau,Guillaume Lample,Marc'Aurelio Ranzato,Ludovic Denoyer,Hervé Jégou

from arxiv, ICLR 2018

State-of-the-art methods for learning cross-lingual word embeddings have relied on bilingual dictionaries or parallel corpora. Recent studies showed that the need for parallel data supervision can be alleviated with character-level information. While these methods showed encouraging results, they are not on par with their supervised counterparts and are limited to pairs of languages sharing a common alphabet. In this work, we show that we can build a bilingual dictionary between two languages without using any parallel corpora, by aligning monolingual word embedding spaces in an unsupervised way. Without using any character information, our model even outperforms existing supervised methods on cross-lingual tasks for some language pairs. Our experiments demonstrate that our method works very well also for distant language pairs, like English-Russian or English-Chinese. We finally describe experiments on the English-Esperanto low-resource language pair, on which there only exists a limited amount of parallel data, to show the potential impact of our method in fully unsupervised machine translation. Our code, embeddings and dictionaries are publicly available.

語義相似度 · 相似度 · Performer · 向量化 · entity ·

2018 年 1 月 19 日

A Resource-Light Method for Cross-Lingual Semantic Textual Similarity

Goran Glava?,Marc Franco-Salvador,Simone Paolo Ponzetto,Paolo Rosso

from arxiv, Accepted for publication in Knowledge-Based Systems journal

Recognizing semantically similar sentences or paragraphs across languages is beneficial for many tasks, ranging from cross-lingual information retrieval and plagiarism detection to machine translation. Recently proposed methods for predicting cross-lingual semantic similarity of short texts, however, make use of tools and resources (e.g., machine translation systems, syntactic parsers or named entity recognition) that for many languages (or language pairs) do not exist. In contrast, we propose an unsupervised and a very resource-light approach for measuring semantic similarity between texts in different languages. To operate in the bilingual (or multilingual) space, we project continuous word vectors (i.e., word embeddings) from one language to the vector space of the other language via the linear translation model. We then align words according to the similarity of their vectors in the bilingual embedding space and investigate different unsupervised measures of semantic similarity exploiting bilingual embeddings and word alignments. Requiring only a limited-size set of word translation pairs between the languages, the proposed approach is applicable to virtually any pair of languages for which there exists a sufficiently large corpus, required to learn monolingual word embeddings. Experimental results on three different datasets for measuring semantic textual similarity show that our simple resource-light approach reaches performance close to that of supervised and resource intensive methods, displaying stability across different language pairs. Furthermore, we evaluate the proposed method on two extrinsic tasks, namely extraction of parallel sentences from comparable corpora and cross lingual plagiarism detection, and show that it yields performance comparable to those of complex resource-intensive state-of-the-art models for the respective tasks.

Better · 表示 · MoDELS · 設計 · 自然語言處理 ·

2017 年 12 月 2 日

Improving Visually Grounded Sentence Representations with Self-Attention

Kang Min Yoo,Youhyun Shin,Sang-goo Lee

Sentence representation models trained only on language could potentially suffer from the grounding problem. Recent work has shown promising results in improving the qualities of sentence representations by jointly training them with associated image features. However, the grounding capability is limited due to distant connection between input sentences and image features by the design of the architecture. In order to further close the gap, we propose applying self-attention mechanism to the sentence encoder to deepen the grounding effect. Our results on transfer tasks show that self-attentive encoders are better for visual grounding, as they exploit specific words with strong visual associations.