人人干人人摸人人操,国产精品一看一级毛片,免费又粗又黄又硬又爽大片高清

One of the most popular methods for context-aware machine translation (MT) is to use separate encoders for the source sentence and context as multiple sources for one target sentence. Recent work has cast doubt on whether these models actually learn useful signals from the context or are improvements in automatic evaluation metrics just a side-effect. We show that multi-source transformer models improve MT over standard transformer-base models even with empty lines provided as context, but the translation quality improves significantly (1.51 - 2.65 BLEU) when a sufficient amount of correct context is provided. We also show that even though randomly shuffling in-domain context can also improve over baselines, the correct context further improves translation quality and random out-of-domain context further degrades it.

相關內容

Machine Translation

關注 0

機器翻譯（Machine Translation）涵蓋計算語(yu)(yu)言學(xue)和語(yu)(yu)言工程(cheng)的所有分支(zhi)，包含多(duo)語(yu)(yu)言方面(mian)。特色論文涵蓋理論，描述或計算方面(mian)的任何(he)下列主題:雙語(yu)(yu)和多(duo)語(yu)(yu)語(yu)(yu)料庫的編寫和使(shi)用，計算機輔助語(yu)(yu)言教學(xue)，非(fei)羅馬字符(fu)集的計算含義，連接主義翻譯方法，對比語(yu)(yu)言學(xue)等。官(guan)網地(di)址：

NMT · MoDELS · Machine Translation · AIM · 輸出 ·

2019 年 1 月 25 日

Context in Neural Machine Translation: A Review of Models and Evaluations

Andrei Popescu-Belis

This review paper discusses how context has been used in neural machine translation (NMT) in the past two years (2017-2018). Starting with a brief retrospect on the rapid evolution of NMT models, the paper then reviews studies that evaluate NMT output from various perspectives, with emphasis on those analyzing limitations of the translation of contextual phenomena. In a subsequent version, the paper will then present the main methods that were proposed to leverage context for improving translation quality, and distinguishes methods that aim to improve the translation of specific phenomena from those that consider a wider unstructured context.

Machine Translation · MoDELS · INFORMS · NMT · 基準 ·

2018 年 6 月 12 日

Fusing Recency into Neural Machine Translation with an Inter-Sentence Gate Model

Shaohui Kuang,Deyi Xiong

from arxiv, Accepted by COLING2018.11 pages,4 figures

Neural machine translation (NMT) systems are usually trained on a large amount of bilingual sentence pairs and translate one sentence at a time, ignoring inter-sentence information. This may make the translation of a sentence ambiguous or even inconsistent with the translations of neighboring sentences. In order to handle this issue, we propose an inter-sentence gate model that uses the same encoder to encode two adjacent sentences and controls the amount of information flowing from the preceding sentence to the translation of the current sentence with an inter-sentence gate. In this way, our proposed model can capture the connection between sentences and fuse recency from neighboring sentences into neural machine translation. On several NIST Chinese-English translation tasks, our experiments demonstrate that the proposed inter-sentence gate model achieves substantial improvements over the baseline.

可約的 · Machine Translation · MoDELS · NMT · Extensibility ·

2018 年 5 月 29 日

Bi-Directional Neural Machine Translation with Synthetic Parallel Data

Xing Niu,Michael Denkowski,Marine Carpuat

from arxiv, Accepted at the 2nd Workshop on Neural Machine Translation and Generation (WNMT 2018)

Despite impressive progress in high-resource settings, Neural Machine Translation (NMT) still struggles in low-resource and out-of-domain scenarios, often failing to match the quality of phrase-based translation. We propose a novel technique that combines back-translation and multilingual NMT to improve performance in these difficult cases. Our technique trains a single model for both directions of a language pair, allowing us to back-translate source or target monolingual data without requiring an auxiliary model. We then continue training on the augmented parallel data, enabling a cycle of improvement for a single model that can incorporate any source, target, or parallel data to improve both translation directions. As a byproduct, these models can reduce training and deployment costs significantly compared to uni-directional models. Extensive experiments show that our technique outperforms standard back-translation in low-resource scenarios, improves quality on cross-domain tasks, and effectively reduces costs across the board.

Machine Translation · NMT · MoDELS · INFORMS · Pair ·

2018 年 5 月 28 日

Inducing Grammars with and for Neural Machine Translation

Ke Tran,Yonatan Bisk

from arxiv, accepted at NMT workshop (WNMT 2018)

Machine translation systems require semantic knowledge and grammatical understanding. Neural machine translation (NMT) systems often assume this information is captured by an attention mechanism and a decoder that ensures fluency. Recent work has shown that incorporating explicit syntax alleviates the burden of modeling both types of knowledge. However, requiring parses is expensive and does not explore the question of what syntax a model needs during translation. To address both of these issues we introduce a model that simultaneously translates while inducing dependency trees. In this way, we leverage the benefits of structure while investigating what syntax NMT must induce to maximize performance. We show that our dependency trees are 1. language pair dependent and 2. improve translation quality.

Machine Translation · CASES · MoDELS · 注意力分布 · INFORMS ·

2018 年 5 月 25 日

Context-Aware Neural Machine Translation Learns Anaphora Resolution

Elena Voita,Pavel Serdyukov,Rico Sennrich,Ivan Titov

from arxiv, ACL 2018

Standard machine translation systems process sentences in isolation and hence ignore extra-sentential information, even though extended context can both prevent mistakes in ambiguous cases and improve translation coherence. We introduce a context-aware neural machine translation model designed in such way that the flow of information from the extended context to the translation model can be controlled and analyzed. We experiment with an English-Russian subtitles dataset, and observe that much of what is captured by our model deals with improving pronoun translation. We measure correspondences between induced attention distributions and coreference relations and observe that the model implicitly captures anaphora. It is consistent with gains for sentences where pronouns need to be gendered in translation. Beside improvements in anaphoric cases, the model also improves in overall BLEU, both over its context-agnostic version (+0.7) and over simple concatenation of the context and source sentences (+0.6).

Machine Translation · Weight · 權共享 · 無監督 · NMT ·

2018 年 4 月 24 日

Unsupervised Neural Machine Translation with Weight Sharing

Zhen Yang,Wei Chen,Feng Wang,Bo Xu

from arxiv, Unsupervised NMT, Accepted by ACL2018, code released

Unsupervised neural machine translation (NMT) is a recently proposed approach for machine translation which aims to train the model without using any labeled data. The models proposed for unsupervised NMT often use only one shared encoder to map the pairs of sentences from different languages to a shared-latent space, which is weak in keeping the unique and internal characteristics of each language, such as the style, terminology, and sentence structure. To address this issue, we introduce an extension by utilizing two independent encoders but sharing some partial weights which are responsible for extracting high-level representations of the input sentences. Besides, two different generative adversarial networks (GANs), namely the local GAN and global GAN, are proposed to enhance the cross-language translation. With this new approach, we achieve significant improvements on English-German, English-French and Chinese-to-English translation tasks.

注意力機制 · Machine Translation · 上下文向量 · 向量化 · 得分 ·

2018 年 4 月 3 日

Fine-Grained Attention Mechanism for Neural Machine Translation

Heeyoul Choi,Kyunghyun Cho,Yoshua Bengio

from arxiv, 9 pages, 4 figures

Neural machine translation (NMT) has been a new paradigm in machine translation, and the attention mechanism has become the dominant approach with the state-of-the-art records in many language pairs. While there are variants of the attention mechanism, all of them use only temporal attention where one scalar value is assigned to one context vector corresponding to a source word. In this paper, we propose a fine-grained (or 2D) attention mechanism where each dimension of a context vector will receive a separate attention score. In experiments with the task of En-De and En-Fi translation, the fine-grained attention method improves the translation quality in terms of BLEU score. In addition, our alignment analysis reveals how the fine-grained attention mechanism exploits the internal structure of context vectors.

Machine Translation · NMT · 詞義消歧 · Performer · MoDELS ·

2018 年 3 月 28 日

Handling Homographs in Neural Machine Translation

Frederick Liu,Han Lu,Graham Neubig

from arxiv, NAACL2018

Homographs, words with different meanings but the same surface form, have long caused difficulty for machine translation systems, as it is difficult to select the correct translation based on the context. However, with the advent of neural machine translation (NMT) systems, which can theoretically take into account global sentential context, one may hypothesize that this problem has been alleviated. In this paper, we first provide empirical evidence that existing NMT systems in fact still have significant problems in properly translating ambiguous words. We then proceed to describe methods, inspired by the word sense disambiguation literature, that model the context of the input word with context-aware word embeddings that help to differentiate the word sense be- fore feeding it into the encoder. Experiments on three language pairs demonstrate that such models improve the performance of NMT systems both in terms of BLEU score and in the accuracy of translating homographs.

詞向量表示 · 無監督 · 監督 · state-of-the-art · Pair ·

2018 年 1 月 30 日

Word Translation Without Parallel Data

Alexis Conneau,Guillaume Lample,Marc'Aurelio Ranzato,Ludovic Denoyer,Hervé Jégou

from arxiv, ICLR 2018

State-of-the-art methods for learning cross-lingual word embeddings have relied on bilingual dictionaries or parallel corpora. Recent studies showed that the need for parallel data supervision can be alleviated with character-level information. While these methods showed encouraging results, they are not on par with their supervised counterparts and are limited to pairs of languages sharing a common alphabet. In this work, we show that we can build a bilingual dictionary between two languages without using any parallel corpora, by aligning monolingual word embedding spaces in an unsupervised way. Without using any character information, our model even outperforms existing supervised methods on cross-lingual tasks for some language pairs. Our experiments demonstrate that our method works very well also for distant language pairs, like English-Russian or English-Chinese. We finally describe experiments on the English-Esperanto low-resource language pair, on which there only exists a limited amount of parallel data, to show the potential impact of our method in fully unsupervised machine translation. Our code, embeddings and dictionaries are publicly available.

Machine Translation · Performer · 向量化 · MoDELS · state-of-the-art ·

2016 年 5 月 19 日

Neural Machine Translation by Jointly Learning to Align and Translate

Dzmitry Bahdanau,Kyunghyun Cho,Yoshua Bengio

from arxiv, Accepted at ICLR 2015 as oral presentation

Neural machine translation is a recently proposed approach to machine translation. Unlike the traditional statistical machine translation, the neural machine translation aims at building a single neural network that can be jointly tuned to maximize the translation performance. The models proposed recently for neural machine translation often belong to a family of encoder-decoders and consists of an encoder that encodes a source sentence into a fixed-length vector from which a decoder generates a translation. In this paper, we conjecture that the use of a fixed-length vector is a bottleneck in improving the performance of this basic encoder-decoder architecture, and propose to extend this by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly. With this new approach, we achieve a translation performance comparable to the existing state-of-the-art phrase-based system on the task of English-to-French translation. Furthermore, qualitative analysis reveals that the (soft-)alignments found by the model agree well with our intuition.