国产乱理伦片A级在线看_久久久久精品一区二区三区_久久久九九精品欧美一区二区_久久毛片艾草一级_99精品国产高清久久久_日本三级香港三级三级人妇久_国产1卡2卡三卡四卡老狼

SGNMT is a decoding platform for machine translation which allows paring various modern neural models of translation with different kinds of constraints and symbolic models. In this paper, we describe three use cases in which SGNMT is currently playing an active role: (1) teaching as SGNMT is being used for course work and student theses in the MPhil in Machine Learning, Speech and Language Technology at the University of Cambridge, (2) research as most of the research work of the Cambridge MT group is based on SGNMT, and (3) technology transfer as we show how SGNMT is helping to transfer research findings from the laboratory to the industry, eg. into a product of SDL plc.

相關內容

Machine Translation

關注 209

機(ji)器翻譯（Machine Translation）涵蓋(gai)計(ji)算語(yu)(yu)言(yan)學和(he)(he)語(yu)(yu)言(yan)工程的所有(you)分支，包含多語(yu)(yu)言(yan)方面(mian)。特色(se)論(lun)文涵蓋(gai)理論(lun)，描(miao)述(shu)或計(ji)算方面(mian)的任何下列主題(ti):雙語(yu)(yu)和(he)(he)多語(yu)(yu)語(yu)(yu)料庫的編寫和(he)(he)使(shi)用(yong)，計(ji)算機(ji)輔助語(yu)(yu)言(yan)教學，非(fei)羅馬字(zi)符(fu)集的計(ji)算含義，連接(jie)主義翻譯方法，對比語(yu)(yu)言(yan)學等(deng)。官網地址：

Machine Translation · NMT · MoDELS · INFORMS · Pair ·

2018 年 5 月 28 日

Inducing Grammars with and for Neural Machine Translation

Ke Tran,Yonatan Bisk

from arxiv, accepted at NMT workshop (WNMT 2018)

Machine translation systems require semantic knowledge and grammatical understanding. Neural machine translation (NMT) systems often assume this information is captured by an attention mechanism and a decoder that ensures fluency. Recent work has shown that incorporating explicit syntax alleviates the burden of modeling both types of knowledge. However, requiring parses is expensive and does not explore the question of what syntax a model needs during translation. To address both of these issues we introduce a model that simultaneously translates while inducing dependency trees. In this way, we leverage the benefits of structure while investigating what syntax NMT must induce to maximize performance. We show that our dependency trees are 1. language pair dependent and 2. improve translation quality.

Machine Translation · MoDELS · Processing（編程語言） · 解碼 · 向量空間 ·

2018 年 5 月 28 日

A Stochastic Decoder for Neural Machine Translation

Philip Schulz,Wilker Aziz,Trevor Cohn

from arxiv, Accepted at ACL 2018

The process of translation is ambiguous, in that there are typically many valid trans- lations for a given sentence. This gives rise to significant variation in parallel cor- pora, however, most current models of machine translation do not account for this variation, instead treating the prob- lem as a deterministic process. To this end, we present a deep generative model of machine translation which incorporates a chain of latent variables, in order to ac- count for local lexical and syntactic varia- tion in parallel corpora. We provide an in- depth analysis of the pitfalls encountered in variational inference for training deep generative models. Experiments on sev- eral different language pairs demonstrate that the model consistently improves over strong baselines.

Machine Translation · NMT · Extensibility · Performer · MoDELS ·

2018 年 5 月 28 日

OpenNMT: Neural Machine Translation Toolkit

Guillaume Klein,Yoon Kim,Yuntian Deng,Vincent Nguyen,Jean Senellart,Alexander M. Rush

from arxiv, Presentation to AMTA 2018 - Boston. arXiv admin note: substantial text overlap with arXiv:1701.02810

OpenNMT is an open-source toolkit for neural machine translation (NMT). The system prioritizes efficiency, modularity, and extensibility with the goal of supporting NMT research into model architectures, feature representations, and source modalities, while maintaining competitive performance and reasonable training requirements. The toolkit consists of modeling and translation support, as well as detailed pedagogical documentation about the underlying techniques. OpenNMT has been used in several production MT systems, modified for numerous research papers, and is implemented across several deep learning frameworks.

注意力機制 · 稀疏 · Machine Translation · NMT · 變換 ·

2018 年 5 月 21 日

Sparse and Constrained Attention for Neural Machine Translation

Chaitanya Malaviya,Pedro Ferreira,André F. T. Martins

from arxiv, Proceedings of ACL 2018

In NMT, words are sometimes dropped from the source or generated repeatedly in the translation. We explore novel strategies to address the coverage problem that change only the attention transformation. Our approach allocates fertilities to source words, used to bound the attention each word can receive. We experiment with various sparse and constrained attention transformations and propose a new one, constrained sparsemax, shown to be differentiable and sparse. Empirical evaluation is provided in three languages pairs.

Machine Translation · 學成 · MoDELS · 無監督 · Performer ·

2018 年 4 月 13 日

Unsupervised Machine Translation Using Monolingual Corpora Only

Guillaume Lample,Alexis Conneau,Ludovic Denoyer,Marc'Aurelio Ranzato

from arxiv, ICLR 2018

Machine translation has recently achieved impressive performance thanks to recent advances in deep learning and the availability of large-scale parallel corpora. There have been numerous attempts to extend these successes to low-resource language pairs, yet requiring tens of thousands of parallel sentences. In this work, we take this research direction to the extreme and investigate whether it is possible to learn to translate even without any parallel data. We propose a model that takes sentences from monolingual corpora in two different languages and maps them into the same latent space. By learning to reconstruct in both languages from this shared feature space, the model effectively learns to translate without using any labeled data. We demonstrate our model on two widely used datasets and two language pairs, reporting BLEU scores of 32.8 and 15.1 on the Multi30k and WMT English-French datasets, without using even a single parallel sentence at training time.

INTERACT · 學成 · 多峰值 · Better · Machine Translation ·

2018 年 4 月 11 日

Emergent Translation in Multi-Agent Communication

Jason Lee,Kyunghyun Cho,Jason Weston,Douwe Kiela

from arxiv, Accepted to ICLR 2018

While most machine translation systems to date are trained on large parallel corpora, humans learn language in a different way: by being grounded in an environment and interacting with other humans. In this work, we propose a communication game where two agents, native speakers of their own respective languages, jointly learn to solve a visual referential task. We find that the ability to understand and translate a foreign language emerges as a means to achieve shared goals. The emergent translation is interactive and multimodal, and crucially does not require parallel corpora, but only monolingual, independent text and corresponding images. Our proposed translation model achieves this by grounding the source and target languages into a shared visual modality, and outperforms several baselines on both word-level and sentence-level translation tasks. Furthermore, we show that agents in a multilingual community learn to translate better and faster than in a bilingual communication setting.

Extensibility · Machine Translation · NMT · HTTPS · FAST ·

2018 年 3 月 1 日

XNMT: The eXtensible Neural Machine Translation Toolkit

Graham Neubig,Matthias Sperber,Xinyi Wang,Matthieu Felix,Austin Matthews,Sarguna Padmanabhan,Ye Qi,Devendra Singh Sachan,Philip Arthur,Pierre Godard,John Hewitt,Rachid Riad,Liming Wang

from arxiv, To be presented at AMTA 2018 Open Source Software Showcase

This paper describes XNMT, the eXtensible Neural Machine Translation toolkit. XNMT distin- guishes itself from other open-source NMT toolkits by its focus on modular code design, with the purpose of enabling fast iteration in research and replicable, reliable results. In this paper we describe the design of XNMT and its experiment configuration system, and demonstrate its utility on the tasks of machine translation, speech recognition, and multi-tasked machine translation/parsing. XNMT is available open-source at //github.com/neulab/xnmt

Machine Translation · 穩健性 · 噪聲 · MoDELS · NMT ·

2018 年 2 月 24 日

Synthetic and Natural Noise Both Break Neural Machine Translation

Yonatan Belinkov,Yonatan Bisk

from arxiv, ICLR 2018 camera-ready

Character-based neural machine translation (NMT) models alleviate out-of-vocabulary issues, learn morphology, and move us closer to completely end-to-end translation systems. Unfortunately, they are also very brittle and easily falter when presented with noisy data. In this paper, we confront NMT models with synthetic and natural sources of noise. We find that state-of-the-art models fail to translate even moderately noisy texts that humans have no trouble comprehending. We explore two approaches to increase model robustness: structure-invariant word representations and robust training on noisy texts. We find that a model based on a character convolutional neural network is able to simultaneously learn representations robust to multiple kinds of noise.

Machine Translation · Notability · INTERACT · 圖像字幕 · 多峰值 ·

2018 年 2 月 9 日

Zero-Resource Neural Machine Translation with Multi-Agent Communication Game

Yun Chen,Yang Liu,Victor O. K. Li

from arxiv, Published at AAAI-18

While end-to-end neural machine translation (NMT) has achieved notable success in the past years in translating a handful of resource-rich language pairs, it still suffers from the data scarcity problem for low-resource language pairs and domains. To tackle this problem, we propose an interactive multimodal framework for zero-resource neural machine translation. Instead of being passively exposed to large amounts of parallel corpora, our learners (implemented as encoder-decoder architecture) engage in cooperative image description games, and thus develop their own image captioning or neural machine translation model from the need to communicate in order to succeed at the game. Experimental results on the IAPR-TC12 and Multi30K datasets show that the proposed learning mechanism significantly improves over the state-of-the-art methods.

Machine Translation · NMT · 隨機變量 · 后驗推斷 · 再參數化/重參數化 ·

2018 年 1 月 16 日

Variational Recurrent Neural Machine Translation

Jinsong Su,Shan Wu,Deyi Xiong,Yaojie Lu,Xianpei Han,Biao Zhang

from arxiv, accepted by AAAI 18

Partially inspired by successful applications of variational recurrent neural networks, we propose a novel variational recurrent neural machine translation (VRNMT) model in this paper. Different from the variational NMT, VRNMT introduces a series of latent random variables to model the translation procedure of a sentence in a generative way, instead of a single latent variable. Specifically, the latent random variables are included into the hidden states of the NMT decoder with elements from the variational autoencoder. In this way, these variables are recurrently generated, which enables them to further capture strong and complex dependencies among the output translations at different timesteps. In order to deal with the challenges in performing efficient posterior inference and large-scale training during the incorporation of latent variables, we build a neural posterior approximator, and equip it with a reparameterization technique to estimate the variational lower bound. Experiments on Chinese-English and English-German translation tasks demonstrate that the proposed model achieves significant improvements over both the conventional and variational NMT models.