Unsupervised Neural Machine Translation

10/30/2017
by   Mikel Artetxe, et al.
0

In spite of the recent success of neural machine translation (NMT) in standard benchmarks, the lack of large parallel corpora poses a major practical problem for many language pairs. There have been several proposals to alleviate this issue with, for instance, triangulation and semi-supervised learning techniques, but they still require a strong cross-lingual signal. In this work, we completely remove the need of parallel data and propose a novel method to train an NMT system in a completely unsupervised manner, relying on nothing but monolingual corpora. Our model builds upon the recent work on unsupervised embedding mappings, and consists of a slightly modified attentional encoder-decoder model that can be trained on monolingual corpora alone using a combination of denoising and backtranslation. Despite the simplicity of the approach, our system obtains 15.56 and 10.21 BLEU points in WMT 2014 French-to-English and German-to-English translation. The model can also profit from small parallel corpora, and attains 21.81 and 15.24 points when combined with 100,000 parallel sentences, respectively. Our approach is a breakthrough in unsupervised NMT, and opens exciting opportunities for future research.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/04/2018

Unsupervised Statistical Machine Translation

While modern machine translation has relied on large parallel corpora, a...
research
06/10/2021

Exploring Unsupervised Pretraining Objectives for Machine Translation

Unsupervised cross-lingual pretraining has achieved strong results in ne...
research
04/30/2020

Semi-Supervised Text Simplification with Back-Translation and Asymmetric Denoising Autoencoders

Text simplification (TS) rephrases long sentences into simplified varian...
research
06/05/2019

Deep learning based unsupervised concept unification in the embedding space

Humans are able to conceive physical reality by jointly learning differe...
research
07/29/2018

Fast derivation of neural network based document vectors with distance constraint and negative sampling

A universal cross-lingual representation of documents is very important ...
research
12/26/2019

Amharic-Arabic Neural Machine Translation

Many automatic translation works have been addressed between major Europ...
research
04/20/2018

Phrase-Based & Neural Unsupervised Machine Translation

Machine translation systems achieve near human-level performance on some...

Please sign up or login with your details

Forgot password? Click here to reset