Translating Translationese: A Two-Step Approach to Unsupervised Machine Translation

06/11/2019
by   Nima Pourdamghani, et al.
0

Given a rough, word-by-word gloss of a source language sentence, target language natives can uncover the latent, fully-fluent rendering of the translation. In this work we explore this intuition by breaking translation into a two step process: generating a rough gloss by means of a dictionary and then `translating' the resulting pseudo-translation, or `Translationese' into a fully fluent translation. We build our Translationese decoder once from a mish-mash of parallel data that has the target language in common and then can build dictionaries on demand using unsupervised techniques, resulting in rapidly generated unsupervised neural MT systems for many source languages. We apply this process to 14 test languages, obtaining better or comparable translation results on high-resource languages than previously published unsupervised MT studies, and obtaining good quality results for low-resource languages that have never been used in an unsupervised MT scenario.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/30/2021

What Can Unsupervised Machine Translation Contribute to High-Resource Language Pairs?

Whereas existing literature on unsupervised machine translation (MT) foc...
research
04/12/2020

When Does Unsupervised Machine Translation Work?

Despite the reported success of unsupervised machine translation (MT), t...
research
04/26/2022

Flow-Adapter Architecture for Unsupervised Machine Translation

In this work, we propose a flow-adapter architecture for unsupervised NM...
research
02/04/2019

Unsupervised Clinical Language Translation

As patients' access to their doctors' clinical notes becomes common, tra...
research
03/24/2023

Towards Making the Most of ChatGPT for Machine Translation

ChatGPT shows remarkable capabilities for machine translation (MT). Seve...
research
11/06/2018

Off-the-Shelf Unsupervised NMT

We frame unsupervised machine translation (MT) in the context of multi-t...
research
05/12/2023

Improving Cascaded Unsupervised Speech Translation with Denoising Back-translation

Most of the speech translation models heavily rely on parallel data, whi...

Please sign up or login with your details

Forgot password? Click here to reset