When Does Unsupervised Machine Translation Work?

04/12/2020
by   Kelly Marchisio, et al.
0

Despite the reported success of unsupervised machine translation (MT), the field has yet to examine the conditions under which these methods succeed, and where they fail. We conduct an extensive empirical evaluation of unsupervised MT using dissimilar language pairs, dissimilar domains, diverse datasets, and authentic low-resource languages. We find that performance rapidly deteriorates when source and target corpora are from different domains, and that random word embedding initialization can dramatically affect downstream translation performance. We additionally find that unsupervised MT performance declines when source and target languages use different scripts, and observe very poor performance on authentic low-resource language pairs. We advocate for extensive empirical evaluation of unsupervised MT systems to highlight failure points and encourage continued research on the most promising paradigms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/30/2021

What Can Unsupervised Machine Translation Contribute to High-Resource Language Pairs?

Whereas existing literature on unsupervised machine translation (MT) foc...
research
10/12/2022

SilverAlign: MT-Based Silver Data Algorithm For Evaluating Word Alignment

Word alignments are essential for a variety of NLP tasks. Therefore, cho...
research
06/11/2019

Translating Translationese: A Two-Step Approach to Unsupervised Machine Translation

Given a rough, word-by-word gloss of a source language sentence, target ...
research
01/31/2023

Machine Translation Impact in E-commerce Multilingual Search

Previous work suggests that performance of cross-lingual information ret...
research
05/09/2018

On the Limitations of Unsupervised Bilingual Dictionary Induction

Unsupervised machine translation---i.e., not assuming any cross-lingual ...
research
09/28/2022

From Zero to Production: Baltic-Ukrainian Machine Translation Systems to Aid Refugees

In this paper, we examine the development and usage of six low-resource ...
research
05/06/2020

Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting

Unsupervised machine translation (MT) has recently achieved impressive r...

Please sign up or login with your details

Forgot password? Click here to reset