Domain Robustness in Neural Machine Translation

11/08/2019
by   Mathias Müller, et al.
0

Translating text that diverges from the training domain is a key challenge for neural machine translation (NMT). Domain robustness - the generalization of models to unseen test domains - is low compared to statistical machine translation. In this paper, we investigate the performance of NMT on out-of-domain test sets, and ways to improve it. We observe that hallucination (translations that are fluent but unrelated to the source) is common in out-of-domain settings, and we empirically compare methods that improve adequacy (reconstruction), out-of-domain translation (subword regularization), or robustness against adversarial examples (defensive distillation), as well as noisy channel models. In experiments on German to English OPUS data, and German to Romansh, a low-resource scenario, we find that several methods improve domain robustness, reconstruction standing out as a method that not only improves automatic scores, but also shows improvements in a manual assessments of adequacy, albeit at some loss in fluency. However, out-of-domain performance is still relatively low and domain robustness remains an open problem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/29/2022

Benchmarking Azerbaijani Neural Machine Translation

Little research has been done on Neural Machine Translation (NMT) for Az...
research
09/27/2019

On the use of BERT for Neural Machine Translation

Exploiting large pretrained models for various NMT tasks have gained a l...
research
06/02/2019

Domain Adaptive Inference for Neural Machine Translation

We investigate adaptive ensemble weighting for Neural Machine Translatio...
research
12/19/2016

Neural Machine Translation from Simplified Translations

Text simplification aims at reducing the lexical, grammatical and struct...
research
01/06/2022

Phrase-level Adversarial Example Generation for Neural Machine Translation

While end-to-end neural machine translation (NMT) has achieved impressiv...
research
01/02/2021

Decoding Time Lexical Domain Adaptationfor Neural Machine Translation

Machine translation systems are vulnerable to domain mismatch, especiall...
research
04/29/2018

Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates

Subword units are an effective way to alleviate the open vocabulary prob...

Please sign up or login with your details

Forgot password? Click here to reset