Multilingual Unsupervised Neural Machine Translation with Denoising Adapters

10/20/2021
by   Ahmet Üstün, et al.
0

We consider the problem of multilingual unsupervised machine translation, translating to and from languages that only have monolingual data by using auxiliary parallel language pairs. For this problem the standard procedure so far to leverage the monolingual data is back-translation, which is computationally costly and hard to tune. In this paper we propose instead to use denoising adapters, adapter layers with a denoising objective, on top of pre-trained mBART-50. In addition to the modularity and flexibility of such an approach we show that the resulting translations are on-par with back-translating as measured by BLEU, and furthermore it allows adding unseen languages incrementally.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/19/2021

Integrating Unsupervised Data Generation into Self-Supervised Neural Machine Translation for Low-Resource Languages

For most language combinations, parallel data is either scarce or simply...
research
04/07/2020

Unsupervised Neural Machine Translation with Indirect Supervision

Neural machine translation (NMT) is ineffective for zero-resource langua...
research
03/11/2021

Towards Continual Learning for Multilingual Machine Translation via Vocabulary Substitution

We propose a straightforward vocabulary adaptation scheme to extend the ...
research
05/31/2023

Automatic Discrimination of Human and Neural Machine Translation in Multilingual Scenarios

We tackle the task of automatically discriminating between human and mac...
research
11/14/2021

DEEP: DEnoising Entity Pre-training for Neural Machine Translation

It has been shown that machine translation models usually generate poor ...
research
08/15/2023

VBD-MT Chinese-Vietnamese Translation Systems for VLSP 2022

We present our systems participated in the VLSP 2022 machine translation...
research
05/12/2023

Improving Cascaded Unsupervised Speech Translation with Denoising Back-translation

Most of the speech translation models heavily rely on parallel data, whi...

Please sign up or login with your details

Forgot password? Click here to reset