Improving Target-side Lexical Transfer in Multilingual Neural Machine Translation

10/04/2020
by   Luyu Gao, et al.
0

To improve the performance of Neural Machine Translation (NMT) for low-resource languages (LRL), one effective strategy is to leverage parallel data from a related high-resource language (HRL). However, multilingual data has been found more beneficial for NMT models that translate from the LRL to a target language than the ones that translate into the LRLs. In this paper, we aim to improve the effectiveness of multilingual transfer for NMT models that translate into the LRL, by designing a better decoder word embedding. Extending upon a general-purpose multilingual encoding method Soft Decoupled Encoding <cit.>, we propose DecSDE, an efficient character n-gram based embedding specifically designed for the NMT decoder. Our experiments show that DecSDE leads to consistent gains of up to 1.8 BLEU on translation from English to four different languages.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/09/2019

Multilingual Neural Machine Translation With Soft Decoupled Encoding

Multilingual training of neural machine translation (NMT) systems has le...
research
05/20/2019

Target Conditioned Sampling: Optimizing Data Selection for Multilingual Neural Machine Translation

To improve low-resource Neural Machine Translation (NMT) with multilingu...
research
09/14/2021

Efficient Inference for Multilingual Neural Machine Translation

Multilingual NMT has become an attractive solution for MT deployment in ...
research
03/31/2023

ℰ KÚ [MASK]: Integrating Yorùbá cultural greetings into machine translation

This paper investigates the performance of massively multilingual neural...
research
09/06/2018

Character-Aware Decoder for Neural Machine Translation

Standard neural machine translation (NMT) systems operate primarily on w...
research
05/23/2022

Local Byte Fusion for Neural Machine Translation

Subword tokenization schemes are the dominant technique used in current ...
research
10/30/2019

Adapting Multilingual Neural Machine Translation to Unseen Languages

Multilingual Neural Machine Translation (MNMT) for low-resource language...

Please sign up or login with your details

Forgot password? Click here to reset