Transfer Learning across Low-Resource, Related Languages for Neural Machine Translation

08/31/2017
by   Toan Q. Nguyen, et al.
0

We present a simple method to improve neural translation of a low-resource language pair using parallel data from a related, also low-resource, language pair. The method is based on the transfer method of Zoph et al., but whereas their method ignores any source vocabulary overlap, ours exploits it. First, we split words using Byte Pair Encoding (BPE) to increase vocabulary overlap. Then, we train a model on the first language pair and transfer its parameters, including its source word embeddings, to another model and continue training on the second language pair. Our experiments show that transfer learning helps word-based translation only slightly, but when used on top of a much stronger BPE baseline, it yields larger improvements of up to 4.3 BLEU.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/02/2018

Trivial Transfer Learning for Low-Resource Neural Machine Translation

Transfer learning has been proven as an effective technique for neural m...
research
08/08/2021

Machine Translation of Low-Resource Indo-European Languages

Transfer learning has been an important technique for low-resource neura...
research
09/24/2019

Transfer Learning across Languages from Someone Else's NMT Model

Neural machine translation is demanding in terms of training time, hardw...
research
05/09/2022

Sub-Word Alignment Is Still Useful: A Vest-Pocket Method for Enhancing Low-Resource Machine Translation

We leverage embedding duplication between aligned sub-words to extend th...
research
09/14/2019

A Universal Parent Model for Low-Resource Neural Machine Translation Transfer

Transfer learning from a high-resource language pair `parent' has been p...
research
10/06/2017

Low-resource bilingual lexicon extraction using graph based word embeddings

In this work we focus on the task of automatically extracting bilingual ...
research
01/06/2020

Exploring Benefits of Transfer Learning in Neural Machine Translation

Neural machine translation is known to require large numbers of parallel...

Please sign up or login with your details

Forgot password? Click here to reset