Meta-Learning for Few-Shot NMT Adaptation

04/06/2020
by   Amr Sharaf, et al.
0

We present META-MT, a meta-learning approach to adapt Neural Machine Translation (NMT) systems in a few-shot setting. META-MT provides a new approach to make NMT models easily adaptable to many target domains with the minimal amount of in-domain data. We frame the adaptation of NMT systems as a meta-learning problem, where we learn to adapt to new unseen domains based on simulated offline meta-training domain adaptation tasks. We evaluate the proposed meta-learning strategy on ten domains with general large scale NMT systems. We show that META-MT significantly outperforms classical domain adaptation when very few in-domain examples are available. Our experiments shows that META-MT can outperform classical fine-tuning by up to 2.5 BLEU points after seeing only 4, 000 translated words (300 parallel sentences).

READ FULL TEXT

page 3

page 7

page 8

research
03/03/2021

Meta-Curriculum Learning for Domain Adaptation in Neural Machine Translation

Meta-learning has been sufficiently validated to be beneficial for low-r...
research
01/29/2021

Few-Shot Domain Adaptation for Grammatical Error Correction via Meta-Learning

Most existing Grammatical Error Correction (GEC) methods based on sequen...
research
02/22/2021

Domain Adaptation in Dialogue Systems using Transfer and Meta-Learning

Current generative-based dialogue systems are data-hungry and fail to ad...
research
11/08/2022

What Knowledge Is Needed? Towards Explainable Memory for kNN-MT Domain Adaptation

kNN-MT presents a new paradigm for domain adaptation by building an exte...
research
06/01/2019

Learning to Transfer: Unsupervised Meta Domain Translation

Unsupervised domain translation has recently achieved impressive perform...
research
11/22/2021

Reinforcement Learning for Few-Shot Text Generation Adaptation

Controlling the generative model to adapt a new domain with limited samp...
research
09/23/2022

Expanding the Deployment Envelope of Behavior Prediction via Adaptive Meta-Learning

Learning-based behavior prediction methods are increasingly being deploy...

Please sign up or login with your details

Forgot password? Click here to reset