Learning Kernel-Smoothed Machine Translation with Retrieved Examples

09/21/2021
by Qingnan Jiang, et al.

How can neural machine translation (NMT) models be effectively adapted to emerging cases without retraining? Despite the great success of NMT, updating deployed models online remains a challenge. Existing non-parametric approaches that retrieve similar examples from a database to guide the translation process are promising, but they are prone to overfitting the retrieved examples. In this work, we propose Kernel-Smoothed Translation with Example Retrieval (KSTER), an effective approach to adapting NMT models online. Experiments on domain adaptation and multi-domain machine translation datasets show that, even without expensive retraining, KSTER achieves improvements of 1.1 to 1.5 BLEU over the best existing online adaptation methods. The code and trained models are released at https://github.com/jiangqn/KSTER.
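The abstract describes guiding translation with a kernel-smoothed distribution built from retrieved examples, in the spirit of kNN-MT-style retrieval augmentation. The sketch below is only an illustration of that general idea, not the released KSTER implementation: the function name, the Gaussian kernel choice, and the fixed bandwidth and mixing weight are all assumptions for the example (the paper's actual formulation, including how the kernel and mixing are learned, is in the full text and repository).

```python
import numpy as np

def kernel_smoothed_distribution(query, keys, values, model_probs, vocab_size,
                                 bandwidth=1.0, mixing_weight=0.5):
    """Illustrative sketch: blend the NMT model's next-token distribution with a
    kernel-smoothed distribution over retrieved examples.

    query       : hidden-state vector for the current decoding step, shape (d,)
    keys        : hidden states of retrieved examples, shape (k, d)
    values      : target-token ids paired with each retrieved key, length k
    model_probs : the base model's next-token probabilities, shape (vocab_size,)
    """
    # Gaussian kernel weights from squared distances between query and keys
    # (bandwidth is fixed here; in KSTER it is learned).
    dists = np.sum((keys - query) ** 2, axis=1)
    weights = np.exp(-dists / bandwidth)
    weights /= weights.sum()

    # Aggregate the retrieved target tokens into a distribution over the vocabulary.
    retrieval_probs = np.zeros(vocab_size)
    for w, tok in zip(weights, values):
        retrieval_probs[tok] += w

    # Interpolate retrieval and model distributions (mixing_weight is fixed here;
    # a learned, example-dependent mixture is one way such methods avoid
    # overfitting the retrieved examples).
    return mixing_weight * retrieval_probs + (1 - mixing_weight) * model_probs
```

Closer retrieved examples receive larger kernel weights, so they contribute more to the blended distribution, while the base model's probabilities keep the output well-formed when retrieval is unreliable.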

Related research

09/23/2022
Zero-shot Domain Adaptation for Neural Machine Translation with Retrieved Phrase-level Prompts
Domain adaptation is an important challenge for neural machine translati...

02/28/2019
Non-Parametric Adaptation for Neural Machine Translation
Neural Networks trained with gradient descent are known to be susceptibl...

09/23/2021
Non-Parametric Online Learning from Human Feedback for Neural Machine Translation
We study the problem of online learning with human feedback in the human...

08/13/2018
Rapid Adaptation of Neural Machine Translation to New Languages
This paper examines the problem of adapting neural machine translation s...

06/22/2021
On the Evaluation of Machine Translation for Terminology Consistency
As neural machine translation (NMT) systems become an important part of ...

06/30/2021
Mixed Cross Entropy Loss for Neural Machine Translation
In neural machine translation, cross entropy (CE) is the standard loss f...

06/16/2021
Revisiting the Weaknesses of Reinforcement Learning for Neural Machine Translation
Policy gradient algorithms have found wide adoption in NLP, but have rec...
