Language Modeling, Lexical Translation, Reordering: The Training Process of NMT through the Lens of Classical SMT

09/03/2021
by   Elena Voita, et al.
12

Differently from the traditional statistical MT that decomposes the translation task into distinct separately learned components, neural machine translation uses a single neural network to model the entire translation process. Despite neural machine translation being de-facto standard, it is still not clear how NMT models acquire different competences over the course of training, and how this mirrors the different models in traditional SMT. In this work, we look at the competences related to three core SMT components and find that during training, NMT first focuses on learning target-side language modeling, then improves translation quality approaching word-by-word translation, and finally learns more complicated reordering patterns. We show that this behavior holds for several models and language pairs. Additionally, we explain how such an understanding of the training process can be useful in practice and, as an example, show how it can be used to improve vanilla non-autoregressive neural machine translation by guiding teacher model selection.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/25/2017

Machine Translation at Booking.com: Journey and Lessons Learned

We describe our recently developed neural machine translation (NMT) syst...
research
11/01/2017

Improving Neural Machine Translation through Phrase-based Forced Decoding

Compared to traditional statistical machine translation (SMT), neural ma...
research
11/30/2022

Word Alignment in the Era of Deep Learning: A Tutorial

The word alignment task, despite its prominence in the era of statistica...
research
04/05/2020

Understanding Learning Dynamics for Neural Machine Translation

Despite the great success of NMT, there still remains a severe challenge...
research
10/21/2020

Analyzing the Source and Target Contributions to Predictions in Neural Machine Translation

In Neural Machine Translation (and, more generally, conditional language...
research
01/14/2019

Unsupervised Neural Machine Translation with SMT as Posterior Regularization

Without real bilingual corpus available, unsupervised Neural Machine Tra...
research
02/17/2015

A Survey of Word Reordering in Statistical Machine Translation: Computational Models and Language Phenomena

Word reordering is one of the most difficult aspects of statistical mach...

Please sign up or login with your details

Forgot password? Click here to reset