On NMT Search Errors and Model Errors: Cat Got Your Tongue?

08/27/2019
by Felix Stahlberg, et al.

We report on search errors and model errors in neural machine translation (NMT). We present an exact inference procedure for neural sequence models based on a combination of beam search and depth-first search. We use our exact search to find the global best model scores under a Transformer base model for the entire WMT15 English-German test set. Surprisingly, beam search fails to find these global best model scores in most cases, even with a very large beam size of 100. For more than 50% of the sentences, the model in fact assigns its global best score to the empty translation, revealing a massive failure of neural models in properly accounting for adequacy. We show by constraining search with a minimum translation length that at the root of the problem of empty translations lies an inherent bias towards shorter translations. We conclude that vanilla NMT in its current form requires just the right amount of beam search errors, which, from a modelling perspective, is a highly unsatisfactory conclusion indeed, as the model often prefers an empty translation.
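
The idea of combining beam search with depth-first search can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes only that hypothesis scores are sums of token log-probabilities, so a prefix score can never increase as tokens are appended; any prefix that already scores no better than a known complete hypothesis can therefore be pruned. The names `toy_model`, `beam_search`, and `exact_search`, the three-token vocabulary, and all probabilities are hypothetical and chosen only so that the empty translation is the global best.

```python
import math

EOS = "</s>"


def toy_model(prefix):
    """Hypothetical stand-in for an NMT model: next-token log-probabilities."""
    if len(prefix) == 0:
        probs = {EOS: 0.30, "a": 0.45, "b": 0.25}
    elif len(prefix) >= 3:
        probs = {EOS: 1.0}  # force termination so the search space is finite
    else:
        probs = {EOS: 0.5, "a": 0.3, "b": 0.2}
    return {tok: math.log(p) for tok, p in probs.items()}


def beam_search(model, beam_size=1, min_length=0, max_length=10):
    """Plain beam search; returns (tokens, log_prob) of the best hypothesis found."""
    beam = [((), 0.0, False)]  # (tokens, log-prob, finished?)
    while not all(done for _, _, done in beam):
        candidates = []
        for tokens, score, done in beam:
            if done:
                candidates.append((tokens, score, True))
                continue
            for tok, lp in model(tokens).items():
                if tok == EOS:
                    # Minimum-length constraint: EOS is only allowed late enough.
                    if len(tokens) >= min_length:
                        candidates.append((tokens, score + lp, True))
                elif len(tokens) < max_length:
                    candidates.append((tokens + (tok,), score + lp, False))
        beam = sorted(candidates, key=lambda c: -c[1])[:beam_size]
    tokens, score, _ = beam[0]
    return tokens, score


def exact_search(model, min_length=0, max_length=10):
    """Depth-first search for the global best score, pruned by a lower bound."""
    # Seed the lower bound with a greedy (beam size 1) hypothesis found under
    # the same length constraint, so the bound is valid for this search space.
    best_tokens, best_score = beam_search(model, 1, min_length, max_length)

    def dfs(tokens, score):
        nonlocal best_tokens, best_score
        # Visit the most probable continuations first.
        for tok, lp in sorted(model(tokens).items(), key=lambda kv: -kv[1]):
            new_score = score + lp
            if new_score <= best_score:
                # Log-probabilities are <= 0, so no extension of this prefix
                # (and no less probable sibling) can beat the current bound.
                break
            if tok == EOS:
                if len(tokens) >= min_length:
                    best_tokens, best_score = tokens, new_score
            elif len(tokens) < max_length:
                dfs(tokens + (tok,), new_score)

    dfs((), 0.0)
    return best_tokens, best_score


greedy_tokens, greedy_score = beam_search(toy_model, beam_size=1)
exact_tokens, exact_score = exact_search(toy_model)
constrained_tokens, constrained_score = exact_search(toy_model, min_length=1)
```

On this toy distribution, greedy search commits to the token `a` and returns a one-token hypothesis, while the exact search finds that the empty hypothesis has a higher model score. Calling `exact_search` with `min_length=1` excludes the empty hypothesis, which mirrors the minimum-length constraint described in the abstract.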

research
08/29/2018

Correcting Length Bias in Neural Machine Translation

We study two problems in neural machine translation (NMT). First, in bea...
research
02/06/2017

Beam Search Strategies for Neural Machine Translation

The basic concept in Neural Machine Translation (NMT) is to train a larg...
research
08/02/2017

Analyzing Neural MT Search and Model Performance

In this paper, we offer an in-depth analysis about the modeling and sear...
research
04/15/2021

First the worst: Finding better gender translations during beam search

Neural machine translation inference procedures like beam search generat...
research
05/02/2022

The Implicit Length Bias of Label Smoothing on Beam Search Decoding

Label smoothing is ubiquitously applied in Neural Machine Translation (N...
research
05/07/2020

On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation

The standard training algorithm in neural machine translation (NMT) suff...
research
09/13/2021

Multi-Sentence Resampling: A Simple Approach to Alleviate Dataset Length Bias and Beam-Search Degradation

Neural Machine Translation (NMT) is known to suffer from a beam-search p...
