The Curious Case of Hallucinations in Neural Machine Translation

04/14/2021
by   Vikas Raunak, et al.
7

In this work, we study hallucinations in Neural Machine Translation (NMT), which lie at an extreme end on the spectrum of NMT pathologies. Firstly, we connect the phenomenon of hallucinations under source perturbation to the Long-Tail theory of Feldman (2020), and present an empirically validated hypothesis that explains hallucinations under source perturbation. Secondly, we consider hallucinations under corpus-level noise (without any source perturbation) and demonstrate that two prominent types of natural hallucinations (detached and oscillatory outputs) could be generated and explained through specific corpus-level noise patterns. Finally, we elucidate the phenomenon of hallucination amplification in popular data-generation processes such as Backtranslation and sequence-level Knowledge Distillation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/20/2017

Enabling Multi-Source Neural Machine Translation By Concatenating Source Sentences In Multiple Languages

In this paper, we propose a novel and elegant solution to "Multi-Source ...
research
02/19/2019

Semantic Neural Machine Translation using AMR

It is intuitive that semantic representations can be useful for machine ...
research
05/19/2023

Pseudo-Label Training and Model Inertia in Neural Machine Translation

Like many other machine learning applications, neural machine translatio...
research
04/13/2021

Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation

A conventional approach to improving the performance of end-to-end speec...
research
05/24/2019

An Analysis of Source-Side Grammatical Errors in NMT

The quality of Neural Machine Translation (NMT) has been shown to signif...
research
12/31/2020

Exploring Monolingual Data for Neural Machine Translation with Knowledge Distillation

We explore two types of monolingual data that can be included in knowled...
research
12/10/2020

Approches quantitatives de l'analyse des prédictions en traduction automatique neuronale (TAN)

As part of a larger project on optimal learning conditions in neural mac...

Please sign up or login with your details

Forgot password? Click here to reset