Optimal Transport for Unsupervised Hallucination Detection in Neural Machine Translation

12/19/2022
by   Nuno M. Guerreiro, et al.
0

Neural machine translation (NMT) has become the de-facto standard in real-world machine translation applications. However, NMT models can unpredictably produce severely pathological translations, known as hallucinations, that seriously undermine user trust. It becomes thus crucial to implement effective preventive strategies to guarantee their proper functioning. In this paper, we address the problem of hallucination detection in NMT by following a simple intuition: as hallucinations are detached from the source content, they exhibit encoder-decoder attention patterns that are statistically different from those of good quality translations. We frame this problem with an optimal transport formulation and propose a fully unsupervised, plug-in detector that can be used with any attention-based NMT model. Experimental results show that our detector not only outperforms all previous model-based detectors, but is also competitive with detectors that employ large models trained on millions of samples.

READ FULL TEXT
research
06/09/2020

Universal Vector Neural Machine Translation With Effective Attention

Neural Machine Translation (NMT) leverages one or more trained neural ne...
research
08/08/2018

Debugging Neural Machine Translations

In this paper, we describe a tool for debugging the output and attention...
research
05/10/2016

Coverage Embedding Models for Neural Machine Translation

In this paper, we enhance the attention-based neural machine translation...
research
03/03/2019

Calibration of Encoder Decoder Models for Neural Machine Translation

We study the calibration of several state of the art neural machine tran...
research
06/25/2022

Probing Causes of Hallucinations in Neural Machine Translations

Hallucination, one kind of pathological translations that bothers Neural...
research
07/06/2018

Oracle-free Detection of Translation Issue for Neural Machine Translation

Neural Machine Translation (NMT) has been widely adopted over recent yea...
research
06/11/2021

Towards User-Driven Neural Machine Translation

A good translation should not only translate the original content semant...

Please sign up or login with your details

Forgot password? Click here to reset