Can Neural Machine Translation be Improved with User Feedback?

04/16/2018
by   Julia Kreutzer, et al.
0

We present the first real-world application of methods for improving neural machine translation (NMT) with human reinforcement, based on explicit and implicit user feedback collected on the eBay e-commerce platform. Previous work has been confined to simulation experiments, whereas in this paper we work with real logged feedback for offline bandit learning of NMT parameters. We conduct a thorough analysis of the available explicit user judgments---five-star ratings of translation quality---and show that they are not reliable enough to yield significant improvements in bandit learning. In contrast, we successfully utilize implicit task-based feedback collected in a cross-lingual search task to improve task-specific and machine translation quality metrics.

READ FULL TEXT
research
06/19/2018

Learning from Chunk-based Feedback in Neural Machine Translation

We empirically investigate learning from partial feedback in neural mach...
research
05/27/2018

Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning

We present a study on reinforcement learning (RL) from human bandit feed...
research
07/24/2017

Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback

Machine translation is a natural candidate problem for reinforcement lea...
research
02/22/2020

Machine Translation System Selection from Bandit Feedback

Adapting machine translation systems in the real world is a difficult pr...
research
07/04/2019

Interactive-Predictive Neural Machine Translation through Reinforcement and Imitation

We propose an interactive-predictive neural machine translation framewor...
research
08/03/2017

The UMD Neural Machine Translation Systems at WMT17 Bandit Learning Task

We describe the University of Maryland machine translation systems submi...
research
08/27/2021

Secoco: Self-Correcting Encoding for Neural Machine Translation

This paper presents Self-correcting Encoding (Secoco), a framework that ...

Please sign up or login with your details

Forgot password? Click here to reset