Self-Regulated Interactive Sequence-to-Sequence Learning

07/11/2019
by   Julia Kreutzer, et al.
0

Not all types of supervision signals are created equal: Different types of feedback have different costs and effects on learning. We show how self-regulation strategies that decide when to ask for which kind of feedback from a teacher (or from oneself) can be cast as a learning-to-learn problem leading to improved cost-aware sequence-to-sequence learning. In experiments on interactive neural machine translation, we find that the self-regulator discovers an ϵ-greedy strategy for the optimal cost-quality trade-off by mixing different feedback types including corrections, error markups, and self-supervision. Furthermore, we demonstrate its robustness under domain shift and identify it as a promising alternative to active learning.

READ FULL TEXT
research
04/21/2017

Bandit Structured Prediction for Neural Sequence-to-Sequence Learning

Bandit structured prediction describes a stochastic optimization framewo...
research
05/20/2019

A Neural, Interactive-predictive System for Multimodal Sequence to Sequence Tasks

We present a demonstration of a neural interactive-predictive system for...
research
04/23/2020

Correct Me If You Can: Learning from Error Corrections and Markings

Sequence-to-sequence learning involves a trade-off between signal streng...
research
07/04/2019

Interactive-Predictive Neural Machine Translation through Reinforcement and Imitation

We propose an interactive-predictive neural machine translation framewor...
research
07/30/2018

Active Learning for Interactive Neural Machine Translation of Data Streams

We study the application of active learning techniques to the translatio...
research
05/27/2018

Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning

We present a study on reinforcement learning (RL) from human bandit feed...
research
05/30/2019

Interactive-predictive neural multimodal systems

Despite the advances achieved by neural models in sequence to sequence l...

Please sign up or login with your details

Forgot password? Click here to reset