Learning a Multitask Curriculum for Neural Machine Translation

08/28/2019
by   Wei Wang, et al.
0

Existing curriculum learning research in neural machine translation (NMT) mostly focuses on a single final task such as selecting data for a domain or for denoising, and considers in-task example selection. This paper studies the data selection problem in multitask setting. We present a method to learn a multitask curriculum on a single, diverse, potentially noisy training dataset. It computes multiple data selection scores for each training example, each score measuring how useful the example is to a certain task. It uses Bayesian optimization to learn a linear weighting of these per-instance scores, and then sorts the data to form a curriculum. We experiment with three domain translation tasks: two specific domains and the general domain, and demonstrate that the learned multitask curriculum delivers results close to individually optimized models and brings solid gains over no curriculum training, across all test sets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/25/2022

Data Selection Curriculum for Neural Machine Translation

Neural Machine Translation (NMT) models are typically trained on heterog...
research
06/03/2019

Dynamically Composing Domain-Data Selection with Clean-Data Selection by "Co-Curricular Learning" for Neural Machine Translation

Noise and domain are important aspects of data quality for neural machin...
research
07/14/2023

A Quantitative Approach to Predicting Representational Learning and Performance in Neural Networks

A key property of neural networks (both biological and artificial) is ho...
research
08/31/2018

Denoising Neural Machine Translation Training with Trusted Data and Online Data Selection

Measuring domain relevance of data and identifying or selecting well-fit...
research
07/14/2019

Task Selection Policies for Multitask Learning

One of the questions that arises when designing models that learn to sol...
research
11/13/2021

On the Statistical Benefits of Curriculum Learning

Curriculum learning (CL) is a commonly used machine learning training st...
research
02/27/2023

Make Every Example Count: On Stability and Utility of Self-Influence for Learning from Noisy NLP Datasets

Increasingly larger datasets have become a standard ingredient to advanc...

Please sign up or login with your details

Forgot password? Click here to reset