Curriculum Learning with Adam: The Devil Is in the Wrong Details

08/23/2023
by   Lucas Weber, et al.
0

Curriculum learning (CL) posits that machine learning models – similar to humans – may learn more efficiently from data that match their current learning progress. However, CL methods are still poorly understood and, in particular for natural language processing (NLP), have achieved only limited success. In this paper, we explore why. Starting from an attempt to replicate and extend a number of recent curriculum methods, we find that their results are surprisingly brittle when applied to NLP. A deep dive into the (in)effectiveness of the curricula in some scenarios shows us why: when curricula are employed in combination with the popular Adam optimisation algorithm, they oftentimes learn to adapt to suboptimally chosen optimisation parameters for this algorithm. We present a number of different case studies with different common hand-crafted and automated CL approaches to illustrate this phenomenon, and we find that none of them outperforms optimisation with only Adam with well-chosen hyperparameters. As such, our results contribute to understanding why CL methods work, but at the same time urge caution when claiming positive results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/18/2016

Visualizing and Understanding Curriculum Learning for Long Short-Term Memory Networks

Curriculum Learning emphasizes the order of training instances in a comp...
research
05/10/2020

A SentiWordNet Strategy for Curriculum Learning in Sentiment Analysis

Curriculum Learning (CL) is the idea that learning on a training set seq...
research
01/25/2021

Curriculum Learning: A Survey

Training machine learning models in a meaningful order, from the easy sa...
research
10/11/2022

Discovered Policy Optimisation

Tremendous progress has been made in reinforcement learning (RL) over th...
research
09/27/2018

An Empirical Comparison of Syllabuses for Curriculum Learning

Syllabuses for curriculum learning have been developed on an ad-hoc, per...
research
11/28/2022

Learning to Learn: How to Continuously Teach Humans and Machines

Our education system comprises a series of curricula. For example, when ...
research
11/24/2022

The intersection of machine learning with forecasting and optimisation: theory and applications

Forecasting and optimisation are two major fields of operations research...

Please sign up or login with your details

Forgot password? Click here to reset