Provable Advantage of Curriculum Learning on Parity Targets with Mixed Inputs

06/29/2023
by   Emmanuel Abbe, et al.
0

Experimental results have shown that curriculum learning, i.e., presenting simpler examples before more complex ones, can improve the efficiency of learning. Some recent theoretical results also showed that changing the sampling distribution can help neural networks learn parities, with formal results only for large learning rates and one-step arguments. Here we show a separation result in the number of training steps with standard (bounded) learning rates on a common sample distribution: if the data distribution is a mixture of sparse and dense inputs, there exists a regime in which a 2-layer ReLU neural network trained by a curriculum noisy-GD (or SGD) algorithm that uses sparse examples first, can learn parities of sufficiently large degree, while any fully connected neural network of possibly larger width or depth trained by noisy-GD on the unordered samples cannot learn without additional steps. We also provide experimental results supporting the qualitative separation beyond the specific regime of the theoretical results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/27/2021

Statistical Measures For Defining Curriculum Scoring Function

Curriculum learning is a training strategy that sorts the training examp...
research
05/18/2022

LeRaC: Learning Rate Curriculum

Most curriculum learning methods require an approach to sort the data sa...
research
01/31/2023

A Mathematical Model for Curriculum Learning

Curriculum learning (CL) - training using samples that are generated and...
research
10/16/2020

A case where a spindly two-layer linear network whips any neural network with a fully connected input layer

It was conjectured that any neural network of any structure and arbitrar...
research
12/14/2022

Maximal Initial Learning Rates in Deep ReLU Networks

Training a neural network requires choosing a suitable learning rate, in...
research
12/05/2020

When Do Curricula Work?

Inspired by human learning, researchers have proposed ordering examples ...
research
01/30/2023

Generalization on the Unseen, Logic Reasoning and Degree Curriculum

This paper considers the learning of logical (Boolean) functions with fo...

Please sign up or login with your details

Forgot password? Click here to reset