Deep Learning is Robust to Massive Label Noise

05/30/2017
by   David Rolnick, et al.
0

Deep neural networks trained on large supervised datasets have led to impressive results in recent years. However, since well-annotated datasets can be prohibitively expensive and time-consuming to collect, recent work has explored the use of larger but noisy datasets that can be more easily obtained. In this paper, we investigate the behavior of deep neural networks on training sets with massively noisy labels. We show that successful learning is possible even with an essentially arbitrary amount of noise. For example, on MNIST we find that accuracy of above 90 percent is still attainable even when the dataset has been diluted with 100 noisy examples for each clean example. Such behavior holds across multiple patterns of label noise, even when noisy labels are biased towards confusing classes. Further, we show how the required dataset size for successful training increases with higher label noise. Finally, we present simple actionable techniques for improving learning in the regime of high label noise.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/19/2020

Error-Bounded Correction of Noisy Labels

To collect large scale annotated data, it is inevitable to introduce lab...
research
07/20/2021

kNet: A Deep kNN Network To Handle Label Noise

Deep Neural Networks require large amounts of labeled data for their tra...
research
10/02/2022

The Dynamic of Consensus in Deep Networks and the Identification of Noisy Labels

Deep neural networks have incredible capacity and expressibility, and ca...
research
12/01/2022

Noisy Label Detection for Speaker Recognition

The success of deep neural networks requires both high annotation qualit...
research
03/22/2021

On the Robustness of Monte Carlo Dropout Trained with Noisy Labels

The memorization effect of deep learning hinders its performance to effe...
research
12/09/2020

A Topological Filter for Learning with Label Noise

Noisy labels can impair the performance of deep neural networks. To tack...
research
06/09/2014

Training Convolutional Networks with Noisy Labels

The availability of large labeled datasets has allowed Convolutional Net...

Please sign up or login with your details

Forgot password? Click here to reset