Deep Learning is Provably Robust to Symmetric Label Noise

10/26/2022
by   Carey E. Priebe, et al.
0

Deep neural networks (DNNs) are capable of perfectly fitting the training data, including memorizing noisy data. It is commonly believed that memorization hurts generalization. Therefore, many recent works propose mitigation strategies to avoid noisy data or correct memorization. In this work, we step back and ask the question: Can deep learning be robust against massive label noise without any mitigation? We provide an affirmative answer for the case of symmetric label noise: We find that certain DNNs, including under-parameterized and over-parameterized models, can tolerate massive symmetric label noise up to the information-theoretic threshold. By appealing to classical statistical theory and universal consistency of DNNs, we prove that for multiclass classification, L_1-consistent DNN classifiers trained under symmetric label noise can achieve Bayes optimality asymptotically if the label noise probability is less than K-1/K, where K ≥ 2 is the number of classes. Our results show that for symmetric label noise, no mitigation is necessary for L_1-consistent estimators. We conjecture that for general label noise, mitigation strategies that make use of the noisy data will outperform those that ignore the noisy data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/03/2023

Feature Noise Boosts DNN Generalization under Label Noise

The presence of label noise in the training data has a profound impact o...
research
06/27/2022

Towards Harnessing Feature Embedding for Robust Learning with Noisy Labels

The memorization effect of deep neural networks (DNNs) plays a pivotal r...
research
05/05/2019

Learning Graph Neural Networks with Noisy Labels

We study the robustness to symmetric label noise of GNNs training proced...
research
01/14/2021

Tackling Instance-Dependent Label Noise via a Universal Probabilistic Model

The drastic increase of data quantity often brings the severe decrease o...
research
02/10/2023

Confidence-based Reliable Learning under Dual Noises

Deep neural networks (DNNs) have achieved remarkable success in a variet...
research
05/27/2019

Emphasis Regularisation by Gradient Rescaling for Training Deep Neural Networks with Noisy Labels

It is fundamental and challenging to train robust and accurate Deep Neur...
research
03/21/2023

Dynamic-Aware Loss for Learning with Label Noise

Label noise poses a serious threat to deep neural networks (DNNs). Emplo...

Please sign up or login with your details

Forgot password? Click here to reset