Error-Bounded Correction of Noisy Labels

11/19/2020
by Songzhu Zheng, et al.

Collecting large-scale annotated data inevitably introduces label noise, i.e., incorrect class labels. To be robust against label noise, many successful methods rely on noisy classifiers (i.e., models trained on the noisy training data) to determine whether a label is trustworthy. However, it remains unknown why this heuristic works well in practice. In this paper, we provide the first theoretical explanation for these methods. We prove that the prediction of a noisy classifier can indeed be a good indicator of whether the label of a training example is clean. Based on this theoretical result, we propose a novel algorithm that corrects labels based on the noisy classifier's prediction. The corrected labels are consistent with the true Bayes optimal classifier with high probability. We incorporate our label correction algorithm into the training of deep neural networks and train models that achieve superior test performance on multiple public datasets.
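
To make the idea of correcting labels from a noisy classifier's prediction concrete, here is a minimal sketch in plain Python. It is not the algorithm as specified in the paper: the abstract gives no rule, so the ratio test, the function name `correct_labels`, and the threshold `delta` are all illustrative assumptions. A label is replaced by the classifier's top prediction only when the probability assigned to the given label is small relative to the top probability.

```python
def correct_labels(probs, noisy_labels, delta=0.5):
    """Flip a label when the noisy classifier strongly disagrees with it.

    probs        : list of per-class probability lists from a noisy classifier
    noisy_labels : list of integer class labels, possibly incorrect
    delta        : hypothetical threshold in (0, 1]; smaller means fewer flips

    Assumption: the ratio rule below is an illustrative choice, not a rule
    stated in the abstract.
    """
    corrected = []
    for p, y in zip(probs, noisy_labels):
        # Index of the class the noisy classifier finds most likely.
        top = max(range(len(p)), key=lambda k: p[k])
        # How plausible the given label is relative to the top prediction.
        ratio = p[y] / p[top]
        # Keep the label unless it is much less likely than the top class.
        corrected.append(top if ratio < delta else y)
    return corrected


# Toy predictions for three examples, all labeled as class 1.
probs = [[0.9, 0.1], [0.2, 0.8], [0.55, 0.45]]
noisy_labels = [1, 1, 1]
corrected = correct_labels(probs, noisy_labels, delta=0.5)
```

In this toy input only the first example is relabeled, because the classifier assigns its given label far less probability than the top class; the third example is left alone even though the classifier mildly prefers class 0. In a training loop, such a step would presumably be applied periodically using the current model's predictions.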

research
05/30/2017

Deep Learning is Robust to Massive Label Noise

Deep neural networks trained on large supervised datasets have led to im...
research
06/01/2021

Instance Correction for Learning with Open-set Noisy Labels

The problem of open-set noisy labels denotes that part of training data ...
research
11/24/2018

Alternating Loss Correction for Preterm-Birth Prediction from EHR Data with Noisy Labels

In this paper we are interested in the prediction of preterm birth based...
research
12/09/2020

A Topological Filter for Learning with Label Noise

Noisy labels can impair the performance of deep neural networks. To tack...
research
11/09/2018

Skeptical Deep Learning with Distribution Correction

Recently deep neural networks have been successfully used for various cl...
research
03/13/2021

Learning with Feature-Dependent Label Noise: A Progressive Approach

Label noise is frequently observed in real-world large-scale datasets. T...
research
05/29/2018

Classification with imperfect training labels

We study the effect of imperfect training data labels on the performance...
