Learning with Bounded Instance- and Label-dependent Label Noise

09/12/2017
by   Jiacheng Cheng, et al.
0

Instance- and label-dependent label noise (ILN) is widely existed in real-world datasets but has been rarely studied. In this paper, we focus on a particular case of ILN where the label noise rates, representing the probabilities that the true labels of examples flip into the corrupted labels, have upper bounds. We propose to handle this bounded instance- and label-dependent label noise under two different conditions. First, theoretically, we prove that when the marginal distributions P(X|Y=+1) and P(X|Y=-1) have non-overlapping supports, we can recover every noisy example's true label and perform supervised learning directly on the cleansed examples. Second, for the overlapping situation, we propose a novel approach to learn a well-performing classifier which needs only a few noisy examples to be labeled manually. Experimental results demonstrate that our method works well on both synthetic and real-world datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/11/2020

Confidence Scores Make Instance-dependent Label-noise Learning Possible

Learning with noisy labels has drawn a lot of attention. In this area, m...
research
08/11/2017

Learning from Noisy Label Distributions

In this paper, we consider a novel machine learning problem, that is, le...
research
01/25/2022

GMM Discriminant Analysis with Noisy Label for Each Class

Real world datasets often contain noisy labels, and learning from such d...
research
08/07/2018

Instance-Dependent PU Learning by Bayesian Optimal Relabeling

When learning from positive and unlabelled data, it is a strong assumpti...
research
10/18/2022

CNT (Conditioning on Noisy Targets): A new Algorithm for Leveraging Top-Down Feedback

We propose a novel regularizer for supervised learning called Conditioni...
research
03/26/2021

Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks

We algorithmically identify label errors in the test sets of 10 of the m...
research
06/24/2022

How many labelers do you have? A closer look at gold-standard labels

The construction of most supervised learning datasets revolves around co...

Please sign up or login with your details

Forgot password? Click here to reset