Approximating Instance-Dependent Noise via Instance-Confidence Embedding

03/25/2021
by   Yivan Zhang, et al.
1

Label noise in multiclass classification is a major obstacle to the deployment of learning systems. However, unlike the widely used class-conditional noise (CCN) assumption that the noisy label is independent of the input feature given the true label, label noise in real-world datasets can be aleatory and heavily dependent on individual instances. In this work, we investigate the instance-dependent noise (IDN) model and propose an efficient approximation of IDN to capture the instance-specific label corruption. Concretely, noting the fact that most columns of the IDN transition matrix have only limited influence on the class-posterior estimation, we propose a variational approximation that uses a single-scalar confidence parameter. To cope with the situation where the mapping from the instance to its confidence value could vary significantly for two adjacent instances, we suggest using instance embedding that assigns a trainable parameter to each instance. The resulting instance-confidence embedding (ICE) method not only performs well under label noise but also can effectively detect ambiguous or mislabeled instances. We validate its utility on various image and text classification tasks.

READ FULL TEXT

page 9

page 17

page 19

page 21

page 22

research
01/11/2020

Confidence Scores Make Instance-dependent Label-noise Learning Possible

Learning with noisy labels has drawn a lot of attention. In this area, m...
research
06/14/2020

Parts-dependent Label Noise: Towards Instance-dependent Label Noise

Learning with the instance-dependent label noise is challenging, because...
research
06/06/2022

Instance-Dependent Label-Noise Learning with Manifold-Regularized Transition Matrix Estimation

In label-noise learning, estimating the transition matrix has attracted ...
research
12/10/2020

Beyond Class-Conditional Assumption: A Primary Attempt to Combat Instance-Dependent Label Noise

Supervised learning under label noise has seen numerous advances recentl...
research
07/19/2023

GenKL: An Iterative Framework for Resolving Label Ambiguity and Label Non-conformity in Web Images Via a New Generalized KL Divergence

Web image datasets curated online inherently contain ambiguous in-distri...
research
10/05/2020

Learning with Instance-Dependent Label Noise: A Sample Sieve Approach

Human-annotated labels are often prone to noise, and the presence of suc...
research
06/05/2023

Transferring Annotator- and Instance-dependent Transition Matrix for Learning from Crowds

Learning from crowds describes that the annotations of training data are...

Please sign up or login with your details

Forgot password? Click here to reset