Leveraging an Alignment Set in Tackling Instance-Dependent Label Noise

07/10/2023
by Donna Tjandra, et al.

Noisy training labels can hurt model performance. Most approaches that aim to address label noise assume that the noise is independent of the input features. In practice, however, label noise is often feature- or instance-dependent, and therefore biased (i.e., some instances are more likely to be mislabeled than others). For instance, in clinical care, female patients are more likely to be under-diagnosed for cardiovascular disease than male patients. Approaches that ignore this dependence can produce models with poor discriminative performance and, in many healthcare settings, can exacerbate health disparities. In light of these limitations, we propose a two-stage approach to learning in the presence of instance-dependent label noise. Our approach utilizes an alignment set, a small subset of data for which we know both the observed and the ground truth labels. On several tasks, our approach leads to consistent improvements over the state of the art in discriminative performance (area under the receiver operating characteristic curve, AUROC) while mitigating bias (area under the equalized odds curve, AUEOC). For example, when predicting acute respiratory failure onset on the MIMIC-III dataset, our approach achieves a harmonic mean of AUROC and AUEOC of 0.84 (standard deviation [SD] 0.01), while that of the next best baseline is 0.81 (SD 0.01). Overall, compared to existing approaches, our approach improves discriminative performance while mitigating potential bias in the presence of instance-dependent label noise.
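To make two ideas from the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It shows (1) one simple way to simulate instance-dependent label noise, where positive cases are missed at a rate that depends on a group attribute, and (2) the harmonic mean used to combine AUROC and AUEOC into a single number. The function names `underdiagnose` and `harmonic_mean`, the `miss_rate` mapping, and the example values are all hypothetical and chosen only for illustration.

```python
import random


def harmonic_mean(auroc, aueoc):
    """Harmonic mean of two scores in [0, 1]; low if either score is low."""
    if auroc <= 0 or aueoc <= 0:
        return 0.0
    return 2 * auroc * aueoc / (auroc + aueoc)


def underdiagnose(labels, groups, miss_rate, seed=0):
    """Simulate group-dependent label noise (an illustrative assumption).

    A true positive (label 1) is flipped to 0 with probability
    miss_rate[group], so some groups are mislabeled more often than
    others. Negative labels are left unchanged.
    """
    rng = random.Random(seed)
    noisy = []
    for y, g in zip(labels, groups):
        if y == 1 and rng.random() < miss_rate[g]:
            noisy.append(0)  # positive case recorded as negative
        else:
            noisy.append(y)
    return noisy


if __name__ == "__main__":
    true_labels = [1, 1, 1, 1, 0, 0]
    groups = ["a", "a", "b", "b", "a", "b"]
    # Hypothetical miss rates: group "b" is under-diagnosed more often.
    observed = underdiagnose(true_labels, groups, {"a": 0.1, "b": 0.5})
    print(observed)
    print(harmonic_mean(0.9, 0.6))
```

Because the harmonic mean is dominated by the smaller of its two inputs, a model cannot score well on it by trading away fairness (AUEOC) for discriminative performance (AUROC), or vice versa.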


research
05/28/2021

Rethinking Noisy Label Models: Labeler-Dependent Noise with Adversarial Awareness

Most studies on learning from noisy labels rely on unrealistic models of...
research
12/22/2020

A Second-Order Approach to Learning with Instance-Dependent Label Noise

The presence of label noise often misleads the training of deep neural n...
research
12/06/2021

Two Wrongs Don't Make a Right: Combating Confirmation Bias in Learning with Label Noise

Noisy labels damage the performance of deep networks. For robust learnin...
research
06/14/2020

Parts-dependent Label Noise: Towards Instance-dependent Label Noise

Learning with the instance-dependent label noise is challenging, because...
research
03/15/2020

NoiseRank: Unsupervised Label Noise Reduction with Dependence Models

Label noise is increasingly prevalent in datasets acquired from noisy ch...
research
11/29/2022

On Robust Learning from Noisy Labels: A Permutation Layer Approach

The existence of label noise imposes significant challenges (e.g., poor ...
research
03/13/2021

Supervised Learning in the Presence of Noise: Application in ICD-10 Code Classification

ICD coding is the international standard for capturing and reporting hea...
