Unsupervised label noise modeling and loss correction

04/25/2019
by   Eric Arazo, et al.
12

Despite being robust to small amounts of label noise, convolutional neural networks trained with stochastic gradient methods have been shown to easily fit random labels. When there are a mixture of correct and mislabelled targets, networks tend to fit the former before the latter. This suggests using a suitable two-component mixture model as an unsupervised generative model of sample loss values during training to allow online estimation of the probability that a sample is mislabelled. Specifically, we propose a beta mixture to estimate this probability and correct the loss by relying on the network prediction (the so-called bootstrapping loss). We further adapt mixup augmentation to drive our approach a step further. Experiments on CIFAR-10/100 and TinyImageNet demonstrate a robustness to label noise that substantially outperforms recent state-of-the-art. Source code is available at https://git.io/fjsvE

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/27/2022

Synergistic Network Learning and Label Correction for Noise-robust Image Classification

Large training datasets almost always contain examples with inaccurate o...
research
12/03/2022

CrossSplit: Mitigating Label Noise Memorization through Data Splitting

We approach the problem of improving robustness of deep learning algorit...
research
09/13/2016

Making Deep Neural Networks Robust to Label Noise: a Loss Correction Approach

We present a theoretically grounded approach to train deep neural networ...
research
01/27/2021

Towards Robustness to Label Noise in Text Classification via Noise Modeling

Large datasets in NLP suffer from noisy labels, due to erroneous automat...
research
10/28/2017

Label Embedding Network: Learning Label Representation for Soft Training of Deep Networks

We propose a method, called Label Embedding Network, which can learn lab...
research
10/21/2021

Multi-label Classification with Partial Annotations using Class-aware Selective Loss

Large-scale multi-label classification datasets are commonly, and perhap...
research
02/18/2021

Deep Learning for Suicide and Depression Identification with Unsupervised Label Correction

Early detection of suicidal ideation in depressed individuals can allow ...

Please sign up or login with your details

Forgot password? Click here to reset