NoiseRank: Unsupervised Label Noise Reduction with Dependence Models

03/15/2020
by   Karishma Sharma, et al.
2

Label noise is increasingly prevalent in datasets acquired from noisy channels. Existing approaches that detect and remove label noise generally rely on some form of supervision, which is not scalable and error-prone. In this paper, we propose NoiseRank, for unsupervised label noise reduction using Markov Random Fields (MRF). We construct a dependence model to estimate the posterior probability of an instance being incorrectly labeled given the dataset, and rank instances based on their estimated probabilities. Our method 1) Does not require supervision from ground-truth labels, or priors on label or noise distribution. 2) It is interpretable by design, enabling transparency in label noise removal. 3) It is agnostic to classifier architecture/optimization framework and content modality. These advantages enable wide applicability in real noise settings, unlike prior works constrained by one or more conditions. NoiseRank improves state-of-the-art classification on Food101-N ( 20 and is effective on high noise Clothing-1M ( 40

READ FULL TEXT

page 13

page 14

research
05/28/2021

Rethinking Noisy Label Models: Labeler-Dependent Noise with Adversarial Awareness

Most studies on learning from noisy labels rely on unrealistic models of...
research
11/20/2017

CleanNet: Transfer Learning for Scalable Image Classifier Training with Label Noise

In this paper, we study the problem of learning image classification mod...
research
12/16/2022

Instance-specific Label Distribution Regularization for Learning with Label Noise

Modeling noise transition matrix is a kind of promising method for learn...
research
07/10/2023

Leveraging an Alignment Set in Tackling Instance-Dependent Label Noise

Noisy training labels can hurt model performance. Most approaches that a...
research
09/08/2023

Generating the Ground Truth: Synthetic Data for Label Noise Research

Most real-world classification tasks suffer from label noise to some ext...
research
06/24/2023

Cross-Validation Is All You Need: A Statistical Approach To Label Noise Estimation

Label noise is prevalent in machine learning datasets. It is crucial to ...
research
02/18/2021

Deep Learning for Suicide and Depression Identification with Unsupervised Label Correction

Early detection of suicidal ideation in depressed individuals can allow ...

Please sign up or login with your details

Forgot password? Click here to reset