ScanMix: Learning from Severe Label Noise via Semantic Clustering and Semi-Supervised Learning

03/21/2021
by   Ragav Sachdeva, et al.
4

In this paper, we address the problem of training deep neural networks in the presence of severe label noise. Our proposed training algorithm ScanMix, combines semantic clustering with semi-supervised learning (SSL) to improve the feature representations and enable an accurate identification of noisy samples, even in severe label noise scenarios. To be specific, ScanMix is designed based on the expectation maximisation (EM) framework, where the E-step estimates the value of a latent variable to cluster the training images based on their appearance representations and classification results, and the M-step optimises the SSL classification and learns effective feature representations via semantic clustering. In our evaluations, we show state-of-the-art results on standard benchmarks for symmetric, asymmetric and semantic label noise on CIFAR-10 and CIFAR-100, as well as large scale real label noise on WebVision. Most notably, for the benchmarks contaminated with large noise rates (80 above), our results are up to 27 available at https://github.com/ragavsachdeva/ScanMix.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/13/2023

Manifold DivideMix: A Semi-Supervised Contrastive Learning Framework for Severe Label Noise

Deep neural networks have proven to be highly effective when large amoun...
research
06/25/2019

Semi-Supervised Learning with Self-Supervised Networks

Recent advances in semi-supervised learning have shown tremendous potent...
research
06/16/2020

Building One-Shot Semi-supervised (BOSS) Learning up to Fully Supervised Performance

Reaching the performance of fully supervised learning with unlabeled dat...
research
11/11/2020

EvidentialMix: Learning with Combined Open-set and Closed-set Noisy Labels

The efficacy of deep learning depends on large-scale data sets that have...
research
05/29/2023

Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning

Recent advances in robust semi-supervised learning (SSL) typically filte...
research
02/16/2017

Semi-supervised Learning for Discrete Choice Models

We introduce a semi-supervised discrete choice model to calibrate discre...
research
06/20/2018

DEFRAG: Deep Euclidean Feature Representations through Adaptation on the Grassmann Manifold

We propose a novel technique for training deep networks with the objecti...

Please sign up or login with your details

Forgot password? Click here to reset