Combating Noisy-Labeled and Imbalanced Data by Two Stage Bi-Dimensional Sample Selection

08/21/2022
by   Yiliang Zhang, et al.
0

Robust learning on noisy-labeled data has been an important task in real applications, because label noise directly leads to the poor generalization of deep learning models. Existing label-noise learning methods usually assume that the ground-truth classes of the training data are balanced. However, the real-world data is often imbalanced, leading to the inconsistency between observed and intrinsic class distribution due to label noises. Distribution inconsistency makes the problem of label-noise learning more challenging because it is hard to distinguish clean samples from noisy samples on the intrinsic tail classes. In this paper, we propose a learning framework for label-noise learning with intrinsically long-tailed data. Specifically, we propose a robust sample selection method called two-stage bi-dimensional sample selection (TBSS) to better separate clean samples from noisy samples, especially for the tail classes. TBSS consists of two new separation metrics to jointly separate samples in each class. Extensive experiments on multiple noisy-labeled datasets with intrinsically long-tailed class distribution demonstrate the effectiveness of our method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/01/2022

Combating Noisy Labels in Long-Tailed Image Classification

Most existing methods that cope with noisy labels usually assume that th...
research
11/20/2022

Learning from Long-Tailed Noisy Data with Sample Selection and Balanced Loss

The success of deep learning depends on large-scale and well-curated tra...
research
07/27/2022

Identifying Hard Noise in Long-Tailed Sample Distribution

Conventional de-noising methods rely on the assumption that all samples ...
research
08/26/2021

Robust Long-Tailed Learning under Label Noise

Long-tailed learning has attracted much attention recently, with the goa...
research
07/03/2023

Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition

In real-world scenarios, collected and annotated data often exhibit the ...
research
07/29/2022

Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels

Deep models trained with noisy labels are prone to over-fitting and stru...
research
12/21/2022

Class Prototype-based Cleaner for Label Noise Learning

Semi-supervised learning based methods are current SOTA solutions to the...

Please sign up or login with your details

Forgot password? Click here to reset