Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition

by Chao Liang, et al.

In real-world scenarios, collected and annotated data often exhibit multiple classes and a long-tailed distribution. Moreover, label noise is inevitable in large-scale annotation and hinders the application of learning-based models. Although many deep-learning methods have been proposed for long-tailed multi-label recognition and for label noise separately, learning with noisy labels on long-tailed multi-label visual data has not been well studied, because the long-tailed distribution is entangled with multi-label correlation. To tackle this critical yet thorny problem, this paper focuses on reducing noise by exploiting inherent properties of multi-label classification and long-tailed learning under noisy conditions. Specifically, we propose a Stitch-Up augmentation that synthesizes a cleaner sample, directly reducing multi-label noise by stitching together multiple noisy training samples. Equipped with Stitch-Up, a Heterogeneous Co-Learning framework is further designed to leverage the inconsistency between long-tailed and balanced distributions, yielding cleaner labels for more robust representation learning on noisy long-tailed data. To validate our method, we build two challenging benchmarks, VOC-MLT-Noise and COCO-MLT-Noise. Extensive experiments demonstrate the effectiveness of the proposed method, which achieves superior results compared with a variety of baselines.
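The abstract does not spell out how samples are stitched, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes that stitching means concatenating images side by side and that the synthesized label is the element-wise union (logical OR) of the input multi-hot labels. The function name `stitch_up` and the toy list-based image format are hypothetical.

```python
from typing import List, Sequence, Tuple

Image = List[List[float]]  # toy grayscale image: rows of pixel values
Labels = List[int]         # multi-hot label vector, one entry per class


def stitch_up(samples: Sequence[Tuple[Image, Labels]]) -> Tuple[Image, Labels]:
    """Stitch images side by side and take the union of their label vectors.

    Assumption (not stated in the abstract): stitching is horizontal
    concatenation and the new label is the element-wise OR of the inputs.
    """
    if not samples:
        raise ValueError("need at least one sample")
    height = len(samples[0][0])
    num_classes = len(samples[0][1])
    stitched: Image = [[] for _ in range(height)]
    union: Labels = [0] * num_classes
    for image, labels in samples:
        if len(image) != height or len(labels) != num_classes:
            raise ValueError("all samples must share height and class count")
        # Append this image's pixels to the right of what is already stitched.
        for row_out, row_in in zip(stitched, image):
            row_out.extend(row_in)
        # A class is positive in the result if it is positive in any input.
        union = [a | b for a, b in zip(union, labels)]
    return stitched, union


# Two toy 2x2 images with (possibly noisy) multi-hot labels over 3 classes.
a = ([[1.0, 1.0], [1.0, 1.0]], [1, 0, 0])
b = ([[2.0, 2.0], [2.0, 2.0]], [0, 0, 1])
image, labels = stitch_up([a, b])
```

Under this assumed reading, a class that is wrongly marked positive for one image can still be a correct label for the stitched sample whenever that class truly appears in another stitched image, which is one way the synthesized sample could end up cleaner than its inputs.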




