Feature Space Augmentation for Long-Tailed Data

08/09/2020
by   Peng Chu, et al.
4

Real-world data often follow a long-tailed distribution as the frequency of each class is typically different. For example, a dataset can have a large number of under-represented classes and a few classes with more than sufficient data. However, a model to represent the dataset is usually expected to have reasonably homogeneous performances across classes. Introducing class-balanced loss and advanced methods on data re-sampling and augmentation are among the best practices to alleviate the data imbalance problem. However, the other part of the problem about the under-represented classes will have to rely on additional knowledge to recover the missing information. In this work, we present a novel approach to address the long-tailed problem by augmenting the under-represented classes in the feature space with the features learned from the classes with ample samples. In particular, we decompose the features of each class into a class-generic component and a class-specific component using class activation maps. Novel samples of under-represented classes are then generated on the fly during training stages by fusing the class-specific features from the under-represented classes with the class-generic features from confusing classes. Our results on different datasets such as iNaturalist, ImageNet-LT, Places-LT and a long-tailed version of CIFAR have shown the state of the art performances.

READ FULL TEXT
research
02/28/2022

Long-Tailed Classification with Gradual Balanced Loss and Adaptive Feature Generation

The real-world data distribution is essentially long-tailed, which poses...
research
02/25/2021

FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation

Recent methods for long-tailed instance segmentation still struggle on r...
research
05/01/2021

Breadcrumbs: Adversarial Class-Balanced Sampling for Long-tailed Recognition

The problem of long-tailed recognition, where the number of examples per...
research
03/23/2021

MetaSAug: Meta Semantic Augmentation for Long-Tailed Visual Recognition

Real-world training data usually exhibits long-tailed distribution, wher...
research
01/16/2019

Class-Balanced Loss Based on Effective Number of Samples

With the rapid increase of large-scale, real-world datasets, it becomes ...
research
12/30/2022

Delving into Semantic Scale Imbalance

Model bias triggered by long-tailed data has been widely studied. Howeve...
research
07/26/2022

Class-Aware Universum Inspired Re-Balance Learning for Long-Tailed Recognition

Data augmentation for minority classes is an effective strategy for long...

Please sign up or login with your details

Forgot password? Click here to reset