Mutual Exclusive Modulator for Long-Tailed Recognition

by Haixu Long, et al.

Long-tailed recognition (LTR) is the task of learning high-performance classifiers from training data that is extremely imbalanced across categories. Most existing works address the problem by either enhancing the features of tail classes or re-balancing the classifiers to reduce the inductive bias. In this paper, we look into the root cause of the LTR task, i.e., the number of training samples per class is greatly imbalanced, and propose a straightforward solution. We split the categories into three groups, i.e., many, medium and few, according to the number of training images. The three groups of categories are predicted separately to reduce the difficulty of classification. This idea naturally raises a new problem: how to assign a given sample to the right class group? We introduce a mutual exclusive modulator which estimates the probability of an image belonging to each group. In particular, the modulator consists of a light-weight module and is learned with a mutual exclusive objective. Hence, the output probabilities of the modulator encode the data volume clues of the training dataset. They are further utilized as prior information to guide the prediction of the classifier. We conduct extensive experiments on multiple datasets, e.g., ImageNet-LT, Places-LT and iNaturalist 2018, to evaluate the proposed approach. Our method achieves competitive performance compared to the state-of-the-art benchmarks.
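The grouping and prior-guided prediction described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the group thresholds or the rule for combining the modulator output with the classifier. The sketch assumes the commonly used cut-offs of more than 100 images for "many" and fewer than 20 for "few", and assumes the final class probability is the group probability multiplied by a softmax over the classes inside that group. The function names `split_groups` and `modulated_prediction` are hypothetical.

```python
import math

GROUP_NAMES = ("many", "medium", "few")


def split_groups(class_counts, many_thresh=100, few_thresh=20):
    """Assign each class index to a group by its number of training images.

    The thresholds are an assumption (a common LTR convention), not taken
    from the abstract.
    """
    groups = {name: [] for name in GROUP_NAMES}
    for cls, n in enumerate(class_counts):
        if n > many_thresh:
            groups["many"].append(cls)
        elif n < few_thresh:
            groups["few"].append(cls)
        else:
            groups["medium"].append(cls)
    return groups


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def modulated_prediction(class_logits, group_logits, groups):
    """Combine per-group class predictions with group probabilities as a prior.

    Assumed rule: p(class) = p(group of class) * p(class | group).
    `group_logits` stands in for the modulator output (one logit per group).
    """
    active = [g for g in GROUP_NAMES if groups[g]]  # skip empty groups
    group_probs = dict(zip(active, softmax([group_logits[g] for g in active])))
    probs = [0.0] * len(class_logits)
    for g in active:
        members = groups[g]
        within = softmax([class_logits[c] for c in members])
        for c, p in zip(members, within):
            probs[c] = group_probs[g] * p
    return probs


# Toy example: six classes with a long-tailed count distribution.
counts = [500, 150, 60, 30, 10, 5]
groups = split_groups(counts)
class_logits = [2.0, 1.0, 0.5, 0.2, 1.5, 0.1]
group_logits = {"many": 0.1, "medium": 0.2, "few": 2.0}
probs = modulated_prediction(class_logits, group_logits, groups)
print(groups)
print(max(range(len(probs)), key=probs.__getitem__))
```

In this toy input the raw class logits favor a head class, while the assumed group prior shifts the final prediction toward a class in the "few" group, which is the kind of guidance the abstract attributes to the modulator.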




Calibrating Class Activation Maps for Long-Tailed Visual Recognition

Real-world visual recognition problems often exhibit long-tailed distrib...

CUDA: Curriculum of Data Augmentation for Long-Tailed Recognition

Class imbalance problems frequently occur in real-world tasks, and conve...

The Majority Can Help The Minority: Context-rich Minority Oversampling for Long-tailed Classification

The problem of class imbalanced data lies in that the generalization per...

VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition

Deep learning-based models encounter challenges when processing long-tai...

Difficulty-Net: Learning to Predict Difficulty for Long-Tailed Recognition

Long-tailed datasets, where head classes comprise much more training sam...

You Only Need End-to-End Training for Long-Tailed Recognition

The generalization gap on the long-tailed data sets is largely owing to ...

No One Left Behind: Improving the Worst Categories in Long-Tailed Learning

Unlike the case when using a balanced training dataset, the per-class re...
