You Only Need End-to-End Training for Long-Tailed Recognition

12/11/2021
by Zhiwei Zhang, et al.

The generalization gap on long-tailed data sets arises largely because most categories have only a few training samples. Decoupled training achieves better performance by training the backbone and the classifier separately. What causes the poorer performance of end-to-end model training (e.g., logits margin-based methods)? In this work, we identify a key factor that affects the learning of the classifier: low-entropy, channel-correlated features at the classifier's input. From the perspective of information theory, we analyze why cross-entropy loss tends to produce highly correlated features on imbalanced data. In addition, we theoretically analyze and prove the impact of these features on the gradients of the classifier weights, the condition number of the Hessian, and logits margin-based approaches. We therefore first propose Channel Whitening, which decorrelates ("scatters") the classifier's inputs to decouple the weight updates and reshape the skewed decision boundary; combined with a logits margin-based method, it achieves satisfactory results. However, when the number of minor classes is large, batch imbalance and heavier participation in training cause over-fitting of the major classes. To address these problems, we also propose two novel modules, Block-based Relatively Balanced Batch Sampler (B3RS) and Batch Embedded Training (BET), which enable end-to-end training to achieve even better performance than decoupled training. Experimental results on the long-tailed classification benchmarks CIFAR-LT and ImageNet-LT demonstrate the effectiveness of our method.
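
To illustrate what decorrelating the classifier's inputs can look like, the following is a minimal NumPy sketch of ZCA whitening applied across the channel dimension of a batch of features. It is not the authors' Channel Whitening module, whose exact formulation is given in the full paper. The function name `whiten_channels`, the `eps` value, and the synthetic data are assumptions made for this example.

```python
import numpy as np

def whiten_channels(features, eps=1e-5):
    """ZCA-whiten a (batch, channels) matrix so its channels are decorrelated.

    Hypothetical helper for illustration only; not the paper's module.
    """
    # Center each channel over the batch.
    centered = features - features.mean(axis=0, keepdims=True)
    # Channel-by-channel sample covariance.
    cov = centered.T @ centered / (features.shape[0] - 1)
    # Symmetric eigendecomposition; eps guards against tiny eigenvalues.
    eigvals, eigvecs = np.linalg.eigh(cov)
    inv_sqrt = eigvecs @ np.diag(1.0 / np.sqrt(eigvals + eps)) @ eigvecs.T
    return centered @ inv_sqrt

# Synthetic, highly correlated features: four noisy copies of one signal.
rng = np.random.default_rng(0)
base = rng.normal(size=(512, 1))
features = np.hstack([base + 0.1 * rng.normal(size=(512, 1)) for _ in range(4)])

corr_before = np.corrcoef(features, rowvar=False)
whitened = whiten_channels(features)
cov_after = np.cov(whitened, rowvar=False)
```

In this toy setting the off-diagonal entries of `corr_before` are close to 1, while `cov_after` is close to the identity matrix, so each channel of the whitened features carries its own direction of variation.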


