Long-Tailed Recognition via Weight Balancing

03/27/2022
by   Shaden Alshammari, et al.
0

In the real open world, data tends to follow long-tailed class distributions, motivating the well-studied long-tailed recognition (LTR) problem. Naive training produces models that are biased toward common classes in terms of higher accuracy. The key to addressing LTR is to balance various aspects including data distribution, training losses, and gradients in learning. We explore an orthogonal direction, weight balancing, motivated by the empirical observation that the naively trained classifier has "artificially" larger weights in norm for common classes (because there exists abundant data to train them, unlike the rare classes). We investigate three techniques to balance weights, L2-normalization, weight decay, and MaxNorm. We first point out that L2-normalization "perfectly" balances per-class weights to be unit norm, but such a hard constraint might prevent classes from learning better classifiers. In contrast, weight decay penalizes larger weights more heavily and so learns small balanced weights; the MaxNorm constraint encourages growing small weights within a norm ball but caps all the weights by the radius. Our extensive study shows that both help learn balanced weights and greatly improve the LTR accuracy. Surprisingly, weight decay, although underexplored in LTR, significantly improves over prior work. Therefore, we adopt a two-stage training paradigm and propose a simple approach to LTR: (1) learning features using the cross-entropy loss by tuning weight decay, and (2) learning classifiers using class-balanced loss by tuning weight decay and MaxNorm. Our approach achieves the state-of-the-art accuracy on five standard benchmarks, serving as a future baseline for long-tailed recognition.

READ FULL TEXT

page 1

page 3

research
05/26/2023

Exploring Weight Balancing on Long-Tailed Recognition Problem

Recognition problems in long-tailed data, where the sample size per clas...
research
08/04/2023

RAHNet: Retrieval Augmented Hybrid Network for Long-tailed Graph Classification

Graph classification is a crucial task in many real-world multimedia app...
research
02/01/2023

Learning Prototype Classifiers for Long-Tailed Recognition

The problem of long-tailed recognition (LTR) has received attention in r...
research
12/11/2021

You Only Need End-to-End Training for Long-Tailed Recognition

The generalization gap on the long-tailed data sets is largely owing to ...
research
02/28/2023

Rethink Long-tailed Recognition with Vision Transforms

In the real world, data tends to follow long-tailed distributions w.r.t....
research
12/27/2020

Understanding Decoupled and Early Weight Decay

Weight decay (WD) is a traditional regularization technique in deep lear...
research
12/03/2022

Leveraging Angular Information Between Feature and Classifier for Long-tailed Learning: A Prediction Reformulation Approach

Deep neural networks still struggle on long-tailed image datasets, and o...

Please sign up or login with your details

Forgot password? Click here to reset