Rethink Long-tailed Recognition with Vision Transformers

02/28/2023
by Zhengzhuo Xu, et al.

In the real world, data tends to follow long-tailed distributions w.r.t. class or attribute, motivating the challenging Long-Tailed Recognition (LTR) problem. In this paper, we revisit recent LTR methods with promising Vision Transformers (ViT). We find that 1) ViT is hard to train with long-tailed data, and 2) ViT learns generalized features in an unsupervised manner, such as masked generative training, on either long-tailed or balanced datasets. Hence, we propose to adopt unsupervised learning to utilize long-tailed data. Furthermore, we propose Predictive Distribution Calibration (PDC) as a novel metric for LTR, where the model tends to simply classify inputs into common classes. PDC quantitatively measures how well the model's predictive preferences are calibrated. On this basis, we find that many LTR approaches alleviate this preference only slightly, despite improving accuracy. Extensive experiments on benchmark datasets validate that PDC reflects the model's predictive preference precisely, which is consistent with the visualization.


research
05/18/2023

Adjusting Logit in Gaussian Form for Long-Tailed Visual Recognition

It is not uncommon that real-world data are distributed with a long tail...
research
12/05/2022

Learning Imbalanced Data with Vision Transformers

The real-world data tends to be heavily imbalanced and severely skew the...
research
12/14/2021

Margin Calibration for Long-Tailed Visual Recognition

The long-tailed class distribution in visual recognition tasks poses gre...
research
03/27/2022

Long-Tailed Recognition via Weight Balancing

In the real open world, data tends to follow long-tailed class distribut...
research
06/10/2022

Balanced Product of Experts for Long-Tailed Recognition

Many real-world recognition problems suffer from an imbalanced or long-t...
research
08/31/2022

Temporal Flow Mask Attention for Open-Set Long-Tailed Recognition of Wild Animals in Camera-Trap Images

Camera traps, unmanned observation devices, and deep learning-based imag...
research
05/05/2021

Iterative Human and Automated Identification of Wildlife Images

Camera trapping is increasingly used to monitor wildlife, but this techn...
