Uniformly Distributed Category Prototype-Guided Vision-Language Framework for Long-Tail Recognition

08/24/2023
by   Siming Fu, et al.
0

Recently, large-scale pre-trained vision-language models have presented benefits for alleviating class imbalance in long-tailed recognition. However, the long-tailed data distribution can corrupt the representation space, where the distance between head and tail categories is much larger than the distance between two tail categories. This uneven feature space distribution causes the model to exhibit unclear and inseparable decision boundaries on the uniformly distributed test set, which lowers its performance. To address these challenges, we propose the uniformly category prototype-guided vision-language framework to effectively mitigate feature space bias caused by data imbalance. Especially, we generate a set of category prototypes uniformly distributed on a hypersphere. Category prototype-guided mechanism for image-text matching makes the features of different classes converge to these distinct and uniformly distributed category prototypes, which maintain a uniform distribution in the feature space, and improve class boundaries. Additionally, our proposed irrelevant text filtering and attribute enhancement module allows the model to ignore irrelevant noisy text and focus more on key attribute information, thereby enhancing the robustness of our framework. In the image recognition fine-tuning stage, to address the positive bias problem of the learnable classifier, we design the class feature prototype-guided classifier, which compensates for the performance of tail classes while maintaining the performance of head classes. Our method outperforms previous vision-language methods for long-tailed learning work by a large margin and achieves state-of-the-art performance.

READ FULL TEXT
research
08/04/2022

Constructing Balance from Imbalance for Long-tailed Image Recognition

Long-tailed image recognition presents massive challenges to deep learni...
research
05/22/2023

Boosting Long-tailed Object Detection via Step-wise Learning on Smooth-tail Data

Real-world data tends to follow a long-tailed distribution, where the cl...
research
02/25/2020

Deep Representation Learning on Long-tailed Data: A Learnable Embedding Augmentation Perspective

This paper considers learning deep features from long-tailed data. We ob...
research
06/03/2023

Balancing Logit Variation for Long-tailed Semantic Segmentation

Semantic segmentation usually suffers from a long-tail data distribution...
research
08/25/2023

Dual Compensation Residual Networks for Class Imbalanced Learning

Learning generalizable representation and classifier for class-imbalance...
research
12/02/2022

Compound Batch Normalization for Long-tailed Image Classification

Significant progress has been made in learning image classification neur...
research
03/29/2023

FEND: A Future Enhanced Distribution-Aware Contrastive Learning Framework for Long-tail Trajectory Prediction

Predicting the future trajectories of the traffic agents is a gordian te...

Please sign up or login with your details

Forgot password? Click here to reset