HeroLT: Benchmarking Heterogeneous Long-Tailed Learning

07/17/2023
by Haohui Wang, et al.

Long-tailed data distributions are prevalent in a variety of domains, including finance, e-commerce, biomedical science, and cyber security. In such scenarios, the performance of machine learning models is often dominated by the head categories, while the tail categories are learned inadequately. Given the many studies conducted to alleviate this issue, this work aims to provide a systematic view of long-tailed learning from three pivotal angles: (A1) the characterization of data long-tailedness, (A2) the data complexity of various domains, and (A3) the heterogeneity of emerging tasks. To achieve this, we develop the most comprehensive (to the best of our knowledge) long-tailed learning benchmark, named HeroLT, which integrates 13 state-of-the-art algorithms and 6 evaluation metrics on 14 real-world benchmark datasets across 4 tasks from 3 domains. With these novel angles and extensive experiments (264 in total), HeroLT enables researchers and practitioners to evaluate newly proposed methods effectively and fairly against existing baselines on varying types of datasets. Finally, we conclude by highlighting the significant applications of long-tailed learning and identifying several promising future directions. For accessibility and reproducibility, we open-source our benchmark HeroLT and the corresponding results at https://github.com/SSSKJ/HeroLT.
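To make angle (A1) concrete, the sketch below computes two commonly used summaries of class imbalance from a list of labels: the imbalance factor (largest class size divided by smallest) and the Gini coefficient of the class sizes. Both are assumptions chosen for illustration. The abstract does not say which long-tailedness measures HeroLT uses, and the function names `imbalance_factor` and `gini_coefficient` are hypothetical, not part of the HeroLT code base.

```python
from collections import Counter


def imbalance_factor(labels):
    """Ratio of the largest class size to the smallest (1.0 means balanced)."""
    counts = Counter(labels).values()
    return max(counts) / min(counts)


def gini_coefficient(labels):
    """Gini coefficient of the class sizes (0.0 means perfectly balanced).

    Uses the standard sorted-sample formula
    G = sum_i (2i - n - 1) * x_i / (n * sum_i x_i),
    with class sizes x_i sorted ascending and i starting at 1.
    """
    counts = sorted(Counter(labels).values())
    n = len(counts)
    total = sum(counts)
    weighted = sum((2 * i - n - 1) * c for i, c in enumerate(counts, start=1))
    return weighted / (n * total)


# Illustrative toy labels (not data from the benchmark): one head class
# and two progressively smaller tail classes.
toy_labels = ["head"] * 90 + ["mid"] * 9 + ["tail"] * 1
print(imbalance_factor(toy_labels))
print(gini_coefficient(toy_labels))
```

The two numbers capture different aspects of the distribution: the imbalance factor looks only at the extremes, while the Gini coefficient reflects the whole distribution of class sizes, so two datasets with the same imbalance factor can still differ in how heavy the tail is.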

research
05/27/2022

A Survey on Long-Tailed Visual Recognition

The heavy reliance on data is one of the major reasons that currently li...
research
10/09/2021

Deep Long-Tailed Learning: A Survey

Deep long-tailed learning, one of the most challenging problems in visua...
research
05/22/2023

Boosting Long-tailed Object Detection via Step-wise Learning on Smooth-tail Data

Real-world data tends to follow a long-tailed distribution, where the cl...
research
04/06/2021

Adversarial Robustness under Long-Tailed Distribution

Adversarial robustness has attracted extensive studies recently by revea...
research
09/11/2022

Inverse Image Frequency for Long-tailed Image Recognition

The long-tailed distribution is a common phenomenon in the real world. E...
research
07/20/2022

Tackling Long-Tailed Category Distribution Under Domain Shifts

Machine learning models fail to perform well on real-world applications ...
research
12/15/2021

Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification

Real-world data often follows a long-tailed distribution, which makes th...
