GradTail: Learning Long-Tailed Data Using Gradient-based Sample Weighting

01/16/2022
by   Zhao Chen, et al.
0

We propose GradTail, an algorithm that uses gradients to improve model performance on the fly in the face of long-tailed training data distributions. Unlike conventional long-tail classifiers which operate on converged - and possibly overfit - models, we demonstrate that an approach based on gradient dot product agreement can isolate long-tailed data early on during model training and improve performance by dynamically picking higher sample weights for that data. We show that such upweighting leads to model improvements for both classification and regression models, the latter of which are relatively unexplored in the long-tail literature, and that the long-tail examples found by gradient alignment are consistent with our semantic expectations.

READ FULL TEXT

page 2

page 6

page 7

page 12

page 15

research
03/31/2023

SuperDisco: Super-Class Discovery Improves Visual Recognition for the Long-Tail

Modern image classifiers perform well on populated classes, while degrad...
research
08/07/2022

Sample hardness based gradient loss for long-tailed cervical cell detection

Due to the difficulty of cancer samples collection and annotation, cervi...
research
07/20/2023

Long-Tail Theory under Gaussian Mixtures

We suggest a simple Gaussian mixture model for data generation that comp...
research
10/11/2022

The Equalization Losses: Gradient-Driven Training for Long-tailed Object Recognition

Long-tail distribution is widely spread in real-world applications. Due ...
research
03/11/2021

Towards Interpreting and Mitigating Shortcut Learning Behavior of NLU models

Recent studies indicate that NLU models are prone to rely on shortcut fe...
research
02/13/2022

Surgical Scheduling via Optimization and Machine Learning with Long-Tailed Data

Using data from cardiovascular surgery patients with long and highly var...
research
06/22/2020

ELF: An Early-Exiting Framework for Long-Tailed Classification

The natural world often follows a long-tailed data distribution where on...

Please sign up or login with your details

Forgot password? Click here to reset