Long-tail learning with attributes

04/05/2020
by Dvir Samuel, et al.

Learning to classify images with unbalanced class distributions is challenged by two effects: it is hard to learn tail classes that have few samples, and it is hard to adapt a single model to both richly-sampled and poorly-sampled classes. To address few-shot learning of tail classes, it is useful to fuse additional information in the form of semantic attributes and to classify based on multi-modal information. Unfortunately, as we show below, unbalanced data leads to a "familiarity bias", where classifiers favor sample-rich classes. This bias, together with the lack of calibrated predictions, makes it hard to correctly fuse information from multiple modalities such as vision and attributes. Here we describe DRAGON, a novel modular architecture for long-tail learning designed to address these biases and fuse multi-modal information in the face of unbalanced data. Our architecture is based on three classifiers: a vision expert, a semantic attribute expert that excels on the tail classes, and a debias-and-fuse module that combines their predictions. We present the first benchmark for long-tail learning with attributes and use it to evaluate DRAGON. DRAGON outperforms state-of-the-art long-tail learning models and Generalized Few-Shot-Learning with attributes (GFSL-a) models. DRAGON also obtains state-of-the-art results on some existing benchmarks for single-modality GFSL.
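To make the three-part structure concrete, here is a minimal sketch of two experts whose predictions are debiased and then fused. It is not DRAGON's actual debias-and-fuse module, which the abstract does not specify. As a stand-in, it divides each class probability by a power of that class's training-sample count and then takes a fixed weighted average of the two experts. The function names (`debias`, `fuse`), the `strength` and `visual_weight` parameters, and all numbers are hypothetical and chosen only for illustration.

```python
import math


def softmax(logits):
    """Convert raw scores to probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def debias(probs, class_counts, strength=0.5):
    """Assumed stand-in for debiasing: down-weight sample-rich classes.

    Each probability is divided by count**strength and the result is
    renormalized. This is a simple prior correction, not the paper's method.
    """
    scaled = [p / (n ** strength) for p, n in zip(probs, class_counts)]
    total = sum(scaled)
    return [s / total for s in scaled]


def fuse(visual_probs, attribute_probs, class_counts,
         visual_weight=0.5, strength=0.5):
    """Debias both experts, then mix them with a fixed convex weight."""
    v = debias(visual_probs, class_counts, strength)
    a = debias(attribute_probs, class_counts, strength)
    return [visual_weight * x + (1.0 - visual_weight) * y
            for x, y in zip(v, a)]


# Hypothetical two-class problem: class 0 is a head class, class 1 a tail class.
class_counts = [1000, 10]
visual_probs = softmax([2.0, 1.5])     # vision expert leans toward the head class
attribute_probs = softmax([0.5, 1.0])  # attribute expert leans toward the tail class
fused = fuse(visual_probs, attribute_probs, class_counts)
```

In this toy setup the vision expert alone prefers the head class, while the fused prediction prefers the tail class once the count-based correction is applied. A learned fusion module would replace the fixed `visual_weight` and `strength` with values that depend on the input.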


