Few-Shot Image Classification Benchmarks are Too Far From Reality: Build Back Better with Semantic Task Sampling

05/10/2022
by Etienne Bennequin, et al.

New methods for Few-Shot Image Classification are published every day, each reporting better performance on academic benchmarks. Nevertheless, we observe that current benchmarks do not accurately represent the real industrial use cases that we have encountered. In this work, through both qualitative and quantitative studies, we show that the widely used benchmark tieredImageNet is strongly biased towards tasks composed of very semantically dissimilar classes, e.g. bathtub, cabbage, pizza, schipperke, and cardoon. This makes tieredImageNet (and similar benchmarks) irrelevant for evaluating the ability of a model to solve real-life use cases, which usually involve more fine-grained classification. We mitigate this bias using semantic information about the classes of tieredImageNet and generate an improved, balanced benchmark. Going further, we also introduce a new benchmark for Few-Shot Image Classification using the Danish Fungi 2020 dataset. This benchmark offers a wide variety of evaluation tasks with varying degrees of granularity. Moreover, it includes many-way tasks (e.g. composed of 100 classes), a challenging setting that is nonetheless very common in industrial applications. Our experiments bring out the correlation between the difficulty of a task and the semantic similarity between its classes, as well as a heavy performance drop of state-of-the-art methods on many-way few-shot classification, raising questions about the scaling abilities of these methods. We hope that our work will encourage the community to further question the quality of standard evaluation processes and their relevance to real-life applications.
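
To make the idea of semantic task sampling concrete, here is a minimal sketch in Python. It is not the procedure used in the paper, whose details are not given in this abstract. It assumes a toy pairwise semantic distance between classes (1 within a group, 4 across groups), and every name in it (`build_toy_distances`, `task_coarsity`, `sample_semantic_task`, the `alpha` parameter) is hypothetical. It compares uniform class sampling with a sampler that favors classes semantically close to those already chosen, and measures each task by its mean pairwise class distance.

```python
import itertools
import math
import random


def build_toy_distances(n_groups=4, group_size=5, near=1.0, far=4.0):
    """Toy semantic distances (an assumption): `near` inside a group, `far` across groups."""
    n = n_groups * group_size
    return [
        [0.0 if a == b else (near if a // group_size == b // group_size else far)
         for b in range(n)]
        for a in range(n)
    ]


def task_coarsity(classes, dist):
    """Mean pairwise semantic distance between the classes of one task."""
    pairs = list(itertools.combinations(classes, 2))
    return sum(dist[a][b] for a, b in pairs) / len(pairs)


def sample_uniform_task(dist, n_way, rng):
    """Standard sampling: draw n_way distinct classes uniformly at random."""
    return rng.sample(range(len(dist)), n_way)


def sample_semantic_task(dist, n_way, alpha, rng):
    """Draw classes one by one, favoring those close to the classes already chosen.

    A candidate's weight is exp(-alpha * mean distance to the current task),
    so a larger alpha yields more fine-grained tasks.
    """
    task = [rng.randrange(len(dist))]
    while len(task) < n_way:
        candidates = [c for c in range(len(dist)) if c not in task]
        weights = [
            math.exp(-alpha * sum(dist[c][t] for t in task) / len(task))
            for c in candidates
        ]
        task.append(rng.choices(candidates, weights=weights, k=1)[0])
    return task


if __name__ == "__main__":
    rng = random.Random(0)
    dist = build_toy_distances()
    uniform = [task_coarsity(sample_uniform_task(dist, 5, rng), dist) for _ in range(200)]
    semantic = [task_coarsity(sample_semantic_task(dist, 5, 2.0, rng), dist) for _ in range(200)]
    print("mean coarsity, uniform sampling: ", sum(uniform) / len(uniform))
    print("mean coarsity, semantic sampling:", sum(semantic) / len(semantic))
```

In this toy setup, uniformly sampled tasks mostly mix classes from different groups, while the semantic sampler tends to stay within one group. This mirrors the contrast the abstract draws between coarse tasks and fine-grained ones. With a real dataset, the distance matrix would have to come from an actual class hierarchy or taxonomy.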


