The Devil is in the Tails: Fine-grained Classification in the Wild

09/05/2017
by Grant Van Horn, et al.

The world is long-tailed. What does this mean for computer vision and visual recognition? The two main implications are (1) the number of categories we need to consider in applications can be very large, and (2) the number of training examples for most categories can be very small. Current visual recognition algorithms have achieved excellent classification accuracy. However, they require many training examples to reach peak performance, which suggests that they will handle long-tailed distributions poorly. We analyze this question in the context of eBird, a large fine-grained classification dataset, and a state-of-the-art deep network classification algorithm. We find that (a) peak classification performance on well-represented categories is excellent, (b) given enough data, classification performance suffers only minimally from an increase in the number of classes, (c) classification performance decays precipitously as the number of training examples decreases, and (d) surprisingly, transfer learning is virtually absent in current methods. Our findings suggest that our community should come to grips with the question of long tails.
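
To make the two implications concrete, here is a minimal sketch of a long-tailed class distribution. It assumes, purely for illustration, that per-class example counts fall off as a power law of class rank; the counts are synthetic and are not drawn from eBird, and the function names `long_tailed_counts` and `tail_summary` and the threshold of 100 examples are hypothetical choices rather than anything taken from the paper.

```python
def long_tailed_counts(num_classes, max_count, exponent=1.0):
    """Synthetic per-class example counts that decay as a power law of class rank.

    Assumption for illustration only: count(rank) ~ max_count / rank**exponent.
    """
    return [
        max(1, round(max_count / (rank ** exponent)))
        for rank in range(1, num_classes + 1)
    ]


def tail_summary(counts, few_shot_threshold):
    """Summarize how many classes, and how much data, fall below a threshold."""
    tail = [c for c in counts if c < few_shot_threshold]
    return {
        "num_classes": len(counts),
        # Share of classes that have fewer than `few_shot_threshold` examples.
        "tail_class_fraction": len(tail) / len(counts),
        # Share of all training examples that belong to those classes.
        "tail_example_fraction": sum(tail) / sum(counts),
    }


# Hypothetical setting: 1000 classes, the most common one has 10000 examples.
counts = long_tailed_counts(num_classes=1000, max_count=10000)
summary = tail_summary(counts, few_shot_threshold=100)
print(summary)
```

Under these assumed settings, most classes sit in the tail while holding only a minority of the examples, which is the regime the abstract describes: many categories, few training examples for most of them.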

research
08/30/2016

What makes ImageNet good for transfer learning?

The tremendous success of ImageNet-trained deep features on a wide range...
research
07/14/2019

FoodX-251: A Dataset for Fine-grained Food Classification

Food classification is a challenging problem due to the large number of ...
research
06/04/2023

CDLT: A Dataset with Concept Drift and Long-Tailed Distribution for Fine-Grained Visual Categorization

Data is the foundation for the development of computer vision, and the e...
research
03/31/2022

A 23 MW data centre is all you need

The field of machine learning has achieved striking progress in recent y...
research
06/30/2022

Out-of-Distribution Detection for Long-tailed and Fine-grained Skin Lesion Images

Recent years have witnessed a rapid development of automated methods for...
research
08/06/2018

CPlaNet: Enhancing Image Geolocalization by Combinatorial Partitioning of Maps

Image geolocalization is the task of identifying the location depicted i...
research
09/07/2021

Fair Comparison: Quantifying Variance in Results for Fine-grained Visual Categorization

For the task of image classification, researchers work arduously to deve...
