When does dough become a bagel? Analyzing the remaining mistakes on ImageNet

05/09/2022
by   Vijay Vasudevan, et al.
11

Image classification accuracy on the ImageNet dataset has been a barometer for progress in computer vision over the last decade. Several recent papers have questioned the degree to which the benchmark remains useful to the community, yet innovations continue to contribute gains to performance, with today's largest models achieving 90 progress on ImageNet and provide a more meaningful evaluation for today's state-of-the-art models, we manually review and categorize every remaining mistake that a few top models make in order to provide insight into the long-tail of errors on one of the most benchmarked datasets in computer vision. We focus on the multi-label subset evaluation of ImageNet, where today's best models achieve upwards of 97 half of the supposed mistakes are not mistakes at all, and we uncover new valid multi-labels, demonstrating that, without careful review, we are significantly underestimating the performance of these models. On the other hand, we also find that today's best models still make a significant number of mistakes (40 that are obviously wrong to human reviewers. To calibrate future progress on ImageNet, we provide an updated multi-label evaluation set, and we curate ImageNet-Major: a 68-example "major error" slice of the obvious mistakes made by today's top models – a slice where models should achieve near perfection, but today are far from doing so.

READ FULL TEXT

page 19

page 20

page 21

page 23

page 24

page 25

page 26

page 28

research
06/12/2020

Are we done with ImageNet?

Yes, and no. We ask whether recent progress on the ImageNet classificati...
research
01/13/2021

Re-labeling ImageNet: from Single to Multi-Labels, from Global to Localized Labels

ImageNet has been arguably the most popular image classification benchma...
research
11/23/2021

Multi-label Iterated Learning for Image Classification with Label Ambiguity

Transfer learning from large-scale pre-trained models has become essenti...
research
04/11/2019

Elucidating image-to-set prediction: An analysis of models, losses and datasets

In recent years, we have experienced a flurry of contributions in the mu...
research
09/11/2017

Why Do Deep Neural Networks Still Not Recognize These Images?: A Qualitative Analysis on Failure Cases of ImageNet Classification

In a recent decade, ImageNet has become the most notable and powerful be...
research
07/17/2019

Robustness properties of Facebook's ResNeXt WSL models

We investigate the robustness properties of ResNeXt image recognition mo...
research
12/22/2019

Analyzing ImageNet with Spectral Relevance Analysis: Towards ImageNet un-Hans'ed

Today's machine learning models for computer vision are typically traine...

Please sign up or login with your details

Forgot password? Click here to reset