Zoom is what you need: An empirical study of the power of zoom and spatial biases in image classification

04/11/2023
by   Mohammad Reza Taesiri, et al.
0

Image classifiers are information-discarding machines, by design. Yet, how these models discard information remains mysterious. We hypothesize that one way for image classifiers to reach high accuracy is to first zoom to the most discriminative region in the image and then extract features from there to predict image labels. We study six popular networks ranging from AlexNet to CLIP and find that proper framing of the input image can lead to the correct classification of 98.91 potential and limits of zoom transforms in image classification and uncover positional biases in various datasets, especially a strong center bias in two popular datasets: ImageNet-A and ObjectNet. Finally, leveraging our insights into the potential of zoom, we propose a state-of-the-art test-time augmentation (TTA) technique that improves classification accuracy by forcing models to explicitly perform zoom-in operations before making predictions. Our method is more interpretable, accurate, and faster than MEMO, a state-of-the-art TTA method. Additionally, we propose ImageNet-Hard, a new benchmark where zooming in alone often does not help state-of-the-art models better label images.

READ FULL TEXT

page 24

page 25

page 30

page 31

page 32

page 33

page 41

page 42

research
08/10/2017

Analysis of Convolutional Neural Networks for Document Image Classification

Convolutional Neural Networks (CNNs) are state-of-the-art models for doc...
research
09/14/2022

DASH: Visual Analytics for Debiasing Image Classification via User-Driven Synthetic Data Augmentation

Image classification models often learn to predict a class based on irre...
research
03/19/2020

Overinterpretation reveals image classification model pathologies

Image classifiers are typically scored on their test set accuracy, but h...
research
05/22/2020

From ImageNet to Image Classification: Contextualizing Progress on Benchmarks

Building rich machine learning datasets in a scalable manner often neces...
research
11/15/2022

Scalar Invariant Networks with Zero Bias

Just like weights, bias terms are the learnable parameters of many popul...
research
05/20/2019

Testing Deep Neural Network based Image Classifiers

Image classification is an important task in today's world with many app...
research
06/30/2020

Classification Confidence Estimation with Test-Time Data-Augmentation

Machine learning plays an increasingly significant role in many aspects ...

Please sign up or login with your details

Forgot password? Click here to reset