Spurious Features Everywhere – Large-Scale Detection of Harmful Spurious Features in ImageNet

12/09/2022
by   Yannic Neuhaus, et al.

Benchmark performance alone is not a reliable predictor of how a deep learning classifier will perform once deployed. In particular, if an image classifier has picked up spurious features in the training data, its predictions can fail in unexpected ways. In this paper, we develop a framework that allows us to systematically identify spurious features in large datasets like ImageNet. The framework is based on our neural PCA components and their visualization. Previous work on spurious features of image classifiers often operates in toy settings or requires costly pixel-wise annotations. In contrast, we validate our results by checking that the presence of the harmful spurious feature of a class is sufficient to trigger the prediction of that class. We introduce a novel dataset, "Spurious ImageNet", and use it to measure how much existing classifiers rely on spurious features.
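
The abstract does not say how the neural PCA components are computed. As a rough illustration only, the sketch below assumes they come from ordinary PCA applied to the penultimate-layer features of one class's training images, and that the images scoring highest on a component are then inspected visually. The function names `class_feature_pca` and `top_activating_images` are hypothetical, and the feature matrix is synthetic rather than taken from a real classifier.

```python
import numpy as np


def class_feature_pca(features, n_components=5):
    """PCA of an (n_images, n_dims) feature matrix for a single class.

    Assumption: `features` holds penultimate-layer activations of a
    classifier for the training images of one class.
    """
    features = np.asarray(features, dtype=np.float64)
    mean = features.mean(axis=0)
    centered = features - mean
    # SVD of the centered data: rows of vt are the principal directions,
    # ordered by decreasing singular value.
    _, singular_values, vt = np.linalg.svd(centered, full_matrices=False)
    components = vt[:n_components]
    variance = singular_values[:n_components] ** 2 / (len(features) - 1)
    # Projection of every image onto each retained component.
    scores = centered @ components.T
    return mean, components, variance, scores


def top_activating_images(scores, component, k=5):
    """Indices of the k images with the highest score on one component."""
    return np.argsort(scores[:, component])[::-1][:k]


# Synthetic stand-in for real features: 200 images, 16 feature dimensions,
# with one artificially dominant direction (dimension 3).
rng = np.random.default_rng(0)
fake_features = rng.normal(size=(200, 16))
fake_features[:, 3] *= 10.0

mean, components, variance, scores = class_feature_pca(fake_features, n_components=3)
top = top_activating_images(scores, component=0, k=5)
```

In this toy setting the first component aligns with the inflated dimension, and `top` lists the images a human would look at to judge whether that direction corresponds to the class object or to a spurious feature. The sign of a principal direction is arbitrary, so in practice both ends of the score range may need inspection.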


