Causal ImageNet: How to discover spurious features in Deep Learning?

10/08/2021
by Sahil Singla, et al.

A key reason for the lack of reliability of deep neural networks in the real world is their heavy reliance on spurious input features that are causally unrelated to the true label. Focusing on image classification, we define causal attributes as the set of visual features that are always a part of the object, while spurious attributes are the ones that are likely to co-occur with the object but are not a part of it (e.g., the attribute "fingers" for the class "band aid"). Traditional methods for discovering spurious features either require extensive human annotations (and thus do not scale) or are useful only for specific models. In this work, we introduce a scalable framework to discover a subset of spurious and causal visual attributes used in the inferences of a general model and to localize them on a large number of images with minimal human supervision. Our methodology is based on one key idea: to identify spurious or causal visual attributes used in model predictions, we identify spurious or causal neural features (penultimate-layer neurons of a robust model) via limited human supervision (e.g., using the top 5 activating images per feature). We then show that these neural feature annotations generalize extremely well to many more images without any human supervision. We use the activation maps for these neural features as soft masks to highlight spurious or causal visual attributes. Using this methodology, we introduce the Causal ImageNet dataset, containing causal and spurious masks for a large set of samples from ImageNet. We assess the performance of several popular ImageNet models and show that they rely heavily on various spurious features in their predictions.
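As a rough illustration of the two steps described above (selecting the top activating images for a neural feature, and turning that feature's activation map into a soft mask), here is a minimal NumPy sketch. It is not the authors' implementation: the function names `top_k_images` and `soft_mask`, the array shapes, the min-max normalization, and the nearest-neighbor upsampling are all assumptions made for illustration.

```python
import numpy as np

def top_k_images(activations, feature_idx, k=5):
    """Return indices of the k images that most strongly activate one feature.

    activations: (N, C) array of penultimate-layer features for N images
    (assumed layout, hypothetical helper).
    """
    order = np.argsort(-activations[:, feature_idx])  # descending by activation
    return order[:k]

def soft_mask(feature_maps, feature_idx, out_size):
    """Build a soft mask in [0, 1] from one channel of a feature-map tensor.

    feature_maps: (C, h, w) array for a single image (assumed layout).
    out_size: (H, W) of the input image.
    Normalization and nearest-neighbor upsampling are illustrative choices.
    """
    fmap = feature_maps[feature_idx].astype(np.float64)
    fmap = fmap - fmap.min()          # shift so the minimum is 0
    peak = fmap.max()
    if peak > 0:
        fmap = fmap / peak            # scale so the maximum is 1
    H, W = out_size
    h, w = fmap.shape
    rows = np.arange(H) * h // H      # nearest-neighbor row lookup
    cols = np.arange(W) * w // W      # nearest-neighbor column lookup
    return fmap[np.ix_(rows, cols)]

# Toy usage: 3 images, 2 neural features.
acts = np.array([[0.1, 0.9], [0.5, 0.2], [0.3, 0.7]])
top = top_k_images(acts, feature_idx=1, k=2)

# Toy 2x2 feature map for feature 1, upsampled to a 4x4 mask.
fmaps = np.zeros((2, 2, 2))
fmaps[1] = [[0.0, 1.0], [2.0, 4.0]]
mask = soft_mask(fmaps, feature_idx=1, out_size=(4, 4))
```

In this sketch, a human would inspect the images indexed by `top` to label the feature as causal or spurious, and `mask` could then be overlaid on an image to highlight the corresponding visual attribute.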


research
03/23/2021

Extracting Causal Visual Features for Limited label Classification

Neural networks trained to classify images do so by identifying features...
research
04/19/2015

DEEP-CARVING: Discovering Visual Attributes by Carving Deep Neural Nets

Most of the approaches for discovering visual attributes in images deman...
research
01/26/2022

A Comprehensive Study of Image Classification Model Sensitivity to Foregrounds, Backgrounds, and Visual Attributes

While datasets with single-label supervision have propelled rapid advanc...
research
03/28/2022

Core Risk Minimization using Salient ImageNet

Deep neural networks can be unreliable in the real world especially when...
research
12/09/2022

Spurious Features Everywhere – Large-Scale Detection of Harmful Spurious Features in ImageNet

Benchmark performance of deep learning classifiers alone is not a reliab...
research
07/25/2016

Automatic Attribute Discovery with Neural Activations

How can a machine learn to recognize visual attributes emerging out of o...
research
07/07/2023

Discovering Variable Binding Circuitry with Desiderata

Recent work has shown that computation in language models may be human-u...
