Core Risk Minimization using Salient ImageNet

03/28/2022
by   Sahil Singla, et al.
0

Deep neural networks can be unreliable in the real world especially when they heavily use spurious features for their predictions. Recently, Singla Feizi (2022) introduced the Salient Imagenet dataset by annotating and localizing core and spurious features of  52k samples from 232 classes of Imagenet. While this dataset is useful for evaluating the reliance of pretrained models on spurious features, its small size limits its usefulness for training models. In this work, we first introduce the Salient Imagenet-1M dataset with more than 1 million soft masks localizing core and spurious features for all 1000 Imagenet classes. Using this dataset, we first evaluate the reliance of several Imagenet pretrained models (42 total) on spurious features and observe that: (i) transformers are more sensitive to spurious features compared to Convnets, (ii) zero-shot CLIP transformers are highly susceptible to spurious features. Next, we introduce a new learning paradigm called Core Risk Minimization (CoRM) whose objective ensures that the model predicts a class using its core features. We evaluate different computational approaches for solving CoRM and achieve significantly higher (+12 corrupted using noise) with no drop in clean accuracy compared to models trained via Empirical Risk Minimization.

READ FULL TEXT

page 26

page 29

page 31

page 33

page 37

page 38

page 40

page 42

research
10/03/2019

An empirical study of pretrained representations for few-shot classification

Recent algorithms with state-of-the-art few-shot classification results ...
research
10/08/2021

Causal ImageNet: How to discover spurious features in Deep Learning?

A key reason for the lack of reliability of deep neural networks in the ...
research
03/17/2021

Large-Scale Zero-Shot Image Classification from Rich and Diverse Textual Descriptions

We study the impact of using rich and diverse textual descriptions of cl...
research
10/20/2022

Freeze then Train: Towards Provable Representation Learning under Spurious Correlations and Feature Noise

The existence of spurious correlations such as image backgrounds in the ...
research
03/31/2023

Exploring the Limits of Deep Image Clustering using Pretrained Models

We present a general methodology that learns to classify images without ...
research
01/26/2022

A Comprehensive Study of Image Classification Model Sensitivity to Foregrounds, Backgrounds, and Visual Attributes

While datasets with single-label supervision have propelled rapid advanc...
research
03/07/2023

CUDA: Convolution-based Unlearnable Datasets

Large-scale training of modern deep learning models heavily relies on pu...

Please sign up or login with your details

Forgot password? Click here to reset