On Feature Learning in the Presence of Spurious Correlations

10/20/2022
by Pavel Izmailov, et al.

Deep classifiers are known to rely on spurious features: patterns that are correlated with the target on the training data but not inherently relevant to the learning problem, such as image backgrounds when classifying foreground objects. In this paper, we evaluate how much information about the core (non-spurious) features can be decoded from the representations learned by standard empirical risk minimization (ERM) and by specialized group robustness training. Following recent work on Deep Feature Reweighting (DFR), we evaluate the feature representations by re-training the last layer of the model on a held-out set where the spurious correlation is broken. On multiple vision and NLP problems, we show that the features learned by simple ERM are highly competitive with the features learned by specialized group robustness methods targeted at reducing the effect of spurious correlations. Moreover, we show that the quality of the learned feature representations is greatly affected by design decisions beyond the training method, such as the model architecture and pre-training strategy. On the other hand, we find that strong regularization is not necessary for learning high-quality feature representations. Finally, using insights from our analysis, we significantly improve upon the best results reported in the literature on the popular Waterbirds, CelebA hair color prediction, and WILDS-FMOW problems, achieving 97
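The evaluation protocol described above (freeze the features, re-train only the last layer on held-out data where the spurious correlation is broken, then measure worst-group accuracy) can be illustrated with a minimal sketch. This is not the authors' implementation. The two-dimensional "features" are synthetic stand-ins for a frozen encoder's output, the group sizes and noise levels are arbitrary assumptions, and the last layer is fit with plain gradient-descent logistic regression. The names `make_split`, `fit_last_layer`, and `worst_group_accuracy` are hypothetical.

```python
import numpy as np

def make_split(group_sizes, rng):
    """Simulate frozen features [core, spurious] per (label, attribute) group.

    Assumption: the core feature is noisy but truly predictive of the label,
    while the spurious feature is clean and tracks the attribute.
    """
    feats, labels, attrs = [], [], []
    for (y, a), n in group_sizes.items():
        core = (2 * y - 1) + rng.normal(0.0, 1.0, n)
        spur = (2 * a - 1) + rng.normal(0.0, 0.1, n)
        feats.append(np.stack([core, spur], axis=1))
        labels.append(np.full(n, y))
        attrs.append(np.full(n, a))
    return np.concatenate(feats), np.concatenate(labels), np.concatenate(attrs)

def fit_last_layer(x, y, steps=500, lr=0.5):
    """Fit a logistic-regression head with full-batch gradient descent."""
    w, b = np.zeros(x.shape[1]), 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(x @ w + b)))
        w -= lr * (x.T @ (p - y)) / len(y)
        b -= lr * np.mean(p - y)
    return w, b

def worst_group_accuracy(w, b, x, y, a):
    """Minimum accuracy over the four (label, attribute) groups."""
    pred = (x @ w + b > 0).astype(int)
    return min(
        np.mean(pred[(y == yy) & (a == aa)] == yy)
        for yy in (0, 1)
        for aa in (0, 1)
    )

rng = np.random.default_rng(0)

# Training split: the attribute agrees with the label for 95% of examples.
train = make_split({(0, 0): 950, (0, 1): 50, (1, 0): 50, (1, 1): 950}, rng)

# Held-out and test splits are group-balanced, so the correlation is broken.
held_out = make_split({(0, 0): 500, (0, 1): 500, (1, 0): 500, (1, 1): 500}, rng)
test = make_split({(0, 0): 1000, (0, 1): 1000, (1, 0): 1000, (1, 1): 1000}, rng)

# Same features, two heads: one fit on correlated data, one refit on balanced data.
w_erm, b_erm = fit_last_layer(train[0], train[1])
w_dfr, b_dfr = fit_last_layer(held_out[0], held_out[1])

wga_erm = worst_group_accuracy(w_erm, b_erm, *test)
wga_dfr = worst_group_accuracy(w_dfr, b_dfr, *test)
print(f"worst-group accuracy, head fit on correlated data: {wga_erm:.3f}")
print(f"worst-group accuracy, head refit on balanced data: {wga_dfr:.3f}")
```

In this toy setting both heads see identical features, so any gap in worst-group accuracy comes from the last layer alone. That is the sense in which re-training the last layer probes how much core-feature information the representation already contains.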

research
12/20/2013

Learned versus Hand-Designed Feature Representations for 3d Agglomeration

For image recognition and labeling tasks, recent results suggest that ma...
research
06/19/2023

Simple and Fast Group Robustness by Automatic Feature Reweighting

A major challenge to out-of-distribution generalization is reliance on s...
research
10/20/2022

Freeze then Train: Towards Provable Representation Learning under Spurious Correlations and Feature Noise

The existence of spurious correlations such as image backgrounds in the ...
research
04/22/2023

Towards Understanding Feature Learning in Out-of-Distribution Generalization

A common explanation for the failure of out-of-distribution (OOD) genera...
research
05/25/2023

Feature Collapse

We formalize and study a phenomenon called feature collapse that makes p...
research
03/12/2019

Activation Analysis of a Byte-Based Deep Neural Network for Malware Classification

Feature engineering is one of the most costly aspects of developing effe...
research
06/10/2021

Curiously Effective Features for Image Quality Prediction

The performance of visual quality prediction models is commonly assumed ...
