An Investigation of Why Overparameterization Exacerbates Spurious Correlations

05/09/2020
by Shiori Sagawa, et al.

We study why overparameterization (increasing model size well beyond the point of zero training error) can hurt test error on minority groups despite improving average test error when there are spurious correlations in the data. Through simulations and experiments on two image datasets, we identify two key properties of the training data that drive this behavior: the proportions of majority versus minority groups, and the signal-to-noise ratio of the spurious correlations. We then analyze a linear setting and show theoretically how the inductive bias of models towards "memorizing" fewer examples can cause overparameterization to hurt. Our analysis leads to a counterintuitive approach of subsampling the majority group, which empirically achieves low minority error in the overparameterized regime, even though the standard approach of upweighting the minority fails. Overall, our results suggest a tension between using overparameterized models and using all the training data when the goal is low worst-group error.
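
To make the contrast between the two data-balancing strategies concrete, here is a minimal sketch in Python with NumPy. It is not the authors' code. It assumes that subsampling reduces every group to the size of the smallest group, and that upweighting assigns each example a weight inversely proportional to its group's size. The function names `subsample_majority` and `upweight_minority` are hypothetical, chosen for illustration only.

```python
import numpy as np


def subsample_majority(groups, seed=0):
    """Return sorted indices that keep an equal number of examples per group.

    Assumption: every group is cut down to the size of the smallest group,
    so majority-group examples are discarded at random.
    """
    groups = np.asarray(groups)
    rng = np.random.default_rng(seed)
    labels, counts = np.unique(groups, return_counts=True)
    target = counts.min()
    keep = [
        rng.choice(np.flatnonzero(groups == g), size=target, replace=False)
        for g in labels
    ]
    return np.sort(np.concatenate(keep))


def upweight_minority(groups):
    """Return one weight per example, inversely proportional to group size.

    All examples are kept; each group contributes the same total weight.
    """
    groups = np.asarray(groups)
    labels, inverse, counts = np.unique(
        groups, return_inverse=True, return_counts=True
    )
    per_group = len(groups) / (len(labels) * counts)
    return per_group[inverse]


# Toy group labels: 90 majority examples (group 0), 10 minority (group 1).
toy_groups = np.array([0] * 90 + [1] * 10)
kept_indices = subsample_majority(toy_groups)
example_weights = upweight_minority(toy_groups)
```

Both strategies balance the groups' total contribution to the training objective. The difference is that subsampling shrinks the training set (here from 100 examples to 20), while upweighting keeps every example and rescales the loss instead.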
