Understanding the Failure Modes of Out-of-Distribution Generalization

10/29/2020
by Vaishnavh Nagarajan, et al.

Empirical studies suggest that machine learning models often rely on features, such as the background, that may be spuriously correlated with the label only at training time, resulting in poor accuracy at test time. In this work, we identify the fundamental factors that give rise to this behavior by explaining why models fail this way even in easy-to-learn tasks where one would expect them to succeed. In particular, through a theoretical study of gradient-descent-trained linear classifiers on some easy-to-learn tasks, we uncover two complementary failure modes. These modes arise from how spurious correlations induce two kinds of skew in the data: one geometric in nature and the other statistical in nature. Finally, we construct natural modifications of image classification datasets to understand when these failure modes can arise in practice. We also design experiments to isolate the two failure modes when training modern neural networks on these datasets.
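To make the setting concrete, here is a minimal Python sketch of a gradient-descent-trained linear classifier on a toy task with one fully predictive feature and one spurious feature. It is an illustration under stated assumptions, not the construction analyzed in the paper: the two-feature dataset, the 90% agreement rate, the feature scale of 2.0, the learning rate, and the function names `make_points`, `train`, and `accuracy` are all hypothetical choices made for this example.

```python
import math


def sigmoid(t):
    """Numerically stable logistic function."""
    if t >= 0:
        return 1.0 / (1.0 + math.exp(-t))
    e = math.exp(t)
    return e / (1.0 + e)


def make_points(p_agree, scale):
    """Weighted toy dataset (an assumption of this sketch, not from the paper).

    Feature 0 always equals the label y, so it alone separates the classes.
    Feature 1 is spurious: it equals y * scale on a fraction p_agree of the
    data and -y * scale on the rest. Each entry is (x, y, weight).
    """
    points = []
    for y in (+1, -1):
        points.append(((y * 1.0, y * scale), y, 0.5 * p_agree))
        points.append(((y * 1.0, -y * scale), y, 0.5 * (1.0 - p_agree)))
    return points


def train(points, steps, lr=0.1):
    """Full-batch gradient descent on the weighted logistic loss, from w = 0."""
    w = [0.0, 0.0]
    for _ in range(steps):
        grad = [0.0, 0.0]
        for x, y, weight in points:
            margin = y * (w[0] * x[0] + w[1] * x[1])
            coeff = -weight * sigmoid(-margin) * y
            grad[0] += coeff * x[0]
            grad[1] += coeff * x[1]
        w = [w[0] - lr * grad[0], w[1] - lr * grad[1]]
    return w


def accuracy(w, points):
    """Weighted fraction of points with a strictly positive margin."""
    return sum(
        weight
        for x, y, weight in points
        if y * (w[0] * x[0] + w[1] * x[1]) > 0
    )


if __name__ == "__main__":
    train_points = make_points(p_agree=0.9, scale=2.0)    # spurious feature mostly agrees
    shifted_points = make_points(p_agree=0.0, scale=2.0)  # correlation reversed at test time
    for steps in (10, 5000):
        w = train(train_points, steps)
        print(steps, w, accuracy(w, train_points), accuracy(w, shifted_points))
```

In this toy setup, a short run of gradient descent places positive weight on the spurious feature and misclassifies every point once the correlation is reversed, even though the first feature alone separates the classes. A much longer run recovers. This loosely echoes the statistical flavor of failure that the abstract mentions, and it does not attempt to reproduce the geometric one.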


