Just Mix Once: Worst-group Generalization by Group Interpolation

10/21/2022
by   Giorgio Giannone, et al.

Advances in deep learning theory have revealed how average generalization relies on superficial patterns in data. The result is brittle models that perform poorly when the group distribution shifts at test time. When group annotations are available, robust optimization tools can address the problem. However, identifying and annotating groups is time-consuming, especially on large datasets. A recent line of work leverages self-supervision and oversampling to improve generalization on minority groups without group annotation. We propose to unify and generalize these approaches using a class-conditional variant of mixup tailored for worst-group generalization. Our approach, Just Mix Once (JM1), interpolates samples during learning, augmenting the training distribution with a continuous mixture of groups. JM1 is domain agnostic and computationally efficient, can be used with any level of group annotation, and performs on par with or better than the state of the art on worst-group generalization. Additionally, we provide a simple explanation of why JM1 works.
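
To make the idea of class-conditional interpolation concrete, here is a minimal sketch of one possible reading: each sample is mixed with a randomly chosen sample of the same class, so the label stays valid while group-specific features are blended. This is an illustration only, not the authors' implementation. The function name `class_conditional_mixup`, the `alpha` parameter, and the Beta-distributed mixing coefficient (borrowed from standard mixup) are all assumptions; the abstract does not specify how partners are chosen or how the coefficient is sampled.

```python
import numpy as np

def class_conditional_mixup(x, y, alpha=1.0, rng=None):
    """Mix each sample with a random sample of the same class (hypothetical sketch).

    x: features of shape (n, ...); y: integer class labels of shape (n,).
    alpha: Beta(alpha, alpha) parameter, an assumed default as in standard mixup.
    Returns mixed features with the same shape as x. Labels are unchanged
    because both endpoints of every interpolation share a class.
    """
    rng = np.random.default_rng() if rng is None else rng
    x = np.asarray(x, dtype=float)
    y = np.asarray(y)

    # Pick a partner for every sample by shuffling indices within each class.
    partner = np.arange(len(y))
    for c in np.unique(y):
        idx = np.flatnonzero(y == c)
        partner[idx] = rng.permutation(idx)

    # One mixing coefficient per sample, broadcast over the feature dimensions.
    lam = rng.beta(alpha, alpha, size=len(y))
    lam = lam.reshape((-1,) + (1,) * (x.ndim - 1))

    # Convex combination of each sample with its same-class partner.
    return lam * x + (1.0 - lam) * x[partner]
```

In a training loop, a call such as `x_mixed = class_conditional_mixup(x_batch, y_batch)` would replace the raw batch, with `y_batch` passed to the loss unchanged. When same-class partners come from different groups, the mixed batch contains points between those groups, which is one way to read "a continuous mixture of groups".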


research
05/20/2023

Modeling the Q-Diversity in a Min-max Play Game for Robust Optimization

Models trained with empirical risk minimization (ERM) are revealed to ea...
research
12/31/2021

BARACK: Partially Supervised Group Robustness With Guarantees

While neural networks have shown remarkable success on classification ta...
research
11/20/2019

Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-Case Generalization

Overparameterized neural networks can be highly accurate on average on a...
research
10/06/2021

Focus on the Common Good: Group Distributional Robustness Follows

We consider the problem of training a classification model with group an...
research
10/24/2022

Sharpness-aware Minimization for Worst Case Optimization

Improvement of worst group performance and generalization performance ar...
research
02/25/2021

An Online Learning Approach to Interpolation and Extrapolation in Domain Generalization

A popular assumption for out-of-distribution generalization is that the ...
research
02/15/2022

Learning to Solve Routing Problems via Distributionally Robust Optimization

Recent deep models for solving routing problems always assume a single d...
