Take One Gram of Neural Features, Get Enhanced Group Robustness

08/26/2022
by   Simon Roburin, et al.
3

Predictive performance of machine learning models trained with empirical risk minimization (ERM) can degrade considerably under distribution shifts. The presence of spurious correlations in training datasets leads ERM-trained models to display high loss when evaluated on minority groups not presenting such correlations. Extensive attempts have been made to develop methods improving worst-group robustness. However, they require group information for each training input or at least, a validation set with group labels to tune their hyperparameters, which may be expensive to get or unknown a priori. In this paper, we address the challenge of improving group robustness without group annotation during training or validation. To this end, we propose to partition the training dataset into groups based on Gram matrices of features extracted by an “identification” model and to apply robust optimization based on these pseudo-groups. In the realistic context where no group labels are available, our experiments show that our approach not only improves group robustness over ERM but also outperforms all recent baselines

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/19/2021

Just Train Twice: Improving Group Robustness without Training Group Information

Standard training via empirical risk minimization (ERM) can produce mode...
research
01/10/2022

Towards Group Robustness in the presence of Partial Group Labels

Learning invariant representations is an important requirement when trai...
research
04/05/2022

Spread Spurious Attribute: Improving Worst-group Accuracy with Spurious Attribute Estimation

The paradigm of worst-group loss minimization has shown its promise in a...
research
08/01/2023

Is Last Layer Re-Training Truly Sufficient for Robustness to Spurious Correlations?

Models trained with empirical risk minimization (ERM) are known to learn...
research
12/14/2022

Improving group robustness under noisy labels using predictive uncertainty

The standard empirical risk minimization (ERM) can underperform on certa...
research
05/20/2023

Modeling the Q-Diversity in a Min-max Play Game for Robust Optimization

Models trained with empirical risk minimization (ERM) are revealed to ea...
research
02/06/2023

Bitrate-Constrained DRO: Beyond Worst Case Robustness To Unknown Group Shifts

Training machine learning models robust to distribution shifts is critic...

Please sign up or login with your details

Forgot password? Click here to reset