Calibrated ensembles can mitigate accuracy tradeoffs under distribution shift

07/18/2022
by   Ananya Kumar, et al.
0

We often see undesirable tradeoffs in robust machine learning where out-of-distribution (OOD) accuracy is at odds with in-distribution (ID) accuracy: a robust classifier obtained via specialized techniques such as removing spurious features often has better OOD but worse ID accuracy compared to a standard classifier trained via ERM. In this paper, we find that ID-calibrated ensembles – where we simply ensemble the standard and robust models after calibrating on only ID data – outperforms prior state-of-the-art (based on self-training) on both ID and OOD accuracy. On eleven natural distribution shift datasets, ID-calibrated ensembles obtain the best of both worlds: strong ID accuracy and OOD accuracy. We analyze this method in stylized settings, and identify two important conditions for ensembles to perform well both ID and OOD: (1) we need to calibrate the standard and robust models (on ID data, because OOD data is unavailable), (2) OOD has no anticorrelated spurious features.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/15/2020

Neural Ensemble Search for Performant and Calibrated Predictions

Ensembles of neural networks achieve superior performance compared to st...
research
06/27/2022

Agreement-on-the-Line: Predicting the Performance of Neural Networks under Distribution Shift

Recently, Miller et al. showed that a model's in-distribution (ID) accur...
research
06/30/2021

Uncertainty-Aware Learning for Improvements in Image Quality of the Canada-France-Hawaii Telescope

We leverage state-of-the-art machine learning methods and a decade's wor...
research
03/28/2022

Understanding out-of-distribution accuracies through quantifying difficulty of test samples

Existing works show that although modern neural networks achieve remarka...
research
08/02/2023

Handling Communication via APIs for Microservices

Enterprises in their journey to the cloud, want to decompose their monol...
research
07/20/2022

Revisiting Hotels-50K and Hotel-ID

In this paper, we propose revisited versions for two recent hotel recogn...
research
02/17/2022

Data-SUITE: Data-centric identification of in-distribution incongruous examples

Systematic quantification of data quality is critical for consistent mod...

Please sign up or login with your details

Forgot password? Click here to reset