Robust Generalization against Photon-Limited Corruptions via Worst-Case Sharpness Minimization

03/23/2023
by Zhuo Huang, et al.

Robust generalization aims to handle the most challenging data distributions, which are rare in the training set and contain severe noise, i.e., photon-limited corruptions. Common solutions such as distributionally robust optimization (DRO) focus on the worst-case empirical risk to ensure low training error on the uncommon noisy distributions. However, because an over-parameterized model is optimized on scarce worst-case data, DRO fails to produce a smooth loss landscape and therefore struggles to generalize to the test set. Instead of minimizing the worst-case risk, we propose SharpDRO, which penalizes the sharpness of the worst-case distribution, where sharpness measures how much the loss changes in the neighborhood of the learned parameters. Through worst-case sharpness minimization, the proposed method produces a flat loss landscape on the corrupted distributions and thus achieves robust generalization. Moreover, depending on whether distribution annotations are available, we apply SharpDRO to two problem settings and design a worst-case selection process for robust generalization. Theoretically, we show that SharpDRO comes with a convergence guarantee. Experimentally, we simulate photon-limited corruptions using the CIFAR10/100 and ImageNet30 datasets and show that SharpDRO generalizes well under severe corruptions and outperforms well-known baseline methods by large margins.
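The idea of penalizing sharpness on the worst-case distribution can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a toy linear model with logistic loss, group (distribution) annotations given as a list of (X, y) pairs, a SAM-style sharpness estimate with a hypothetical perturbation radius `rho`, and worst-case selection by highest group loss. All function names and parameters are illustrative.

```python
import numpy as np


def loss_and_grad(w, X, y):
    """Mean logistic loss and gradient for a linear model; labels y are in {-1, +1}."""
    margins = y * (X @ w)
    loss = np.mean(np.logaddexp(0.0, -margins))  # stable log(1 + exp(-m))
    # sigmoid(-m) written with tanh to avoid overflow in exp
    sig_neg = 0.5 * (1.0 - np.tanh(margins / 2.0))
    grad = X.T @ (-y * sig_neg) / len(y)
    return loss, grad


def sharpness(w, X, y, rho=0.05):
    """SAM-style estimate (an assumption): loss rise after an ascent step of norm rho."""
    loss, grad = loss_and_grad(w, X, y)
    eps = rho * grad / (np.linalg.norm(grad) + 1e-12)
    perturbed_loss, perturbed_grad = loss_and_grad(w + eps, X, y)
    return loss, perturbed_loss - loss, perturbed_grad


def worst_case_sharpness_step(w, groups, lr=0.1, rho=0.05):
    """Pick the group with the highest loss, then descend on its perturbed loss.

    The perturbed loss equals loss + sharpness, so this step penalizes
    sharpness on the selected worst-case group only.
    """
    stats = [sharpness(w, X, y, rho) for X, y in groups]
    worst = int(np.argmax([s[0] for s in stats]))
    perturbed_grad = stats[worst][2]
    return w - lr * perturbed_grad, worst
```

Calling `worst_case_sharpness_step` repeatedly on a list of per-distribution batches returns the updated weights and the index of the group selected at each step. When distribution annotations are unavailable, the selection rule would need to be replaced by some other worst-case criterion, which this sketch does not cover.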

research
10/24/2022

Sharpness-aware Minimization for Worst Case Optimization

Improvement of worst group performance and generalization performance ar...
research
03/02/2020

Out-of-Distribution Generalization via Risk Extrapolation (REx)

Generalizing outside of the training distribution is an open challenge f...
research
09/05/2022

Learning from a Biased Sample

The empirical risk minimization approach to data-driven decision making ...
research
11/09/2019

How bad is worst-case data if you know where it comes from?

We introduce a framework for studying how distributional assumptions on ...
research
05/22/2019

Learning Robust Options by Conditional Value at Risk Optimization

Options are generally learned by using an inaccurate environment model (...
research
07/04/2017

Robust Optimization for Non-Convex Objectives

We consider robust optimization problems, where the goal is to optimize ...
research
04/09/2022

The Two Dimensions of Worst-case Training and the Integrated Effect for Out-of-domain Generalization

Training with an emphasis on "hard-to-learn" components of the data has ...
