Wasserstein Distributional Robustness and Regularization in Statistical Learning

12/17/2017
by   Rui Gao, et al.
0

A central question in statistical learning is to design algorithms that not only perform well on training data, but also generalize to new and unseen data. In this paper, we tackle this question by formulating a distributionally robust stochastic optimization (DRSO) problem, which seeks a solution that minimizes the worst-case expected loss over a family of distributions that are close to the empirical distribution in Wasserstein distances. We establish a connection between such Wasserstein DRSO and regularization. More precisely, we identify a broad class of loss functions, for which the Wasserstein DRSO is asymptotically equivalent to a regularization problem with a gradient-norm penalty. Such relation provides new interpretations for problems involving regularization, including a great number of statistical learning problems and discrete choice models (e.g. multinomial logit). The connection suggests a principled way to regularize high-dimensional, non-convex problems. This is demonstrated through two applications: the training of Wasserstein generative adversarial networks (WGANs) in deep learning, and learning heterogeneous consumer preferences with mixed logit choice model.

READ FULL TEXT

page 18

page 20

research
10/27/2017

Regularization via Mass Transportation

The goal of regression and classification methods in supervised learning...
research
11/02/2021

Understanding Entropic Regularization in GANs

Generative Adversarial Networks are a popular method for learning distri...
research
09/24/2021

Sinkhorn Distributionally Robust Optimization

We study distributionally robust optimization with Sinkorn distance – a ...
research
06/08/2020

Distributional Robustness with IPMs and links to Regularization and GANs

Robustness to adversarial attacks is an important concern due to the fra...
research
09/09/2020

Finite-Sample Guarantees for Wasserstein Distributionally Robust Optimization: Breaking the Curse of Dimensionality

Wasserstein distributionally robust optimization (DRO) aims to find robu...
research
02/04/2023

Interpolation for Robust Learning: Data Augmentation on Geodesics

We propose to study and promote the robustness of a model as per its per...
research
11/24/2017

Wasserstein Introspective Neural Networks

We present Wasserstein introspective neural networks (WINN) that are bot...

Please sign up or login with your details

Forgot password? Click here to reset