Optimal Transport Model Distributional Robustness

06/07/2023
by   Van-Anh Nguyen, et al.
0

Distributional robustness is a promising framework for training deep learning models that are less vulnerable to adversarial examples and data distribution shifts. Previous works have mainly focused on exploiting distributional robustness in data space. In this work, we explore an optimal transport-based distributional robustness framework on model spaces. Specifically, we examine a model distribution in a Wasserstein ball of a given center model distribution that maximizes the loss. We have developed theories that allow us to learn the optimal robust center model distribution. Interestingly, through our developed theories, we can flexibly incorporate the concept of sharpness awareness into training a single model, ensemble models, and Bayesian Neural Networks by considering specific forms of the center model distribution, such as a Dirac delta distribution over a single model, a uniform distribution over several models, and a general Bayesian Neural Network. Furthermore, we demonstrate that sharpness-aware minimization (SAM) is a specific case of our framework when using a Dirac delta distribution over a single model, while our framework can be viewed as a probabilistic extension of SAM. We conduct extensive experiments to demonstrate the usefulness of our framework in the aforementioned settings, and the results show remarkable improvements in our approaches to the baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/28/2022

Certifying Model Accuracy under Distribution Shifts

Certified robustness in machine learning has primarily focused on advers...
research
10/29/2017

Certifiable Distributional Robustness with Principled Adversarial Training

Neural networks are vulnerable to adversarial examples and researchers h...
research
02/27/2022

A Unified Wasserstein Distributional Robustness Framework for Adversarial Training

It is well-known that deep neural networks (DNNs) are susceptible to adv...
research
03/21/2023

OTJR: Optimal Transport Meets Optimal Jacobian Regularization for Adversarial Robustness

Deep neural networks are widely recognized as being vulnerable to advers...
research
05/12/2021

Autoregressive Optimal Transport Models

Series of distributions indexed by equally spaced time points are ubiqui...
research
03/01/2022

Global-Local Regularization Via Distributional Robustness

Despite superior performance in many situations, deep neural networks ar...
research
01/31/2023

Learning Against Distributional Uncertainty: On the Trade-off Between Robustness and Specificity

Trustworthy machine learning aims at combating distributional uncertaint...

Please sign up or login with your details

Forgot password? Click here to reset