Seasoning Model Soups for Robustness to Adversarial and Natural Distribution Shifts

02/20/2023
by   Francesco Croce, et al.
0

Adversarial training is widely used to make classifiers robust to a specific threat or adversary, such as ℓ_p-norm bounded perturbations of a given p-norm. However, existing methods for training classifiers robust to multiple threats require knowledge of all attacks during training and remain vulnerable to unseen distribution shifts. In this work, we describe how to obtain adversarially-robust model soups (i.e., linear combinations of parameters) that smoothly trade-off robustness to different ℓ_p-norm bounded adversaries. We demonstrate that such soups allow us to control the type and level of robustness, and can achieve robustness to all threats without jointly training on all of them. In some cases, the resulting model soups are more robust to a given ℓ_p-norm adversary than the constituent model specialized against that same adversary. Finally, we show that adversarially-robust model soups can be a viable tool to adapt to distribution shifts from a few examples.

READ FULL TEXT

page 5

page 6

page 7

page 12

page 13

research
05/24/2023

Robust Classification via a Single Diffusion Model

Recently, diffusion models have been successfully applied to improving a...
research
10/02/2019

ROMark: A Robust Watermarking System Using Adversarial Training

The availability and easy access to digital communication increase the r...
research
09/09/2019

Adversarial Robustness Against the Union of Multiple Perturbation Models

Owing to the susceptibility of deep learning systems to adversarial atta...
research
02/02/2023

On the Robustness of Randomized Ensembles to Adversarial Perturbations

Randomized ensemble classifiers (RECs), where one classifier is randomly...
research
10/22/2018

Cost-Sensitive Robustness against Adversarial Examples

Several recent works have developed methods for training classifiers tha...
research
06/12/2019

A Stratified Approach to Robustness for Randomly Smoothed Classifiers

Strong theoretical guarantees of robustness can be given for ensembles o...

Please sign up or login with your details

Forgot password? Click here to reset