Outlier Robust Adversarial Training

09/10/2023
by   Shu Hu, et al.
1

Supervised learning models are challenged by the intrinsic complexities of training data such as outliers and minority subpopulations and intentional attacks at inference time with adversarial samples. While traditional robust learning methods and the recent adversarial training approaches are designed to handle each of the two challenges, to date, no work has been done to develop models that are robust with regard to the low-quality training data and the potential adversarial attack at inference time simultaneously. It is for this reason that we introduce Outlier Robust Adversarial Training (ORAT) in this work. ORAT is based on a bi-level optimization formulation of adversarial training with a robust rank-based loss function. Theoretically, we show that the learning objective of ORAT satisfies the ℋ-consistency in binary classification, which establishes it as a proper surrogate to adversarial 0/1 loss. Furthermore, we analyze its generalization ability and provide uniform convergence rates in high probability. ORAT can be optimized with a simple algorithm. Experimental evaluations on three benchmark datasets demonstrate the effectiveness and robustness of ORAT in handling outliers and adversarial attacks. Our code is available at https://github.com/discovershu/ORAT.

READ FULL TEXT

page 2

page 11

page 12

page 32

page 33

research
06/27/2023

DSRM: Boost Textual Adversarial Training with Distribution Shift Risk Minimization

Adversarial training is one of the best-performing methods in improving ...
research
07/14/2023

Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation

It is imperative to ensure the robustness of deep learning models in cri...
research
06/18/2022

The Consistency of Adversarial Training for Binary Classification

Robustness to adversarial perturbations is of paramount concern in moder...
research
11/01/2022

Adversarial Training with Complementary Labels: On the Benefit of Gradually Informative Attacks

Adversarial training (AT) with imperfect supervision is significant but ...
research
09/27/2022

Inducing Data Amplification Using Auxiliary Datasets in Adversarial Training

Several recent studies have shown that the use of extra in-distribution ...
research
06/18/2022

Existence and Minimax Theorems for Adversarial Surrogate Risks in Binary Classification

Adversarial training is one of the most popular methods for training met...
research
04/25/2022

A Simple Structure For Building A Robust Model

As deep learning applications, especially programs of computer vision, a...

Please sign up or login with your details

Forgot password? Click here to reset