Instance adaptive adversarial training: Improved accuracy tradeoffs in neural nets

10/17/2019
by   Yogesh Balaji, et al.
13

Adversarial training is by far the most successful strategy for improving robustness of neural networks to adversarial attacks. Despite its success as a defense mechanism, adversarial training fails to generalize well to unperturbed test set. We hypothesize that this poor generalization is a consequence of adversarial training with uniform perturbation radius around every training sample. Samples close to decision boundary can be morphed into a different class under a small perturbation budget, and enforcing large margins around these samples produce poor decision boundaries that generalize poorly. Motivated by this hypothesis, we propose instance adaptive adversarial training – a technique that enforces sample-specific perturbation margins around every training sample. We show that using our approach, test accuracy on unperturbed samples improve with a marginal drop in robustness. Extensive experiments on CIFAR-10, CIFAR-100 and Imagenet datasets demonstrate the effectiveness of our proposed approach.

READ FULL TEXT

page 4

page 12

page 13

research
08/30/2021

Adaptive perturbation adversarial training: based on reinforcement learning

Adversarial training has become the primary method to defend against adv...
research
07/24/2023

Adaptive Certified Training: Towards Better Accuracy-Robustness Tradeoffs

As deep learning models continue to advance and are increasingly utilize...
research
10/04/2022

Strength-Adaptive Adversarial Training

Adversarial training (AT) is proved to reliably improve network's robust...
research
10/03/2022

Stability Analysis and Generalization Bounds of Adversarial Training

In adversarial machine learning, deep neural networks can fit the advers...
research
04/19/2023

Wavelets Beat Monkeys at Adversarial Robustness

Research on improving the robustness of neural networks to adversarial n...
research
01/31/2023

Image Shortcut Squeezing: Countering Perturbative Availability Poisons with Compression

Perturbative availability poisoning (PAP) adds small changes to images t...
research
03/10/2017

Decorrelated Jet Substructure Tagging using Adversarial Neural Networks

We describe a strategy for constructing a neural network jet substructur...

Please sign up or login with your details

Forgot password? Click here to reset