DeepAI AI Chat
Log In Sign Up

Exploring the Hyperparameter Landscape of Adversarial Robustness

by   Evelyn Duesterwald, et al.

Adversarial training shows promise as an approach for training models that are robust towards adversarial perturbation. In this paper, we explore some of the practical challenges of adversarial training. We present a sensitivity analysis that illustrates that the effectiveness of adversarial training hinges on the settings of a few salient hyperparameters. We show that the robustness surface that emerges across these salient parameters can be surprisingly complex and that therefore no effective one-size-fits-all parameter settings exist. We then demonstrate that we can use the same salient hyperparameters as tuning knob to navigate the tension that can arise between robustness and accuracy. Based on these findings, we present a practical approach that leverages hyperparameter optimization techniques for tuning adversarial training to maximize robustness while keeping the loss in accuracy within a defined budget.


page 1

page 2

page 3

page 4


Revisiting Loss Landscape for Adversarial Robustness

The study on improving the robustness of deep neural networks against ad...

Hyper-parameter Tuning for Adversarially Robust Models

This work focuses on the problem of hyper-parameter tuning (HPT) for rob...

Pareto Adversarial Robustness: Balancing Spatial Robustness and Sensitivity-based Robustness

Adversarial robustness, which mainly contains sensitivity-based robustne...

Adversarial Training for EM Classification Networks

We present a novel variant of Domain Adversarial Networks with impactful...

Bag of Tricks for Adversarial Training

Adversarial training (AT) is one of the most effective strategies for pr...

Data Profiling for Adversarial Training: On the Ruin of Problematic Data

Multiple intriguing problems hover in adversarial training, including ro...

On the Loss Landscape of Adversarial Training: Identifying Challenges and How to Overcome Them

We analyze the influence of adversarial training on the loss landscape o...