Enhancing Adversarial Training with Feature Separability

05/02/2022
by   Yaxin Li, et al.
2

Deep Neural Network (DNN) are vulnerable to adversarial attacks. As a countermeasure, adversarial training aims to achieve robustness based on the min-max optimization problem and it has shown to be one of the most effective defense strategies. However, in this work, we found that compared with natural training, adversarial training fails to learn better feature representations for either clean or adversarial samples, which can be one reason why adversarial training tends to have severe overfitting issues and less satisfied generalize performance. Specifically, we observe two major shortcomings of the features learned by existing adversarial training methods:(1) low intra-class feature similarity; and (2) conservative inter-classes feature variance. To overcome these shortcomings, we introduce a new concept of adversarial training graph (ATG) with which the proposed adversarial training with feature separability (ATFS) enables to coherently boost the intra-class feature similarity and increase inter-class feature variance. Through comprehensive experiments, we demonstrate that the proposed ATFS framework significantly improves both clean and robust performance.

READ FULL TEXT
research
10/26/2022

Improving Adversarial Robustness with Self-Paced Hard-Class Pair Reweighting

Deep Neural Networks are vulnerable to adversarial attacks. Among many d...
research
06/25/2023

Enhancing Adversarial Training via Reweighting Optimization Trajectory

Despite the fact that adversarial training has become the de facto metho...
research
09/29/2022

Regularizing Neural Network Training via Identity-wise Discriminative Feature Suppression

It is well-known that a deep neural network has a strong fitting capabil...
research
11/03/2018

Learning to Defense by Learning to Attack

Adversarial training provides a principled approach for training robust ...
research
06/17/2019

MixUp as Directional Adversarial Training

In this work, we explain the working mechanism of MixUp in terms of adve...
research
02/17/2023

Revisiting adversarial training for the worst-performing class

Despite progress in adversarial training (AT), there is a substantial ga...
research
12/14/2020

Adaptive Verifiable Training Using Pairwise Class Similarity

Verifiable training has shown success in creating neural networks that a...

Please sign up or login with your details

Forgot password? Click here to reset