Beneficial Perturbations Network for Defending Adversarial Examples

09/27/2020
by Shixian Wen, et al.

Adversarial training, in which a network is trained on both adversarial and clean examples, is one of the most trusted defenses against adversarial attacks. However, it faces three major practical difficulties: it is expensive in running memory and computation; it trades off accuracy between clean and adversarial examples; and it cannot foresee all adversarial attacks at training time. Here, we present a new solution that eases these three difficulties: Beneficial Perturbation Networks (BPN). BPN generates and leverages beneficial perturbations (in some sense the opposite of the well-known adversarial perturbations) as biases within the parameter space of the network, to neutralize the effects of adversarial perturbations on data samples. Thus, BPN can effectively defend against adversarial examples. Compared to adversarial training, we demonstrate that BPN significantly reduces the required running memory and computation costs by generating beneficial perturbations through recycling the gradients already computed when training on clean examples. In addition, BPN alleviates both the accuracy trade-off and the difficulty of foreseeing multiple attacks by improving the generalization of the network, thanks to the increased diversity of the training set achieved through the neutralization between adversarial and beneficial perturbations.
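The abstract does not give the exact update rule, but the gradient-recycling idea can be sketched in PyTorch as follows. Everything here is an illustrative assumption rather than the authors' implementation: the names (BeneficialLinear, train_step, EPS) are hypothetical, and the specific rule, stepping an extra per-layer bias in the loss-decreasing direction (the sign opposite to an FGSM attack step) using gradients already computed on clean examples, is one plausible reading of "recycling the gradients".

import torch
import torch.nn as nn
import torch.nn.functional as F

class BeneficialLinear(nn.Linear):
    """Linear layer with an extra additive bias that stores the
    beneficial perturbation in parameter space (illustrative)."""
    def __init__(self, in_features, out_features):
        super().__init__(in_features, out_features)
        self.bp = nn.Parameter(torch.zeros(out_features))

    def forward(self, x):
        return super().forward(x) + self.bp

EPS = 0.05  # step size of the FGSM-like beneficial update (assumed value)

def train_step(model, x, y, optimizer):
    model.zero_grad()
    loss = F.cross_entropy(model(x), y)
    loss.backward()   # single backward pass on clean examples
    optimizer.step()  # usual update of the ordinary weights
    # Recycle the gradients just computed: move each beneficial bias in the
    # loss-decreasing direction, i.e., the sign opposite to an FGSM step.
    with torch.no_grad():
        for m in model.modules():
            if isinstance(m, BeneficialLinear) and m.bp.grad is not None:
                m.bp -= EPS * m.bp.grad.sign()
    return loss.item()

# Usage: exclude the beneficial biases from the optimizer so they are
# updated only by the recycled-gradient rule above.
model = nn.Sequential(BeneficialLinear(784, 256), nn.ReLU(),
                      BeneficialLinear(256, 10))
weights = [p for n, p in model.named_parameters() if not n.endswith("bp")]
optimizer = torch.optim.SGD(weights, lr=0.1)

Because the beneficial biases reuse gradients from the ordinary clean-training backward pass, this adds no extra forward or backward passes, which is the source of the memory and computation savings claimed over adversarial training.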
