On Distinctive Properties of Universal Perturbations

12/31/2021
by   Sung-Min Park, et al.
6

We identify properties of universal adversarial perturbations (UAPs) that distinguish them from standard adversarial perturbations. Specifically, we show that targeted UAPs generated by projected gradient descent exhibit two human-aligned properties: semantic locality and spatial invariance, which standard targeted adversarial perturbations lack. We also demonstrate that UAPs contain significantly less signal for generalization than standard adversarial perturbations – that is, UAPs leverage non-robust features to a smaller extent than standard adversarial perturbations.

READ FULL TEXT

page 5

page 21

page 22

page 23

page 24

page 25

page 26

page 27

research
12/08/2020

Locally optimal detection of stochastic targeted universal adversarial perturbations

Deep learning image classifiers are known to be vulnerable to small adve...
research
08/08/2019

Universal Adversarial Audio Perturbations

We demonstrate the existence of universal adversarial perturbations, whi...
research
06/22/2022

Robust Universal Adversarial Perturbations

Universal Adversarial Perturbations (UAPs) are imperceptible, image-agno...
research
08/04/2020

Can Adversarial Weight Perturbations Inject Neural Backdoors?

Adversarial machine learning has exposed several security hazards of neu...
research
11/15/2018

A Spectral View of Adversarially Robust Features

Given the apparent difficulty of learning models that are robust to adve...
research
08/27/2018

Targeted Nonlinear Adversarial Perturbations in Images and Videos

We introduce a method for learning adversarial perturbations targeted to...
research
06/12/2020

Targeted Adversarial Perturbations for Monocular Depth Prediction

We study the effect of adversarial perturbations on the task of monocula...

Please sign up or login with your details

Forgot password? Click here to reset