Semantic Adversarial Attacks via Diffusion Models

09/14/2023
by   Chenan Wang, et al.
0

Traditional adversarial attacks concentrate on manipulating clean examples in the pixel space by adding adversarial perturbations. By contrast, semantic adversarial attacks focus on changing semantic attributes of clean examples, such as color, context, and features, which are more feasible in the real world. In this paper, we propose a framework to quickly generate a semantic adversarial attack by leveraging recent diffusion models since semantic information is included in the latent space of well-trained diffusion models. Then there are two variants of this framework: 1) the Semantic Transformation (ST) approach fine-tunes the latent space of the generated image and/or the diffusion model itself; 2) the Latent Masking (LM) approach masks the latent space with another target image and local backpropagation-based interpretation methods. Additionally, the ST approach can be applied in either white-box or black-box settings. Extensive experiments are conducted on CelebA-HQ and AFHQ datasets, and our framework demonstrates great fidelity, generalizability, and transferability compared to other baselines. Our approaches achieve approximately 100 as 36.61. Code is available at https://github.com/steven202/semantic_adv_via_dm.

READ FULL TEXT

page 2

page 5

page 6

page 8

page 9

page 10

page 17

page 18

research
05/14/2023

Diffusion Models for Imperceptible and Transferable Adversarial Attack

Many existing adversarial attacks generate L_p-norm perturbations on ima...
research
06/14/2023

On the Robustness of Latent Diffusion Models

Latent diffusion models achieve state-of-the-art performance on a variet...
research
04/10/2023

Generating Adversarial Attacks in the Latent Space

Adversarial attacks in the input (pixel) space typically incorporate noi...
research
05/22/2023

Hierarchical Integration Diffusion Model for Realistic Image Deblurring

Diffusion models (DMs) have recently been introduced in image deblurring...
research
01/06/2020

Generating Semantic Adversarial Examples via Feature Manipulation

The vulnerability of deep neural networks to adversarial attacks has bee...
research
05/22/2023

Latent Magic: An Investigation into Adversarial Examples Crafted in the Semantic Latent Space

Adversarial attacks against Deep Neural Networks(DNN) have been a crutia...
research
02/23/2023

Boosting Adversarial Transferability using Dynamic Cues

The transferability of adversarial perturbations between image models ha...

Please sign up or login with your details

Forgot password? Click here to reset