Improving Black-box Adversarial Attacks with a Transfer-based Prior

06/17/2019
by   Shuyu Cheng, et al.
0

We consider the black-box adversarial setting, where the adversary has to generate adversarial perturbations without access to the target models to compute gradients. Previous methods tried to approximate the gradient either by using a transfer gradient of a surrogate white-box model, or based on the query feedback. However, these methods often suffer from low attack success rates or poor query efficiency since it is non-trivial to estimate the gradient in a high-dimensional space with limited information. To address these problems, we propose a prior-guided random gradient-free (P-RGF) method to improve black-box adversarial attacks, which takes the advantage of a transfer-based prior and the query information simultaneously. The transfer-based prior given by the gradient of a surrogate model is appropriately integrated into our algorithm by an optimal coefficient derived by a theoretical analysis. Extensive experiments demonstrate that our method requires much fewer queries to attack black-box models with higher success rates compared with the alternative state-of-the-art methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/13/2022

Query-Efficient Black-box Adversarial Attacks Guided by a Transfer-based Prior

Adversarial attacks have been extensively studied in recent years since ...
research
11/27/2018

A Frank-Wolfe Framework for Efficient and Effective Adversarial Attacks

Depending on how much information an adversary can access to, adversaria...
research
03/16/2022

Attacking deep networks with surrogate-based adversarial black-box methods is easy

A recent line of work on black-box adversarial attacks has revived the u...
research
10/21/2020

Learning Black-Box Attackers with Transferable Priors and Query Feedback

This paper addresses the challenging black-box adversarial attack proble...
research
05/08/2020

Projection Probability-Driven Black-Box Attack

Generating adversarial examples in a black-box setting retains a signifi...
research
12/12/2020

Query-free Black-box Adversarial Attacks on Graphs

Many graph-based machine learning models are known to be vulnerable to a...
research
10/06/2021

Attack as the Best Defense: Nullifying Image-to-image Translation GANs via Limit-aware Adversarial Attack

With the successful creation of high-quality image-to-image (Img2Img) tr...

Please sign up or login with your details

Forgot password? Click here to reset