Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples

02/01/2018
by   Anish Athalye, et al.
0

We identify obfuscated gradients as a phenomenon that leads to a false sense of security in defenses against adversarial examples. While defenses that cause obfuscated gradients appear to defeat optimization-based attacks, we find defenses relying on this effect can be circumvented. For each of the three types of obfuscated gradients we discover, we describe indicators of defenses exhibiting this effect and develop attack techniques to overcome it. In a case study, examining all defenses accepted to ICLR 2018, we find obfuscated gradients are a common occurrence, with 7 of 8 defenses relying on obfuscated gradients. Using our new attack techniques, we successfully circumvent all 7 of them.

READ FULL TEXT
research
10/23/2018

Stochastic Substitute Training: A Gray-box Approach to Craft Adversarial Examples Against Gradient Obfuscation Defenses

It has been shown that adversaries can craft example inputs to neural ne...
research
06/03/2022

Gradient Obfuscation Checklist Test Gives a False Sense of Security

One popular group of defense techniques against adversarial attacks is b...
research
04/10/2022

Measuring the False Sense of Security

Recently, several papers have demonstrated how widespread gradient maski...
research
01/10/2022

Evaluation of Neural Networks Defenses and Attacks using NDCG and Reciprocal Rank Metrics

The problem of attacks on neural networks through input modification (i....
research
02/17/2023

Measuring Equality in Machine Learning Security Defenses

The machine learning security community has developed myriad defenses fo...
research
06/18/2021

Indicators of Attack Failure: Debugging and Improving Optimization of Adversarial Examples

Evaluating robustness of machine-learning models to adversarial examples...
research
03/15/2022

SoK: Why Have Defenses against Social Engineering Attacks Achieved Limited Success?

Social engineering attacks are a major cyber threat because they often s...

Please sign up or login with your details

Forgot password? Click here to reset