A Unified Game-Theoretic Interpretation of Adversarial Robustness

11/05/2021
by   Jie Ren, et al.
0

This paper provides a unified view to explain different adversarial attacks and defense methods, i.e. the view of multi-order interactions between input variables of DNNs. Based on the multi-order interaction, we discover that adversarial attacks mainly affect high-order interactions to fool the DNN. Furthermore, we find that the robustness of adversarially trained DNNs comes from category-specific low-order interactions. Our findings provide a potential method to unify adversarial perturbations and robustness, which can explain the existing defense methods in a principle way. Besides, our findings also make a revision of previous inaccurate understanding of the shape bias of adversarially learned features.

READ FULL TEXT
research
03/12/2021

Game-theoretic Understanding of Adversarially Learned Features

This paper aims to understand adversarial attacks and defense from a new...
research
10/07/2022

Game-Theoretic Understanding of Misclassification

This paper analyzes various types of image misclassification from a game...
research
06/21/2021

A Game-Theoretic Taxonomy of Visual Concepts in DNNs

In this paper, we rethink how a DNN encodes visual concepts of different...
research
06/05/2021

Ensemble Defense with Data Diversity: Weak Correlation Implies Strong Robustness

In this paper, we propose a framework of filter-based ensemble of deep n...
research
10/08/2020

A Unified Approach to Interpreting and Boosting Adversarial Transferability

In this paper, we use the interaction inside adversarial perturbations t...
research
07/24/2022

Proving Common Mechanisms Shared by Twelve Methods of Boosting Adversarial Transferability

Although many methods have been proposed to enhance the transferability ...
research
08/22/2022

BARReL: Bottleneck Attention for Adversarial Robustness in Vision-Based Reinforcement Learning

Robustness to adversarial perturbations has been explored in many areas ...

Please sign up or login with your details

Forgot password? Click here to reset