Visual Analytics of Neuron Vulnerability to Adversarial Attacks on Convolutional Neural Networks

03/06/2023
by Yiran Li, et al.

Adversarial attacks on a convolutional neural network (CNN), which inject human-imperceptible perturbations into an input image, can fool a high-performance CNN into making incorrect predictions. The success of such attacks raises serious concerns about the robustness of CNNs and prevents their use in safety-critical applications such as medical diagnosis and autonomous driving. Our work introduces a visual analytics approach to understanding adversarial attacks by answering two questions: (1) which neurons are more vulnerable to attacks, and (2) which image features do these vulnerable neurons capture during prediction? For the first question, we introduce multiple perturbation-based measures that break down the attack magnitude across individual CNN neurons and rank the neurons by their vulnerability levels. For the second, we identify image features (e.g., cat ears) that highly stimulate a user-selected neuron, to augment and validate the neuron's responsibility. Furthermore, we support interactive exploration of a large number of neurons through hierarchical clustering based on the neurons' roles in the prediction. To this end, we design a visual analytics system that incorporates visual reasoning for interpreting adversarial attacks. We validate the effectiveness of our system through multiple case studies as well as feedback from domain experts.
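
To make the idea of ranking neurons by vulnerability more concrete, here is a minimal sketch in Python. It is not the measure defined in the paper, whose definitions the abstract does not give. It assumes one simple stand-in: the mean absolute change in each neuron's activation between clean and adversarial versions of the same images. The function name `rank_neurons_by_activation_shift` and the toy activation values are hypothetical.

```python
import numpy as np


def rank_neurons_by_activation_shift(clean_acts, adv_acts):
    """Rank neurons by how much their activations change under attack.

    Assumption (not from the paper): vulnerability is approximated by the
    mean absolute difference between clean and adversarial activations.
    Both inputs have shape (n_images, n_neurons).
    """
    clean = np.asarray(clean_acts, dtype=float)
    adv = np.asarray(adv_acts, dtype=float)
    if clean.shape != adv.shape:
        raise ValueError("clean and adversarial activations must share a shape")

    # Per-neuron score: average |adversarial - clean| over all images.
    scores = np.abs(adv - clean).mean(axis=0)

    # Neuron indices sorted from most to least affected.
    order = np.argsort(-scores, kind="stable")
    return order, scores


# Toy activations for 2 images and 3 neurons (hypothetical values).
clean = np.array([[0.2, 1.0, 0.5],
                  [0.1, 0.9, 0.4]])
adv = np.array([[0.3, 0.1, 0.5],
                [0.1, 0.2, 0.6]])

order, scores = rank_neurons_by_activation_shift(clean, adv)
print(order, scores)
```

In this toy data, neuron 1 shifts the most and would be ranked first. In practice the activations would come from a chosen CNN layer, and a different perturbation-based score could be swapped in without changing the ranking step.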


