Auditing Visualizations: Transparency Methods Struggle to Detect Anomalous Behavior

06/27/2022
by Jean-Stanislas Denain, et al.

Transparency methods such as model visualizations provide information that outputs alone might miss, since they describe the internals of neural networks. But can we trust that model explanations reflect model behavior? For instance, can they diagnose abnormal behavior such as backdoors or shape bias? To evaluate model explanations, we define a model as anomalous if it differs from a reference set of normal models, and we test whether transparency methods assign different explanations to anomalous and normal models. We find that while existing methods can detect stark anomalies such as shape bias or adversarial training, they struggle to identify more subtle anomalies such as models trained on incomplete data. Moreover, they generally fail to distinguish the inputs that induce anomalous behavior, e.g. images containing a backdoor trigger. These results reveal new blind spots in existing model explanations, pointing to the need for further method development.
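The evaluation idea in the abstract (compare a candidate model's explanation against those of a reference set of normal models, then check whether anomalous models stand out) can be illustrated with a minimal sketch. This is not the authors' procedure: it assumes each model's explanation has already been summarized as a fixed-length feature vector, and the names `distance`, `anomaly_score`, and `auroc`, along with the toy numbers, are hypothetical.

```python
import math
from statistics import mean


def distance(a, b):
    """Euclidean distance between two explanation feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def anomaly_score(candidate, reference):
    """Mean distance from a candidate model's explanation vector
    to the explanation vectors of the reference (normal) models."""
    return mean(distance(candidate, r) for r in reference)


def auroc(normal_scores, anomalous_scores):
    """Probability that a randomly chosen anomalous model scores higher
    than a randomly chosen normal model (ties count as one half)."""
    wins = 0.0
    for a in anomalous_scores:
        for n in normal_scores:
            if a > n:
                wins += 1.0
            elif a == n:
                wins += 0.5
    return wins / (len(normal_scores) * len(anomalous_scores))


# Toy explanation vectors (assumed, purely illustrative).
reference = [[0.0, 0.0], [0.1, -0.1], [-0.1, 0.1]]  # reference set of normal models
held_out_normal = [[0.05, 0.0], [0.0, -0.05]]       # normal models outside the reference set
anomalous = [[1.0, 1.0], [0.02, 0.01]]              # one stark and one subtle anomaly

normal_scores = [anomaly_score(m, reference) for m in held_out_normal]
anomalous_scores = [anomaly_score(m, reference) for m in anomalous]

print("normal scores:   ", normal_scores)
print("anomalous scores:", anomalous_scores)
print("AUROC:", auroc(normal_scores, anomalous_scores))
```

In this toy setup the stark anomaly lies far from the reference explanations, while the subtle one sits inside the spread of the normal models, so a distance-based score separates only the former. The choice of Euclidean distance and mean aggregation is an assumption made for brevity; any scoring rule over explanations could be substituted.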


research
07/22/2023

Multi-representations Space Separation based Graph-level Anomaly-aware Detection

Graph structure patterns are widely used to model different area data re...
research
04/28/2022

Anomaly Detection by Leveraging Incomplete Anomalous Knowledge with Anomaly-Aware Bidirectional GANs

The goal of anomaly detection is to identify anomalous samples from norm...
research
12/20/2015

ATD: Anomalous Topic Discovery in High Dimensional Discrete Data

We propose an algorithm for detecting patterns exhibited by anomalous cl...
research
04/25/2021

Unsupervised Learning of Multi-level Structures for Anomaly Detection

The main difficulty in high-dimensional anomaly detection tasks is the l...
research
08/13/2020

LAC : LSTM AUTOENCODER with Community for Insider Threat Detection

The employees of any organization, institute, or industry, spend a signi...
research
02/15/2018

Detecting Anomalous Faces with 'No Peeking' Autoencoders

Detecting anomalous faces has important applications. For example, a sys...
research
06/02/2020

An Alternative Metric for Detecting Anomalous Ship Behavior Using a Variation of the DBSCAN Clustering Algorithm

There is a growing need to quickly and accurately identify anomalous beh...
