Understanding Integrated Gradients with SmoothTaylor for Deep Neural Network Attribution

04/22/2020
by   Gary S. W. Goh, et al.
5

Integrated gradients as an attribution method for deep neural network models offers simple implementability. However, it also suffers from noisiness of explanations, which affects the ease of interpretability. In this paper, we present Smooth Integrated Gradients as a statistically improved attribution method inspired by Taylor's theorem, which does not require a fixed baseline to be chosen. We apply both methods to the image classification problem, using the ILSVRC2012 ImageNet object recognition dataset, and a couple of pretrained image models to generate attribution maps of their predictions. These attribution maps are visualized by saliency maps which can be evaluated qualitatively. We also empirically evaluate them using quantitative metrics such as perturbations-based score drops and multi-scaled total variance. We further propose adaptive noising to optimize for the noise scale hyperparameter value in our proposed method. From our experiments, we find that the Smooth Integrated Gradients approach together with adaptive noising is able to generate better quality saliency maps with lesser noise and higher sensitivity to the relevant points in the input space.

READ FULL TEXT
research
06/06/2019

Segment Integrated Gradients: Better attributions through regions

Saliency methods can aid understanding of deep neural networks. Recent y...
research
06/17/2021

Guided Integrated Gradients: An Adaptive Path Method for Removing Noise

Integrated Gradients (IG) is a commonly used feature attribution method ...
research
04/03/2020

Attribution in Scale and Space

We study the attribution problem [28] for deep networks applied to perce...
research
11/21/2018

Compensated Integrated Gradients to Reliably Interpret EEG Classification

Integrated gradients are widely employed to evaluate the contribution of...
research
06/13/2022

Geometrically Guided Integrated Gradients

Interpretability methods for deep neural networks mainly focus on the se...
research
01/17/2023

Negative Flux Aggregation to Estimate Feature Attributions

There are increasing demands for understanding deep neural networks' (DN...
research
10/18/2019

Understanding Deep Networks via Extremal Perturbations and Smooth Masks

The problem of attribution is concerned with identifying the parts of an...

Please sign up or login with your details

Forgot password? Click here to reset