Fine-Grained Neural Network Explanation by Identifying Input Features with Predictive Information

10/04/2021
by   Yang Zhang, et al.
4

One principal approach for illuminating a black-box neural network is feature attribution, i.e. identifying the importance of input features for the network's prediction. The predictive information of features is recently proposed as a proxy for the measure of their importance. So far, the predictive information is only identified for latent features by placing an information bottleneck within the network. We propose a method to identify features with predictive information in the input domain. The method results in fine-grained identification of input features' information and is agnostic to network architecture. The core idea of our method is leveraging a bottleneck on the input that only lets input features associated with predictive latent features pass through. We compare our method with several feature attribution methods using mainstream feature attribution evaluation experiments. The code is publicly available.

READ FULL TEXT
research
04/01/2021

Explaining COVID-19 and Thoracic Pathology Model Predictions by Identifying Informative Input Features

Neural networks have demonstrated remarkable performance in classificati...
research
09/28/2021

Discriminative Attribution from Counterfactuals

We present a method for neural network interpretability by combining fea...
research
03/31/2021

Neural Response Interpretation through the Lens of Critical Pathways

Is critical input information encoded in specific sparse pathways within...
research
11/25/2019

Explaining Neural Networks via Perturbing Important Learned Features

Attributing the output of a neural network to the contribution of given ...
research
04/07/2021

Information Bottleneck Attribution for Visual Explanations of Diagnosis and Prognosis

Visual explanation methods have an important role in the prognosis of th...
research
05/30/2018

How Important Is a Neuron?

The problem of attributing a deep network's prediction to its input/base...
research
01/04/2021

On Baselines for Local Feature Attributions

High-performing predictive models, such as neural nets, usually operate ...

Please sign up or login with your details

Forgot password? Click here to reset