Minimal Explanations for Neural Network Predictions

05/19/2022
by   Ouns El Harzli, et al.
0

Explaining neural network predictions is known to be a challenging problem. In this paper, we propose a novel approach which can be effectively exploited, either in isolation or in combination with other methods, to enhance the interpretability of neural model predictions. For a given input to a trained neural model, our aim is to compute a smallest set of input features so that the model prediction changes when these features are disregarded by setting them to an uninformative baseline value. While computing such minimal explanations is computationally intractable in general for fully-connected neural networks, we show that the problem becomes solvable in polynomial time by a greedy algorithm under mild assumptions on the network's activation functions. We then show that our tractability result extends seamlessly to more advanced neural architectures such as convolutional and graph neural networks. We conduct experiments to showcase the capability of our method for identifying the input features that are essential to the model's prediction.

READ FULL TEXT
research
02/05/2021

CF-GNNExplainer: Counterfactual Explanations for Graph Neural Networks

Graph neural networks (GNNs) have shown increasing promise in real-world...
research
08/27/2023

The inverse problem for neural networks

We study the problem of computing the preimage of a set under a neural n...
research
06/22/2021

Towards Automated Evaluation of Explanations in Graph Neural Networks

Explaining Graph Neural Networks predictions to end users of AI applicat...
research
03/06/2020

Explaining Away Attacks Against Neural Networks

We investigate the problem of identifying adversarial attacks on image-b...
research
07/08/2021

Robust Counterfactual Explanations on Graph Neural Networks

Massive deployment of Graph Neural Networks (GNNs) in high-stake applica...
research
02/13/2022

Learning from Randomly Initialized Neural Network Features

We present the surprising result that randomly initialized neural networ...
research
09/05/2023

Computing SHAP Efficiently Using Model Structure Information

SHAP (SHapley Additive exPlanations) has become a popular method to attr...

Please sign up or login with your details

Forgot password? Click here to reset