Explaining Deep Neural Networks with a Polynomial Time Algorithm for Shapley Values Approximation

03/26/2019
by   Marco Ancona, et al.
14

The problem of explaining the behavior of deep neural networks has gained a lot of attention over the last years. While several attribution methods have been proposed, most come without strong theoretical foundations. This raises the question of whether the resulting attributions are reliable. On the other hand, the literature on cooperative game theory suggests Shapley values as a unique way of assigning relevance scores such that certain desirable properties are satisfied. Previous works on attribution methods also showed that explanations based on Shapley values better agree with the human intuition. Unfortunately, the exact evaluation of Shapley values is prohibitively expensive, exponential in the number of input features. In this work, by leveraging recent results on uncertainty propagation, we propose a novel, polynomial-time approximation of Shapley values in deep neural networks. We show that our method produces significantly better approximations of Shapley values than existing state-of-the-art attribution methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/16/2017

A unified view of gradient-based attribution methods for Deep Neural Networks

Understanding the flow of information in Deep Neural Networks is a chall...
research
09/01/2021

Spatio-Temporal Perturbations for Video Attribution

The attribution method provides a direction for interpreting opaque neur...
research
06/26/2022

Explaining the root causes of unit-level changes

Existing methods of explainable AI and interpretable ML cannot explain c...
research
04/04/2023

HarsanyiNet: Computing Accurate Shapley Values in a Single Forward Propagation

The Shapley value is widely regarded as a trustworthy attribution metric...
research
09/05/2023

Computing SHAP Efficiently Using Model Structure Information

SHAP (SHapley Additive exPlanations) has become a popular method to attr...
research
06/14/2022

Machines Explaining Linear Programs

There has been a recent push in making machine learning models more inte...
research
11/27/2017

DeepAPT: Nation-State APT Attribution Using End-to-End Deep Neural Networks

In recent years numerous advanced malware, aka advanced persistent threa...

Please sign up or login with your details

Forgot password? Click here to reset