Asymmetric Shapley values: incorporating causal knowledge into model-agnostic explainability

10/14/2019
by   Christopher Frye, et al.

Explaining AI systems is fundamental both to the development of high-performing models and to the trust placed in them by their users. Shapley values provide a general framework for explaining any AI model: they attribute the prediction output to the various model inputs ("features") in a principled and model-agnostic way. The outstanding strength of Shapley values is their combined generality and rigorous foundation: they can be used to explain any AI system, and they can always be understood as the unique attribution method satisfying a set of mathematical axioms. However, as a framework, Shapley values are too restrictive in one significant regard: they ignore all causal structure in the data. We introduce a less restrictive framework for model-agnostic explainability: "Asymmetric" Shapley values. Asymmetric Shapley values (ASVs) are rigorously founded on a set of axioms, applicable to any AI system, and can flexibly incorporate any causal knowledge known a priori to be respected by the data. We show through explicit, realistic examples that the ASV framework can be used to (i) improve model explanations by incorporating causal information, (ii) provide an unambiguous test for unfair discrimination based on simple policy articulations, (iii) enable sequentially incremental explanations in time-series models, and (iv) support feature-selection studies without the need for model retraining.
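The contrast between the two frameworks can be illustrated with a minimal sketch. It assumes the usual formulation in which Shapley values average each feature's marginal contribution over all feature orderings, and it assumes that ASVs restrict that average to orderings consistent with known causal structure (causes placed before their effects). The function name `shapley_values`, the `allowed` predicate, and the two-feature toy value function are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
from itertools import permutations


def shapley_values(value_fn, n_features, allowed=None):
    """Average marginal contributions over feature orderings.

    value_fn: maps a frozenset of feature indices to a real number.
    allowed:  optional predicate on an ordering (a tuple of indices).
              Orderings that fail it get zero weight; the rest are
              weighted uniformly. With allowed=None every ordering
              counts, which gives the ordinary (symmetric) values.
    """
    orderings = [p for p in permutations(range(n_features))
                 if allowed is None or allowed(p)]
    if not orderings:
        raise ValueError("no ordering satisfies the constraint")

    phi = [0.0] * n_features
    for order in orderings:
        coalition = frozenset()
        previous = value_fn(coalition)
        for i in order:
            coalition = coalition | {i}
            current = value_fn(coalition)
            phi[i] += current - previous  # marginal contribution of i
            previous = current
    return [total / len(orderings) for total in phi]


# Toy value function (an assumption for illustration): feature 1 is a
# deterministic effect of feature 0, so knowing either one fully
# determines the model output.
toy_values = {
    frozenset(): 0.0,
    frozenset({0}): 1.0,
    frozenset({1}): 1.0,
    frozenset({0, 1}): 1.0,
}


def value_fn(coalition):
    return toy_values[coalition]


def cause_first(order):
    # Assumed causal knowledge: feature 0 precedes feature 1.
    return order.index(0) < order.index(1)


symmetric = shapley_values(value_fn, 2)
asymmetric = shapley_values(value_fn, 2, allowed=cause_first)
print(symmetric)
print(asymmetric)
```

In this toy case the symmetric attribution is split evenly between the two redundant features, while the restricted average gives all credit to the upstream cause. Both attributions sum to the same total. Enumerating orderings is exponential in the number of features, so this form is practical only for very small examples.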


