Predictive and Causal Implications of using Shapley Value for Model Interpretation

08/12/2020
by   Sisi Ma, et al.
0

Shapley value is a concept from game theory. Recently, it has been used for explaining complex models produced by machine learning techniques. Although the mathematical definition of Shapley value is straight-forward, the implication of using it as a model interpretation tool is yet to be described. In the current paper, we analyzed Shapley value in the Bayesian network framework. We established the relationship between Shapley value and conditional independence, a key concept in both predictive and causal modeling. Our results indicate that, eliminating a variable with high Shapley value from a model do not necessarily impair predictive performance, whereas eliminating a variable with low Shapley value from a model could impair performance. Therefore, using Shapley value for feature selection do not result in the most parsimonious and predictively optimal model in the general case. More importantly, Shapley value of a variable do not reflect their causal relationship with the target of interest.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/28/2012

Extension of Three-Variable Counterfactual Casual Graphic Model: from Two-Value to Three-Value Random Variable

The extension of counterfactual causal graphic model with three variable...
research
01/21/2020

Nonparametric Causal Feature Selection for Spatiotemporal Risk Mapping of Malaria Incidence in Madagascar

Modern disease mapping uses high resolution environmental and socioecono...
research
02/11/2022

The Shapley Value in Machine Learning

Over the last few years, the Shapley value, a solution concept from coop...
research
06/09/2023

Explaining Predictive Uncertainty with Information Theoretic Shapley Values

Researchers in explainable artificial intelligence have developed numero...
research
05/22/2021

Learning Baseline Values for Shapley Values

This paper aims to formulate the problem of estimating the optimal basel...
research
08/08/2018

L-Shapley and C-Shapley: Efficient Model Interpretation for Structured Data

We study instancewise feature importance scoring as a method for model i...
research
09/25/2018

Why scatter plots suggest causality, and what we can do about it

Scatter plots carry an implicit if subtle message about causality. Wheth...

Please sign up or login with your details

Forgot password? Click here to reset