On the Connection between Game-Theoretic Feature Attributions and Counterfactual Explanations

07/13/2023
by   Emanuele Albini, et al.
0

Explainable Artificial Intelligence (XAI) has received widespread interest in recent years, and two of the most popular types of explanations are feature attributions, and counterfactual explanations. These classes of approaches have been largely studied independently and the few attempts at reconciling them have been primarily empirical. This work establishes a clear theoretical connection between game-theoretic feature attributions, focusing on but not limited to SHAP, and counterfactuals explanations. After motivating operative changes to Shapley values based feature attributions and counterfactual explanations, we prove that, under conditions, they are in fact equivalent. We then extend the equivalency result to game-theoretic solution concepts beyond Shapley values. Moreover, through the analysis of the conditions of such equivalence, we shed light on the limitations of naively using counterfactual explanations to provide feature importances. Experiments on three datasets quantitatively show the difference in explanations at every stage of the connection between the two approaches and corroborate the theoretical findings.

READ FULL TEXT

page 18

page 19

page 20

research
05/17/2021

Convex optimization for actionable & plausible counterfactual explanations

Transparency is an essential requirement of machine learning based decis...
research
02/26/2021

If Only We Had Better Counterfactual Explanations: Five Key Deficits to Rectify in the Evaluation of Counterfactual XAI Techniques

In recent years, there has been an explosion of AI research on counterfa...
research
03/29/2022

Diffusion Models for Counterfactual Explanations

Counterfactual explanations have shown promising results as a post-hoc f...
research
06/09/2022

A Learning-Theoretic Framework for Certified Auditing of Machine Learning Models

Responsible use of machine learning requires that models be audited for ...
research
10/27/2021

Counterfactual Shapley Additive Explanations

Feature attributions are a common paradigm for model explanations due to...
research
02/22/2021

Mutual information-based group explainers with coalition structure for machine learning model explanations

In this article, we propose and investigate ML group explainers in a gen...
research
09/21/2023

Quantifying Feature Importance of Games and Strategies via Shapley Values

Recent advances in game informatics have enabled us to find strong strat...

Please sign up or login with your details

Forgot password? Click here to reset