Evaluating counterfactual explanations using Pearl's counterfactual method

01/06/2023
by   Bevan I. Smith, et al.
0

Counterfactual explanations (CEs) are methods for generating an alternative scenario that produces a different desirable outcome. For example, if a student is predicted to fail a course, then counterfactual explanations can provide the student with alternate ways so that they would be predicted to pass. The applications are many. However, CEs are currently generated from machine learning models that do not necessarily take into account the true causal structure in the data. By doing this, bias can be introduced into the CE quantities. I propose in this study to test the CEs using Judea Pearl's method of computing counterfactuals which has thus far, surprisingly, not been seen in the counterfactual explanation (CE) literature. I furthermore evaluate these CEs on three different causal structures to show how the true underlying causal structure affects the CEs that are generated. This study presented a method of evaluating CEs using Pearl's method and it showed, (although using a limited sample size), that thirty percent of the CEs conflicted with those computed by Pearl's method. This shows that we cannot simply trust CEs and it is vital for us to know the true causal structure before we blindly compute counterfactuals using the original machine learning model.

READ FULL TEXT
research
05/19/2019

Explaining Machine Learning Classifiers through Diverse Counterfactual Explanations

Post-hoc explanations of machine learning models are crucial for people ...
research
10/09/2020

A Series of Unfortunate Counterfactual Events: the Role of Time in Counterfactual Explanations

Counterfactual explanations are a prominent example of post-hoc interpre...
research
04/21/2022

Features of Explainability: How users understand counterfactual and causal explanations for categorical and continuous features in XAI

Counterfactual explanations are increasingly used to address interpretab...
research
10/16/2022

CLEAR: Generative Counterfactual Explanations on Graphs

Counterfactual explanations promote explainability in machine learning m...
research
05/31/2021

Counterfactual Invariance to Spurious Correlations: Why and How to Pass Stress Tests

Informally, a `spurious correlation' is the dependence of a model on som...
research
05/23/2023

Counterfactual Augmentation for Multimodal Learning Under Presentation Bias

In real-world machine learning systems, labels are often derived from us...
research
05/03/2021

Prototype-based Counterfactual Explanation for Causal Classification

Counterfactual explanation is one branch of interpretable machine learni...

Please sign up or login with your details

Forgot password? Click here to reset