Benchmark Evaluation of Counterfactual Algorithms for XAI: From a White Box to a Black Box

03/04/2022
by   Yu-Liang Chou, et al.
19

Counterfactual explanations have recently been brought to light as a potentially crucial response to obtaining human-understandable explanations from predictive models in Explainable Artificial Intelligence (XAI). Despite the fact that various counterfactual algorithms have been proposed, the state of the art research still lacks standardised protocols to evaluate the quality of counterfactual explanations. In this work, we conducted a benchmark evaluation across different model agnostic counterfactual algorithms in the literature (DiCE, WatcherCF, prototype, unjustifiedCF), and we investigated the counterfactual generation process on different types of machine learning models ranging from a white box (decision tree) to a grey-box (random forest) and a black box (neural network). We evaluated the different counterfactual algorithms using several metrics including proximity, interpretability and functionality for five datasets. The main findings of this work are the following: (1) without guaranteeing plausibility in the counterfactual generation process, one cannot have meaningful evaluation results. This means that all explainable counterfactual algorithms that do not take into consideration plausibility in their internal mechanisms cannot be evaluated with the current state of the art evaluation metrics; (2) the counterfactual generated are not impacted by the different types of machine learning models; (3) DiCE was the only tested algorithm that was able to generate actionable and plausible counterfactuals, because it provides mechanisms to constraint features; (4) WatcherCF and UnjustifiedCF are limited to continuous variables and can not deal with categorical data.

READ FULL TEXT

page 5

page 9

page 24

page 26

research
08/22/2022

Shapelet-Based Counterfactual Explanations for Multivariate Time Series

As machine learning and deep learning models have become highly prevalen...
research
07/19/2021

Interpreting Process Predictions using a Milestone-Aware Counterfactual Approach

Predictive process analytics often apply machine learning to predict the...
research
03/02/2023

Counterfactual Edits for Generative Evaluation

Evaluation of generative models has been an underrepresented field despi...
research
09/22/2022

Counterfactual Explanations Using Optimization With Constraint Learning

Counterfactual explanations embody one of the many interpretability tech...
research
07/09/2021

A Framework and Benchmarking Study for Counterfactual Generating Methods on Tabular Data

Counterfactual explanations are viewed as an effective way to explain ma...
research
10/22/2021

ReLACE: Reinforcement Learning Agent for Counterfactual Explanations of Arbitrary Predictive Models

The demand for explainable machine learning (ML) models has been growing...
research
08/14/2023

Explaining Black-Box Models through Counterfactuals

We present CounterfactualExplanations.jl: a package for generating Count...

Please sign up or login with your details

Forgot password? Click here to reset