MACE: An Efficient Model-Agnostic Framework for Counterfactual Explanation

05/31/2022
by Wenzhuo Yang, et al.

Counterfactual explanation is an important Explainable AI technique for explaining machine learning predictions. Although the topic is actively studied, existing optimization-based methods often assume that the underlying machine-learning model is differentiable and treat categorical attributes as continuous ones. These assumptions restrict their real-world use when categorical attributes take many different values or the model is non-differentiable. To make counterfactual explanation suitable for real-world applications, we propose a novel framework, Model-Agnostic Counterfactual Explanation (MACE), which adopts a newly designed pipeline that efficiently handles non-differentiable machine-learning models with a large number of feature values. In our MACE approach, we propose a novel RL-based method for finding good counterfactual examples and a gradient-less descent method for improving proximity. Experiments on public datasets validate its effectiveness, showing better validity, sparsity, and proximity.
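To make the terms validity, sparsity, and proximity concrete, the following is a minimal sketch of a gradient-free refinement step on a black-box classifier. It is not the MACE algorithm: the greedy feature reversion here is a simple stand-in for a gradient-less proximity improvement, and the `predict` function, the feature names, and the helper names are all hypothetical, chosen only for illustration.

```python
from typing import Any, Callable, Dict

Instance = Dict[str, Any]


def predict(x: Instance) -> int:
    """Hypothetical black-box classifier (an assumption, not from the paper).

    Returns 1 (approve) if income is at least 50 or the applicant owns a home.
    It is only ever queried, never differentiated.
    """
    return int(x["income"] >= 50 or x["housing"] == "own")


def is_valid(cf: Instance, model: Callable[[Instance], int], target: int) -> bool:
    """Validity: the counterfactual receives the desired label."""
    return model(cf) == target


def sparsity(x: Instance, cf: Instance) -> int:
    """Sparsity, measured here as the number of features that differ from the query."""
    return sum(1 for name in x if x[name] != cf[name])


def greedy_revert(
    x: Instance, cf: Instance, model: Callable[[Instance], int], target: int
) -> Instance:
    """Move a valid counterfactual closer to the query without gradients.

    Each changed feature is reset to its original value in turn; the reset is
    kept only if the model still predicts the target label.
    """
    refined = dict(cf)
    for name in x:
        if refined[name] == x[name]:
            continue
        trial = dict(refined)
        trial[name] = x[name]
        if model(trial) == target:
            refined = trial
    return refined


query = {"income": 30, "housing": "rent", "age": 40}
candidate = {"income": 60, "housing": "own", "age": 45}  # valid but changes 3 features

refined = greedy_revert(query, candidate, predict, target=1)
print(refined, "changed features:", sparsity(query, refined))
```

Because the procedure only compares feature values and queries the model, it works unchanged for categorical attributes such as `housing` and for non-differentiable models. The result depends on the order in which features are tried, which is one reason a real system would need a more careful search.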

research
05/03/2021

Prototype-based Counterfactual Explanation for Causal Classification

Counterfactual explanation is one branch of interpretable machine learni...
research
06/25/2019

Explaining Deep Learning Models with Constrained Adversarial Examples

Machine learning algorithms generally suffer from a problem of explainab...
research
06/04/2021

Model-agnostic and Scalable Counterfactual Explanations via Reinforcement Learning

Counterfactual instances are a powerful tool to obtain valuable insights...
research
09/20/2021

Counterfactual Instances Explain Little

In many applications, it is important to be able to explain the decision...
research
04/15/2021

NICE: An Algorithm for Nearest Instance Counterfactual Explanations

In this paper we suggest NICE: a new algorithm to generate counterfactua...
research
03/26/2023

CeFlow: A Robust and Efficient Counterfactual Explanation Framework for Tabular Data using Normalizing Flows

Counterfactual explanation is a form of interpretable machine learning t...
research
07/20/2022

Learning Counterfactually Invariant Predictors

We propose a method to learn predictors that are invariant under counter...
