Multi-Objective Counterfactual Explanations

04/23/2020
by   Susanne Dandl, et al.
0

Counterfactual explanations are one of the most popular methods to make predictions of black box machine learning models interpretable by providing explanations in the form of `what-if scenarios'. Current approaches can compute counterfactuals only for certain model classes or feature types, or they generate counterfactuals that are not consistent with the observed data distribution. To overcome these limitations, we propose the Multi-Objective Counterfactuals (MOC) method, which translates the counterfactual search into a multi-objective optimization problem and solves it with a genetic algorithm based on NSGA-II. It returns a diverse set of counterfactuals with different trade-offs between the proposed objectives, enabling either a more detailed post-hoc analysis to facilitate better understanding or more options for actionable user responses to change the predicted outcome. We show the usefulness of MOC in concrete cases and compare our approach with state-of-the-art methods for counterfactual explanations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/17/2021

Optimal Counterfactual Explanations for Scorecard modelling

Counterfactual explanations is one of the post-hoc methods used to provi...
research
09/27/2022

Learning to Counter: Stochastic Feature-based Learning for Diverse Counterfactual Explanations

Interpretable machine learning seeks to understand the reasoning process...
research
04/12/2021

Consequence-aware Sequential Counterfactual Generation

Counterfactuals have become a popular technique nowadays for interacting...
research
11/29/2021

Multi-objective Explanations of GNN Predictions

Graph Neural Network (GNN) has achieved state-of-the-art performance in ...
research
04/08/2023

Counterfactual Explanations of Neural Network-Generated Response Curves

Response curves exhibit the magnitude of the response of a sensitive sys...
research
05/16/2022

Gradient-based Counterfactual Explanations using Tractable Probabilistic Models

Counterfactual examples are an appealing class of post-hoc explanations ...

Please sign up or login with your details

Forgot password? Click here to reset