Integrating Prior Knowledge in Post-hoc Explanations

04/25/2022
by   Adulam Jeyasothy, et al.
6

In the field of eXplainable Artificial Intelligence (XAI), post-hoc interpretability methods aim at explaining to a user the predictions of a trained decision model. Integrating prior knowledge into such interpretability methods aims at improving the explanation understandability and allowing for personalised explanations adapted to each user. In this paper, we propose to define a cost function that explicitly integrates prior knowledge into the interpretability objectives: we present a general framework for the optimization problem of post-hoc interpretability methods, and show that user knowledge can thus be integrated to any method by adding a compatibility term in the cost function. We instantiate the proposed formalization in the case of counterfactual explanations and propose a new interpretability method called Knowledge Integration in Counterfactual Explanation (KICE) to optimize it. The paper performs an experimental study on several benchmark data sets to characterize the counterfactual instances generated by KICE, as compared to reference methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/22/2019

The Dangers of Post-hoc Interpretability: Unjustified Counterfactual Explanations

Post-hoc interpretability approaches have been proven to be powerful too...
research
04/19/2021

DA-DGCEx: Ensuring Validity of Deep Guided Counterfactual Explanations With Distribution-Aware Autoencoder Loss

Deep Learning has become a very valuable tool in different fields, and n...
research
05/10/2023

Achieving Diversity in Counterfactual Explanations: a Review and Discussion

In the field of Explainable Artificial Intelligence (XAI), counterfactua...
research
12/22/2017

Inverse Classification for Comparison-based Interpretability in Machine Learning

In the context of post-hoc interpretability, this paper addresses the ta...
research
05/06/2022

Let's Go to the Alien Zoo: Introducing an Experimental Framework to Study Usability of Counterfactual Explanations for Machine Learning

To foster usefulness and accountability of machine learning (ML), it is ...
research
06/24/2021

Meaningfully Explaining a Model's Mistakes

Understanding and explaining the mistakes made by trained models is crit...
research
12/04/2018

Multimodal Explanations by Predicting Counterfactuality in Videos

This study addresses generating counterfactual explanations with multimo...

Please sign up or login with your details

Forgot password? Click here to reset