
Efficient computation of contrastive explanations

by André Artelt, et al.

With the increasing deployment of machine learning systems in practice, transparency and explainability have become serious issues. Contrastive explanations are considered useful and intuitive, in particular when explaining decisions to lay people, since they mimic the way humans explain. Yet, so far, comparatively little research has addressed computationally feasible technologies that allow guarantees on the uniqueness and optimality of the explanation and that enable easy incorporation of additional constraints. Here, we focus on specific types of models rather than black-box technologies. We study the relation between contrastive and counterfactual explanations and propose mathematical formalizations as well as a two-phase algorithm for efficiently computing pertinent positives of many standard machine learning models.




On the computation of counterfactual explanations – A survey

Due to the increasing use of machine learning in practice it becomes mor...

Explaining Explanations in AI

Recent work on interpretability in machine learning and AI has focused o...

One Explanation Does Not Fit All: The Promise of Interactive Explanations for Machine Learning Transparency

The need for transparency of predictive systems based on Machine Learnin...

Explaining NLP Models via Minimal Contrastive Editing (MiCE)

Humans give contrastive explanations that explain why an observed event ...

Influence-Driven Explanations for Bayesian Network Classifiers

One of the most pressing issues in AI in recent years has been the need ...

Contrastive Corpus Attribution for Explaining Representations

Despite the widespread use of unsupervised models, very few methods are ...

Why X rather than Y? Explaining Neural Model' Predictions by Generating Intervention Counterfactual Samples

Even though the topic of explainable AI/ML is very popular in text and c...
