Counterfactual Explanations & Adversarial Examples – Common Grounds, Essential Differences, and Potential Transfers

09/11/2020
by Timo Freiesleben, et al.

It is well known that adversarial examples and counterfactual explanations are based on the same mathematical model. However, their relationship has not yet been studied at a conceptual level. The present paper fills this gap. We show that counterfactual reasoning is the common basis of both fields and that reliable machine learning is their shared goal. Moreover, we illustrate to what extent counterfactual explanations can be regarded as a more general concept than adversarial examples. We introduce the conceptual distinction between feasible and contesting counterfactual explanations and argue that adversarial examples are similar to the latter.


