SCOUT: Self-aware Discriminant Counterfactual Explanations

04/16/2020
by   Pei Wang, et al.
0

The problem of counterfactual visual explanations is considered. A new family of discriminant explanations is introduced. These produce heatmaps that attribute high scores to image regions informative of a classifier prediction but not of a counter class. They connect attributive explanations, which are based on a single heat map, to counterfactual explanations, which account for both predicted class and counter class. The latter are shown to be computable by combination of two discriminant explanations, with reversed class pairs. It is argued that self-awareness, namely the ability to produce classification confidence scores, is important for the computation of discriminant explanations, which seek to identify regions where it is easy to discriminate between prediction and counter class. This suggests the computation of discriminant explanations by the combination of three attribution maps. The resulting counterfactual explanations are optimization free and thus much faster than previous methods. To address the difficulty of their evaluation, a proxy task and set of quantitative metrics are also proposed. Experiments under this protocol show that the proposed counterfactual explanations outperform the state of the art while achieving much higher speeds, for popular networks. In a human-learning machine teaching experiment, they are also shown to improve mean student accuracy from chance level to 95%.

READ FULL TEXT

page 1

page 8

research
04/16/2019

Counterfactual Visual Explanations

A counterfactual query is typically of the form 'For situation X, why wa...
research
12/21/2022

VCNet: A self-explaining model for realistic counterfactual generation

Counterfactual explanation is a common class of methods to make local ex...
research
09/22/2022

Counterfactual Explanations Using Optimization With Constraint Learning

Counterfactual explanations embody one of the many interpretability tech...
research
03/24/2022

Making Heads or Tails: Towards Semantically Consistent Visual Counterfactuals

A visual counterfactual explanation replaces image regions in a query im...
research
12/02/2021

Counterfactual Explanations via Latent Space Projection and Interpolation

Counterfactual explanations represent the minimal change to a data sampl...
research
08/08/2019

Measurable Counterfactual Local Explanations for Any Classifier

We propose a novel method for explaining the predictions of any classifi...
research
07/21/2021

Answer-Set Programs for Reasoning about Counterfactual Interventions and Responsibility Scores for Classification

We describe how answer-set programs can be used to declaratively specify...

Please sign up or login with your details

Forgot password? Click here to reset