VAE-CE: Visual Contrastive Explanation using Disentangled VAEs

08/20/2021
by Yoeri Poels, et al.

The goal of a classification model is to assign the correct labels to data. In most cases, this data is not fully described by the given set of labels. Often, a rich set of meaningful concepts exists in the domain that can describe each datapoint much more precisely. Such concepts can also be highly useful for interpreting the model's classifications. In this paper we propose a model, denoted Variational Autoencoder-based Contrastive Explanation (VAE-CE), that represents data with high-level concepts and uses this representation both for classification and for generating explanations. The explanations are produced in a contrastive manner, conveying why a datapoint is assigned to one class rather than to an alternative class. An explanation is specified as a sequence of transformations of the input datapoint, with each step depicting one concept changing towards the contrastive class. We build the model using a disentangled VAE, extended with a new supervised method for disentangling individual dimensions. An analysis on synthetic data and MNIST shows that the approaches to both disentanglement and explanation provide benefits over other methods.
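
To make the idea of a stepwise contrastive explanation concrete, the following is a minimal sketch, not the method from the paper. It assumes the datapoint and a representative of the contrastive class have already been encoded into disentangled latent vectors, and it changes one latent dimension per step towards the contrastive code. The function name `contrastive_steps`, the `threshold` parameter, and the ordering by largest difference are all illustrative assumptions; the abstract does not specify how the steps are selected or ordered.

```python
def contrastive_steps(z_query, z_contrast, threshold=0.5):
    """Build a sequence of latent codes from z_query towards z_contrast.

    Hypothetical helper: each step overwrites exactly one latent dimension
    (one "concept") with the value from the contrastive code.
    """
    # Per-dimension absolute difference between the two latent codes.
    diffs = [abs(c - q) for q, c in zip(z_query, z_contrast)]

    # Keep only dimensions that differ by more than the threshold.
    # Assumption: larger differences are changed first.
    dims = sorted(
        (d for d, v in enumerate(diffs) if v > threshold),
        key=lambda d: -diffs[d],
    )

    steps = [list(z_query)]  # step 0 is the unmodified query code
    for d in dims:
        nxt = list(steps[-1])    # copy the previous step
        nxt[d] = z_contrast[d]   # change a single concept
        steps.append(nxt)
    return steps, dims


# Usage: three latent dimensions, two of which differ meaningfully.
steps, dims = contrastive_steps([0.0, 1.0, -2.0], [0.1, -1.0, 2.0])
for i, z in enumerate(steps):
    print(i, z)
```

In a full pipeline, each latent code in `steps` would be passed through the VAE decoder to produce one image per step, so that consecutive images differ in a single concept. Dimensions below the threshold are left untouched, which keeps the explanation focused on the concepts that distinguish the two classes.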
