Explaining Classifiers with Causal Concept Effect (CaCE)

07/16/2019
by Yash Goyal, et al.

How can we understand classification decisions made by deep neural nets? We propose answering this question by using ideas from causal inference. We define the "Causal Concept Effect" (CaCE) as the causal effect that the presence or absence of a concept has on the prediction of a given deep neural net. We then use this measure as a means to understand what drives the network's prediction and what does not. Many existing interpretability methods, by contrast, rely solely on correlations, which can result in misleading explanations; we show how CaCE can avoid such mistakes. In high-risk domains such as medicine, knowing the root cause of a prediction is crucial. If we knew that the network's prediction was caused by arbitrary concepts, such as the lighting conditions in an X-ray room, rather than by medically meaningful concepts, we could avoid a disastrous deployment of such a model. Estimating CaCE is difficult in situations where we cannot easily simulate the do-operator. As a simple solution, we propose learning a generative model, specifically a Variational AutoEncoder (VAE), on image pixels or on image embeddings extracted from the classifier, and using it to measure VAE-CaCE. We show that, in controlled settings with synthetic and semi-natural high-dimensional images, VAE-CaCE estimates the true causal effect more accurately than other baselines.
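To make the contrast between a correlational and a causal estimate concrete, here is a minimal sketch on a toy problem. It assumes CaCE takes the usual average-effect form, the expected classifier output under do(C=1) minus the expected output under do(C=0); the abstract does not spell out the formula, so treat that as an assumption. The generator `generate`, the classifier `classifier`, and the confounder `z` are all hypothetical stand-ins, and the do-operator is simulated directly rather than through a VAE.

```python
import numpy as np

rng = np.random.default_rng(0)

def classifier(x):
    # Hypothetical classifier: it looks only at feature 1 (the confounder)
    # and ignores feature 0 (the concept of interest).
    return 1.0 / (1.0 + np.exp(-4.0 * (x[:, 1] - 0.5)))

def generate(c, z):
    # Hypothetical "image" generator: two noisy features, one per variable.
    x = np.stack([c, z], axis=1).astype(float)
    return x + 0.1 * rng.standard_normal(x.shape)

n = 20000
z = rng.integers(0, 2, size=n)            # confounder (e.g. lighting)
flip = rng.random(n) < 0.1
c_obs = np.where(flip, 1 - z, z)          # concept agrees with z 90% of the time

# Correlational estimate: compare predictions across observed concept values.
p_obs = classifier(generate(c_obs, z))
observational_gap = p_obs[c_obs == 1].mean() - p_obs[c_obs == 0].mean()

# Interventional estimate: force the concept on or off for every sample
# while leaving the confounder's distribution untouched.
p_do1 = classifier(generate(np.ones(n, dtype=int), z)).mean()
p_do0 = classifier(generate(np.zeros(n, dtype=int), z)).mean()
cace = p_do1 - p_do0

print(f"observational gap: {observational_gap:.3f}")
print(f"simulated CaCE:    {cace:.3f}")
```

In this toy setup the observational gap is large because the concept is correlated with the confounder, while the interventional difference is close to zero because the classifier never uses the concept. When the do-operator cannot be simulated this way, the abstract's proposal is to approximate the intervention with a learned generative model instead.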

