Counterfactual Generation and Fairness Evaluation Using Adversarially Learned Inference

09/17/2020
by Saloni Dash, et al.

Recent studies have reported biases in machine learning image classifiers, especially against particular demographic groups. Counterfactual examples for an input (perturbations that change specific features but not others) have been shown to be useful for evaluating the explainability and fairness of machine learning models. However, generating counterfactual examples for images is non-trivial because an underlying causal structure governs the features of an image. To be meaningful, generated perturbations need to satisfy the constraints implied by the causal model. We present a method for generating counterfactuals by incorporating a known causal graph structure in a conditional variant of Adversarially Learned Inference (ALI). The proposed approach learns causal relationships between the specified attributes of an image and generates counterfactuals in accordance with these relationships. On the Morpho-MNIST and CelebA datasets, the method generates counterfactuals that change the specified attributes and their causal descendants while keeping other attributes constant. As an application, we use the counterfactuals generated from CelebA images to evaluate bias in a classifier that predicts the attractiveness of a face.
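The requirement that a counterfactual changes an attribute and its causal descendants, but nothing else, can be illustrated on a toy structural causal model. The following is a minimal sketch only, not the ALI-based method described above: the three attributes `a`, `b`, `c`, the linear equation `b := 2a + u_b`, and the helper names `abduct`, `predict`, `counterfactual`, and `flip_rate` are all assumptions made for illustration. It follows the standard abduction, action, prediction recipe and then counts how often a classifier's label changes under the counterfactual, which is one simple way such counterfactuals might be used to probe fairness.

```python
# Toy linear SCM (hypothetical, for illustration only):
#   a := u_a
#   b := 2.0 * a + u_b   (b is a causal descendant of a)
#   c := u_c             (c is not a descendant of a)


def abduct(obs):
    """Abduction: recover the noise terms consistent with an observation."""
    return {
        "u_a": obs["a"],
        "u_b": obs["b"] - 2.0 * obs["a"],
        "u_c": obs["c"],
    }


def predict(noise, do=None):
    """Action + prediction: recompute attributes in topological order,
    overriding any attribute that appears in the intervention `do`."""
    do = do or {}
    a = do.get("a", noise["u_a"])
    b = do.get("b", 2.0 * a + noise["u_b"])
    c = do.get("c", noise["u_c"])
    return {"a": a, "b": b, "c": c}


def counterfactual(obs, do):
    """Counterfactual attributes for `obs` under the intervention `do`."""
    return predict(abduct(obs), do)


def flip_rate(classifier, observations, do):
    """Fraction of inputs whose predicted label changes under the counterfactual."""
    flips = sum(
        classifier(x) != classifier(counterfactual(x, do)) for x in observations
    )
    return flips / len(observations)


if __name__ == "__main__":
    obs = {"a": 1.0, "b": 2.5, "c": -0.3}
    # Intervening on a changes b (its descendant) but leaves c untouched.
    print(counterfactual(obs, {"a": 3.0}))

    # A hypothetical classifier that depends only on b.
    clf = lambda x: x["b"] > 4.0
    data = [obs, {"a": 3.0, "b": 7.0, "c": 0.0}]
    print(flip_rate(clf, data, {"a": 3.0}))
```

In this sketch a nonzero flip rate under an intervention on `a` would indicate that the classifier's output depends on `a`, either directly or through its descendant `b`. For images, the abduction and prediction steps would be carried out by learned encoder and generator networks rather than by closed-form equations.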


research
01/10/2022

Learning Fair Node Representations with Graph Counterfactual Fairness

Fair machine learning aims to mitigate the biases of model predictions a...
research
10/03/2021

Enhancing Model Robustness and Fairness with Causality: A Regularization Approach

Recent work has raised concerns on the risk of spurious correlations and...
research
12/06/2019

Preserving Causal Constraints in Counterfactual Explanations for Machine Learning Classifiers

Explaining the output of a complex machine learning (ML) model often req...
research
02/14/2023

A Friendly Face: Do Text-to-Image Systems Rely on Stereotypes when the Input is Under-Specified?

As text-to-image systems continue to grow in popularity with the general...
research
06/15/2020

Causal Inference with Deep Causal Graphs

Parametric causal modelling techniques rarely provide functionality for ...
research
11/15/2019

Fair Data Adaptation with Quantile Preservation

Fairness of classification and regression has received much attention re...
research
05/10/2022

Towards Intersectionality in Machine Learning: Including More Identities, Handling Underrepresentation, and Performing Evaluation

Research in machine learning fairness has historically considered a sing...
