On the Direction of Discrimination: An Information-Theoretic Analysis of Disparate Impact in Machine Learning

01/16/2018
by   Hao Wang, et al.
0

In the context of machine learning, disparate impact refers to a form of systematic discrimination whereby the output distribution of a model depends on the value of a sensitive attribute (e.g., race or gender). In this paper, we present an information-theoretic framework to analyze the disparate impact of a binary classification model. We view the model as a fixed channel, and quantify disparate impact as the divergence in output distributions over two groups. We then aim to find a correction function that can be used to perturb the input distributions of each group in order to align their output distributions. We present an optimization problem that can be solved to obtain a correction function that will make the output distributions statistically indistinguishable. We derive closed-form expression for the correction function that can be used to compute it efficiently. We illustrate the use of the correction function for a recidivism prediction application derived from the ProPublica COMPAS dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/21/2021

A note on some information-theoretic divergences between Zeta distributions

In this short communication, we first report a closed-form formula for c...
research
01/13/2018

Fairness in Supervised Learning: An Information Theoretic Approach

Automated decision making systems are increasingly being used in real-wo...
research
02/12/2020

To Split or Not to Split: The Impact of Disparate Treatment in Classification

Disparate treatment occurs when a machine learning model produces differ...
research
06/01/2021

Information Theoretic Measures for Fairness-aware Feature Selection

Machine learning algorithms are increasingly used for consequential deci...
research
01/29/2019

Repairing without Retraining: Avoiding Disparate Impact with Counterfactual Distributions

When the average performance of a prediction model varies significantly ...
research
02/13/2018

The Birthday Problem and Zero-Error List Codes

As an attempt to bridge the gap between classical information theory and...
research
04/25/2018

Characterizing Information Propagation in Plants

This paper considers an electro-chemical based communication model for i...

Please sign up or login with your details

Forgot password? Click here to reset