Why Is My Classifier Discriminatory?

05/30/2018
by   Irene Chen, et al.
0

Recent attempts to achieve fairness in predictive models focus on the balance between fairness and accuracy. In sensitive applications such as healthcare or criminal justice, this trade-off is often undesirable as any increase in prediction error could have devastating consequences. In this work, we argue that the fairness of predictions should be evaluated in context of the data, and that unfairness induced by inadequate samples sizes or unmeasured predictive variables should be addressed through data collection, rather than by constraining the model. We decompose cost-based metrics of discrimination into bias, variance, and noise, and propose actions aimed at estimating and reducing each term. Finally, we perform case-studies on prediction of income, mortality, and review ratings, confirming the value of this analysis. We find that data collection is often a means to reduce discrimination without sacrificing accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/15/2022

More Data Can Lead Us Astray: Active Data Acquisition in the Presence of Label Bias

An increased awareness concerning risks of algorithmic bias has driven a...
research
01/02/2019

Cost-sensitive Selection of Variables by Ensemble of Model Sequences

Many applications require the collection of data on different variables ...
research
11/16/2019

Fairness With Minimal Harm: A Pareto-Optimal Approach For Healthcare

Common fairness definitions in machine learning focus on balancing notio...
research
05/18/2023

Prevention is better than cure: a case study of the abnormalities detection in the chest

Prevention is better than cure. This old truth applies not only to the p...
research
10/24/2019

Fairness Sample Complexity and the Case for Human Intervention

With the aim of building machine learning systems that incorporate stand...
research
05/21/2015

On the relation between accuracy and fairness in binary classification

Our study revisits the problem of accuracy-fairness tradeoff in binary c...
research
06/13/2022

A Bayesian Model to Estimate Abundance Based on Scarce Animal Vestige Data

We propose a modelling framework which allows for the estimation of abun...

Please sign up or login with your details

Forgot password? Click here to reset