Towards a Measure of Individual Fairness for Deep Learning

09/28/2020
by   Krystal Maughan, et al.
0

Deep learning has produced big advances in artificial intelligence, but trained neural networks often reflect and amplify bias in their training data, and thus produce unfair predictions. We propose a novel measure of individual fairness, called prediction sensitivity, that approximates the extent to which a particular prediction is dependent on a protected attribute. We show how to compute prediction sensitivity using standard automatic differentiation capabilities present in modern deep learning frameworks, and present preliminary empirical results suggesting that prediction sensitivity may be effective for measuring bias in individual predictions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/30/2020

Towards Auditability for Fairness in Deep Learning

Group fairness metrics can detect when a deep learning model behaves dif...
research
03/16/2022

Measuring Fairness of Text Classifiers via Prediction Sensitivity

With the rapid growth in language processing applications, fairness has ...
research
03/15/2022

Distraction is All You Need for Fairness

With the recent growth in artificial intelligence models and its expandi...
research
02/09/2022

Prediction Sensitivity: Continual Audit of Counterfactual Fairness in Deployed Classifiers

As AI-based systems increasingly impact many areas of our lives, auditin...
research
11/03/2022

Can Querying for Bias Leak Protected Attributes? Achieving Privacy With Smooth Sensitivity

Existing regulations prohibit model developers from accessing protected ...
research
07/17/2023

Certifying the Fairness of KNN in the Presence of Dataset Bias

We propose a method for certifying the fairness of the classification re...
research
09/16/2022

On the Relation between Sensitivity and Accuracy in In-context Learning

In-context learning (ICL) suffers from oversensitivity to the prompt, wh...

Please sign up or login with your details

Forgot password? Click here to reset