Abstracting Fairness: Oracles, Metrics, and Interpretability

04/04/2020
by   Cynthia Dwork, et al.
1

It is well understood that classification algorithms, for example, for deciding on loan applications, cannot be evaluated for fairness without taking context into account. We examine what can be learned from a fairness oracle equipped with an underlying understanding of “true” fairness. The oracle takes as input a (context, classifier) pair satisfying an arbitrary fairness definition, and accepts or rejects the pair according to whether the classifier satisfies the underlying fairness truth. Our principal conceptual result is an extraction procedure that learns the underlying truth; moreover, the procedure can learn an approximation to this truth given access to a weak form of the oracle. Since every “truly fair” classifier induces a coarse metric, in which those receiving the same decision are at distance zero from one another and those receiving different decisions are at distance one, this extraction process provides the basis for ensuring a rough form of metric fairness, also known as individual fairness. Our principal technical result is a higher fidelity extractor under a mild technical constraint on the weak oracle's conception of fairness. Our framework permits the scenario in which many classifiers, with differing outcomes, may all be considered fair. Our results have implications for interpretablity – a highly desired but poorly defined property of classification systems that endeavors to permit a human arbiter to reject classifiers deemed to be “unfair” or illegitimately derived.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/14/2017

Preventing Fairness Gerrymandering: Auditing and Learning for Subgroup Fairness

The most prevalent notions of fairness in machine learning are statistic...
research
05/20/2022

CertiFair: A Framework for Certified Global Fairness of Neural Networks

We consider the problem of whether a Neural Network (NN) model satisfies...
research
03/08/2018

Probably Approximately Metric-Fair Learning

We study fairness in machine learning. A learning algorithm, given a tra...
research
02/08/2021

Learning to Generate Fair Clusters from Demonstrations

Fair clustering is the process of grouping similar entities together, wh...
research
06/21/2020

Verifying Individual Fairness in Machine Learning Models

We consider the problem of whether a given decision model, working with ...
research
09/23/2018

Envy-Free Classification

In classic fair division problems such as cake cutting and rent division...
research
07/22/2019

A Conceptual Framework for Evaluating Fairness in Search

While search efficacy has been evaluated traditionally on the basis of r...

Please sign up or login with your details

Forgot password? Click here to reset