Measuring and signing fairness as performance under multiple stakeholder distributions

07/20/2022
by David Lopez-Paz, et al.

As learning machines increase their influence on decisions concerning human lives, analyzing their fairness properties becomes a subject of central importance. Yet our best tools for measuring the fairness of learning systems are rigid fairness metrics encapsulated as mathematical one-liners. These metrics offer limited power to the stakeholders involved in the prediction task, and they are easy to manipulate when we exert excessive pressure to optimize them. To make progress on these issues, we propose to shift focus from shaping fairness metrics to curating the distributions of examples under which these are computed. In particular, we posit that every claim about fairness should be immediately followed by the tagline "Fair under what examples, and collected by whom?". By highlighting connections to the literature in domain generalization, we propose to measure fairness as the ability of the system to generalize under multiple stress tests: distributions of examples with social relevance. We encourage each stakeholder to curate one or multiple stress tests containing examples reflecting their (possibly conflicting) interests. The machine passes or fails each stress test by falling short of or exceeding a pre-defined metric value. The test results involve all stakeholders in a discussion about how to improve the learning system, and provide flexible assessments of fairness dependent on context and based on interpretable data. We provide full implementation guidelines for stress testing, illustrate both the benefits and shortcomings of this framework, and introduce a cryptographic scheme to enable a degree of prediction accountability from system providers.
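The pass-or-fail logic described above can be illustrated with a minimal sketch. It is not the authors' implementation: the `StressTest` class, the `run_stress_tests` function, the (features, label) example format, and the choice of accuracy with a minimum threshold are all assumptions made for illustration. For an error-type metric, the comparison would flip.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical example format: (feature vector, integer label).
Example = Tuple[Sequence[float], int]
Model = Callable[[Sequence[float]], int]


@dataclass
class StressTest:
    """A stakeholder-curated set of examples with a pre-defined threshold."""
    stakeholder: str       # who collected the examples
    name: str              # what the examples are meant to probe
    examples: List[Example]
    min_accuracy: float    # assumed metric: accuracy, higher is better


def accuracy(model: Model, examples: List[Example]) -> float:
    """Fraction of examples on which the model predicts the given label."""
    if not examples:
        raise ValueError("a stress test needs at least one example")
    correct = sum(1 for x, y in examples if model(x) == y)
    return correct / len(examples)


def run_stress_tests(
    model: Model, tests: List[StressTest]
) -> Dict[Tuple[str, str], Tuple[float, bool]]:
    """Return {(stakeholder, test name): (score, passed)} for every test."""
    report = {}
    for test in tests:
        score = accuracy(model, test.examples)
        report[(test.stakeholder, test.name)] = (score, score >= test.min_accuracy)
    return report


if __name__ == "__main__":
    # Toy model and toy data, invented purely to exercise the code.
    def toy_model(x: Sequence[float]) -> int:
        return int(x[0] > 0.5)

    tests = [
        StressTest("applicants", "borderline cases",
                   [([0.9], 1), ([0.1], 0), ([0.7], 1), ([0.2], 1)], 0.9),
        StressTest("provider", "clear cases",
                   [([0.9], 1), ([0.1], 0)], 0.9),
    ]
    for key, (score, passed) in run_stress_tests(toy_model, tests).items():
        print(key, f"accuracy={score:.2f}", "PASS" if passed else "FAIL")
```

Keeping one report entry per (stakeholder, test) pair, rather than averaging into a single number, reflects the idea that each claim about fairness stays tied to the examples it was measured on and to who collected them.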

