Costs and Benefits of Wasserstein Fair Regression

06/16/2021
by   Han Zhao, et al.
0

Real-world applications of machine learning tools in high-stakes domains are often regulated to be fair, in the sense that the predicted target should satisfy some quantitative notion of parity with respect to a protected attribute. However, the exact tradeoff between fairness and accuracy with a real-valued target is not clear. In this paper, we characterize the inherent tradeoff between statistical parity and accuracy in the regression setting by providing a lower bound on the error of any fair regressor. Our lower bound is sharp, algorithm-independent, and admits a simple interpretation: when the moments of the target differ between groups, any fair algorithm has to make a large error on at least one of the groups. We further extend this result to give a lower bound on the joint error of any (approximately) fair algorithm, using the Wasserstein distance to measure the quality of the approximation. On the upside, we establish the first connection between individual fairness, accuracy parity, and the Wasserstein distance by showing that if a regressor is individually fair, it also approximately verifies the accuracy parity, where the gap is given by the Wasserstein distance between the two groups. Inspired by our theoretical results, we develop a practical algorithm for fair regression through the lens of representation learning, and conduct experiments on a real-world dataset to corroborate our findings.

READ FULL TEXT
research
06/19/2019

Inherent Tradeoffs in Learning Fair Representation

With the prevalence of machine learning in high-stakes applications, esp...
research
06/12/2020

Fair Regression with Wasserstein Barycenters

We study the problem of learning a real-valued function that satisfies t...
research
05/30/2022

Metrizing Fairness

We study supervised learning problems for predicting properties of indiv...
research
05/30/2019

Fair Regression: Quantitative Definitions and Reduction-based Algorithms

In this paper, we study the prediction of a real-valued target, such as ...
research
09/01/2022

Fair learning with Wasserstein barycenters for non-decomposable performance measures

This work provides several fundamental characterizations of the optimal ...
research
12/19/2020

Fundamental Limits and Tradeoffs in Invariant Representation Learning

Many machine learning applications involve learning representations that...
research
09/07/2023

Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck

This work investigates the nuanced algorithm design choices for deep lea...

Please sign up or login with your details

Forgot password? Click here to reset