Through the Data Management Lens: Experimental Analysis and Evaluation of Fair Classification

01/18/2021
by   Maliha Tashfia Islam, et al.
0

Classification, a heavily-studied data-driven machine learning task, drives an increasing number of prediction systems involving critical human decisions such as loan approval and criminal risk assessment. However, classifiers often demonstrate discriminatory behavior, especially when presented with biased data. Consequently, fairness in classification has emerged as a high-priority research area. Data management research is showing an increasing presence and interest in topics related to data and algorithmic fairness, including the topic of fair classification. The interdisciplinary efforts in fair classification, with machine learning research having the largest presence, have resulted in a large number of fairness notions and a wide range of approaches that have not been systematically evaluated and compared. In this paper, we contribute a broad analysis of 13 fair classification approaches and additional variants, over their correctness, fairness, efficiency, scalability, and stability, using a variety of metrics and real-world datasets. Our analysis highlights novel insights on the impact of different metrics and high-level approach characteristics on different aspects of performance. We also discuss general principles for choosing approaches suitable for different practical settings, and identify areas where data-management-centric solutions are likely to have the most impact.

READ FULL TEXT
research
07/05/2022

Developing a Philosophical Framework for Fair Machine Learning: The Case of Algorithmic Collusion and Market Fairness

Fair machine learning research has been primarily concerned with classif...
research
10/01/2021

A survey on datasets for fairness-aware machine learning

As decision-making increasingly relies on machine learning and (big) dat...
research
06/24/2020

Fairness with Overlapping Groups

In algorithmically fair prediction problems, a standard goal is to ensur...
research
06/01/2021

Fair Clustering Using Antidote Data

Clustering algorithms are widely utilized for many modern data science a...
research
06/07/2018

Residual Unfairness in Fair Machine Learning from Prejudiced Data

Recent work in fairness in machine learning has proposed adjusting for f...
research
02/24/2022

Attainability and Optimality: The Equalized Odds Fairness Revisited

Fairness of machine learning algorithms has been of increasing interest....
research
01/08/2020

Algorithmic Fairness from a Non-ideal Perspective

Inspired by recent breakthroughs in predictive modeling, practitioners i...

Please sign up or login with your details

Forgot password? Click here to reset