A survey on datasets for fairness-aware machine learning

10/01/2021
by   Tai Le Quy, et al.

As decision-making increasingly relies on machine learning and (big) data, the issue of fairness in data-driven AI systems is receiving growing attention from both research and industry. A large variety of fairness-aware machine learning solutions have been proposed, involving fairness-related interventions in the data, the learning algorithms, and/or the model outputs. However, a vital part of proposing new approaches is evaluating them empirically on benchmark datasets that represent realistic and diverse settings. Therefore, in this paper, we survey real-world datasets used for fairness-aware machine learning. We focus on tabular data as the most common data representation for fairness-aware machine learning. We start our analysis by identifying relationships among the different attributes, particularly with respect to protected attributes and class attributes, using a Bayesian network. For a deeper understanding of bias and fairness in the datasets, we then investigate the interesting relationships using exploratory analysis.
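As a rough illustration of the kind of exploratory analysis the abstract mentions, the sketch below compares the positive-class rate across the values of a protected attribute in a small tabular dataset. The rows, the attribute names `sex` and `income_high`, and the helper `positive_rate_by_group` are all hypothetical and invented for this example; they are not taken from the paper or from any of the surveyed datasets, and this is not the authors' method.

```python
from collections import defaultdict

# Toy rows, invented for illustration only (not from any surveyed dataset).
# "sex" plays the role of a protected attribute, "income_high" the class attribute.
rows = [
    {"sex": "female", "income_high": 1},
    {"sex": "female", "income_high": 0},
    {"sex": "female", "income_high": 0},
    {"sex": "female", "income_high": 0},
    {"sex": "male", "income_high": 1},
    {"sex": "male", "income_high": 1},
    {"sex": "male", "income_high": 0},
    {"sex": "male", "income_high": 0},
]


def positive_rate_by_group(rows, protected, label):
    """Return {group value: fraction of rows in that group with label == 1}."""
    counts = defaultdict(lambda: [0, 0])  # group -> [positives, total]
    for row in rows:
        counts[row[protected]][0] += row[label]
        counts[row[protected]][1] += 1
    return {group: pos / total for group, (pos, total) in counts.items()}


rates = positive_rate_by_group(rows, "sex", "income_high")

# Difference in positive-class rates between the two groups;
# a value far from 0 hints at an imbalance worth a closer look.
gap = rates["male"] - rates["female"]
print(rates, gap)
```

A gap close to zero would suggest that the class attribute is distributed similarly across the groups in the raw data; a large gap is only a starting point for further inspection, not evidence of a specific cause.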


