Fairness without Imputation: A Decision Tree Approach for Fair Prediction with Missing Values

09/21/2021
by   Haewon Jeong, et al.
12

We investigate the fairness concerns of training a machine learning model using data with missing values. Even though there are a number of fairness intervention methods in the literature, most of them require a complete training set as input. In practice, data can have missing values, and data missing patterns can depend on group attributes (e.g. gender or race). Simply applying off-the-shelf fair learning algorithms to an imputed dataset may lead to an unfair model. In this paper, we first theoretically analyze different sources of discrimination risks when training with an imputed dataset. Then, we propose an integrated approach based on decision trees that does not require a separate process of imputation and learning. Instead, we train a tree with missing incorporated as attribute (MIA), which does not require explicit imputation, and we optimize a fairness-regularized objective function. We demonstrate that our approach outperforms existing fairness intervention methods applied to an imputed dataset, through several experiments on real-world datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/30/2023

Adapting Fairness Interventions to Missing Values

Missing values in real-world data pose a significant and unique challeng...
research
12/21/2017

Fair Forests: Regularized Tree Induction to Minimize Model Bias

The potential lack of fairness in the outputs of machine learning algori...
research
02/19/2019

On the consistency of supervised learning with missing values

In many application settings, the data are plagued with missing features...
research
05/10/2022

Explainable Data Imputation using Constraints

Data values in a dataset can be missing or anomalous due to mishandling ...
research
12/13/2022

Fair Infinitesimal Jackknife: Mitigating the Influence of Biased Training Data Points Without Refitting

In consequential decision-making applications, mitigating unwanted biase...
research
12/04/2020

Machine learning with incomplete datasets using multi-objective optimization models

Machine learning techniques have been developed to learn from complete d...
research
10/16/2017

Fair Kernel Learning

New social and economic activities massively exploit big data and machin...

Please sign up or login with your details

Forgot password? Click here to reset