Log In Sign Up

Impact of Data Processing on Fairness in Supervised Learning

by   Sajad Khodadadian, et al.

We study the impact of pre and post processing for reducing discrimination in data-driven decision makers. We first analyze the fundamental trade-off between fairness and accuracy in a pre-processing approach, and propose a design for a pre-processing module based on a convex optimization program, which can be added before the original classifier. This leads to a fundamental lower bound on attainable discrimination, given any acceptable distortion in the outcome. Furthermore, we reformulate an existing post-processing method in terms of our accuracy and fairness measures, which allows comparing post-processing and pre-processing approaches. We show that under some mild conditions, pre-processing outperforms post-processing. Finally, we show that by appropriate choice of the discrimination measure, the optimization problem for both pre and post processing approaches will reduce to a linear program and hence can be solved efficiently.


page 1

page 2

page 3

page 4


The Influence of Data Pre-processing and Post-processing on Long Document Summarization

Long document summarization is an important and hard task in the field o...

Optimized Data Pre-Processing for Discrimination Prevention

Non-discrimination is a recognized objective in algorithmic decision mak...

Adaptive Fairness Improvement Based on Causality Analysis

Given a discriminating neural network, the problem of fairness improveme...

Fair Data Representation for Machine Learning at the Pareto Frontier

As machine learning powered decision making is playing an increasingly i...

Classification with abstention but without disparities

Classification with abstention has gained a lot of attention in recent y...

fairlib: A Unified Framework for Assessing and Improving Classification Fairness

This paper presents fairlib, an open-source framework for assessing and ...

Post-Processing Temporal Action Detection

Existing Temporal Action Detection (TAD) methods typically take a pre-pr...