Principal Fairness: Removing Bias via Projections

05/31/2019
by Aris Anagnostopoulos, et al.

Reducing hidden bias in the data and ensuring fairness in algorithmic data analysis have recently received significant attention. We complement several recent papers in this line of research by introducing a general method to reduce bias in the data through random projections in a "fair" subspace. We apply this method to densest subgraph and k-means. For densest subgraph, our approach based on fair projections allows us to recover, both theoretically and empirically, an almost optimal, fair, dense subgraph hidden in the input data. We also show that, under the small set expansion hypothesis, approximating this problem beyond a factor of 2 is NP-hard, and we give a polynomial-time algorithm with a matching approximation bound. We further apply our method to k-means. In a previous paper, Chierichetti et al. [NIPS 2017] showed that problems such as k-means can be approximated up to a constant factor while ensuring that neither of two protected classes (e.g., gender, ethnicity) is disparately impacted. We show that fair projections generalize the concept of fairlets introduced by Chierichetti et al. to any number of protected attributes and empirically improve the quality of the resulting clustering. We also present the first constant-factor approximation for an arbitrary number of protected attributes, thus settling an open problem recently addressed in several works.
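
To make the two-class fairness requirement concrete, here is a minimal sketch of the balance measure used in the fairlets work of Chierichetti et al., which the abstract builds on: the balance of a cluster is the smaller of the two colour ratios, and the balance of a clustering is the minimum over its clusters. It does not implement the fair projections of this paper, which the abstract does not specify. The function names, the "red"/"blue" labels, and the toy clusterings are assumptions chosen for illustration only.

```python
from collections import Counter


def cluster_balance(colors):
    """Balance of one cluster: min(#red/#blue, #blue/#red), or 0 if a colour is absent."""
    counts = Counter(colors)
    red, blue = counts.get("red", 0), counts.get("blue", 0)
    if red == 0 or blue == 0:
        return 0.0  # a single-colour (or empty) cluster is maximally unbalanced
    return min(red / blue, blue / red)


def clustering_balance(clusters):
    """Balance of a clustering: the minimum balance over all of its clusters."""
    return min(cluster_balance(c) for c in clusters)


# Hypothetical toy data: each inner list holds the protected colour of each point.
fair = [["red", "blue"], ["red", "blue", "red", "blue"]]
skewed = [["red", "red", "red", "blue"], ["blue", "blue"]]

print(clustering_balance(fair))
print(clustering_balance(skewed))
```

A balance of 1 means every cluster contains both colours in equal numbers, while a balance of 0 means at least one cluster contains a single colour only.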

research
11/26/2018

On the cost of essentially fair clusterings

Clustering is a fundamental tool in data mining. It partitions points in...
research
03/28/2023

Algorithms for subgraph complementation to some classes of graphs

For a class 𝒢 of graphs, the objective of Subgraph Complementation to 𝒢 ...
research
12/28/2018

Fair Coresets and Streaming Algorithms for Fair k-Means Clustering

We study fair clustering problems as proposed by Chierichetti et al. Her...
research
02/24/2023

Intersectional Fairness: A Fractal Approach

The issue of fairness in AI has received an increasing amount of attenti...
research
06/09/2022

Improved Approximation for Fair Correlation Clustering

Correlation clustering is a ubiquitous paradigm in unsupervised machine ...
research
02/15/2018

Fair Clustering Through Fairlets

We study the question of fair clustering under the disparate impact doc...
research
02/23/2020

Fair Adversarial Networks

The influence of human judgement is ubiquitous in datasets used across t...
