Fair Clustering Under a Bounded Cost

06/14/2021
by Seyed A. Esmaeili, et al.

Clustering is a fundamental unsupervised learning problem in which a dataset is partitioned into clusters of nearby points in a metric space. A recent variant, fair clustering, associates a color with each point to represent its group membership and requires that each color have (approximately) equal representation in each cluster to satisfy group fairness. In this model, enforcing fairness increases the cost of the clustering objective. The relative increase in cost, the "price of fairness," can indeed be unbounded. Therefore, in this paper we propose to treat an upper bound on the clustering objective as a constraint on the clustering problem, and to maximize equality of representation subject to it. We consider two fairness objectives, the group utilitarian objective and the group egalitarian objective, as well as the group leximin objective, which generalizes the group egalitarian objective. We derive fundamental lower bounds on the approximation of the utilitarian and egalitarian objectives and introduce algorithms with provable guarantees for them. For the leximin objective we introduce an effective heuristic algorithm. We further derive impossibility results for other natural fairness objectives. We conclude with experimental results on real-world datasets that demonstrate the validity of our algorithms.
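
To make the quantities in the abstract concrete, the following is a minimal sketch, not the authors' algorithm. It computes a k-median-style clustering cost, the per-cluster color proportions, and the ratio between the cost of a color-balanced assignment and an unconstrained one on a toy dataset. All function names and the toy data are hypothetical. The per-color deviation and its sum and max aggregates are illustrative assumptions about how representation gaps might be summarized, and should not be read as the paper's formal utilitarian or egalitarian definitions.

```python
from collections import Counter
from math import dist


def clustering_cost(points, labels, centers):
    """Sum of distances from each point to its assigned center (k-median style)."""
    return sum(dist(p, centers[c]) for p, c in zip(points, labels))


def color_proportions(labels, colors):
    """Map each cluster to the fraction of its points that have each color."""
    sizes = Counter(labels)
    counts = Counter(zip(labels, colors))
    palette = set(colors)
    return {c: {h: counts[(c, h)] / sizes[c] for h in palette} for c in sizes}


def max_deviation_per_color(labels, colors):
    """For each color, the largest gap between its share in any cluster
    and its share in the whole dataset (an illustrative measure only)."""
    n = len(colors)
    overall = {h: k / n for h, k in Counter(colors).items()}
    props = color_proportions(labels, colors)
    return {h: max(abs(props[c][h] - overall[h]) for c in props) for h in overall}


# Hypothetical toy data: two tight groups that are far apart, one color each.
points = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
colors = ["red", "red", "blue", "blue"]
centers = [(0.0, 0.5), (10.0, 0.5)]

unfair_labels = [0, 0, 1, 1]  # nearest-center assignment, one color per cluster
fair_labels = [0, 1, 0, 1]    # one red and one blue point in each cluster

unfair_cost = clustering_cost(points, unfair_labels, centers)
fair_cost = clustering_cost(points, fair_labels, centers)
cost_ratio = fair_cost / unfair_cost  # relative cost increase from balancing

unfair_dev = max_deviation_per_color(unfair_labels, colors)
fair_dev = max_deviation_per_color(fair_labels, colors)

# Two possible aggregates over colors (assumed here for illustration).
sum_over_colors = sum(unfair_dev.values())
max_over_colors = max(unfair_dev.values())

print("cost ratio:", cost_ratio)
print("unfair deviations:", unfair_dev)
print("fair deviations:", fair_dev)
```

Moving the two groups farther apart makes the ratio grow without limit while the balanced assignment stays the same, which is the intuition behind bounding the cost and maximizing representation within that bound.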


research
05/28/2022

Fair Labeled Clustering

Numerous algorithms have been produced for the fundamental problem of cl...
research
06/19/2020

Probabilistic Fair Clustering

In clustering problems, a central decision-maker is given a complete met...
research
05/29/2019

Clustering without Over-Representation

In this paper we consider clustering problems in which each point is end...
research
06/19/2019

Clustering with Fairness Constraints: A Flexible and Scalable Approach

This study investigates a general variational formulation of fair cluste...
research
01/25/2023

Group fairness in dynamic refugee assignment

Ensuring that refugees and asylum seekers thrive (e.g., find employment)...
research
05/28/2021

Deep Fair Discriminative Clustering

Deep clustering has the potential to learn a strong representation and h...
research
10/11/2020

Representativity Fairness in Clustering

Incorporating fairness constructs into machine learning algorithms is a ...
