Fair Labeled Clustering

05/28/2022
by   Seyed A. Esmaeili, et al.
7

Numerous algorithms have been produced for the fundamental problem of clustering under many different notions of fairness. Perhaps the most common family of notions currently studied is group fairness, in which proportional group representation is ensured in every cluster. We extend this direction by considering the downstream application of clustering and how group fairness should be ensured for such a setting. Specifically, we consider a common setting in which a decision-maker runs a clustering algorithm, inspects the center of each cluster, and decides an appropriate outcome (label) for its corresponding cluster. In hiring for example, there could be two outcomes, positive (hire) or negative (reject), and each cluster would be assigned one of these two outcomes. To ensure group fairness in such a setting, we would desire proportional group representation in every label but not necessarily in every cluster as is done in group fair clustering. We provide algorithms for such problems and show that in contrast to their NP-hard counterparts in group fair clustering, they permit efficient solutions. We also consider a well-motivated alternative setting where the decision-maker is free to assign labels to the clusters regardless of the centers' positions in the metric space. We show that this setting exhibits interesting transitions from computationally hard to easy according to additional constraints on the problem. Moreover, when the constraint parameters take on natural values we show a randomized algorithm for this setting that always achieves an optimal clustering and satisfies the fairness constraints in expectation. Finally, we run experiments on real world datasets that validate the effectiveness of our algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/31/2023

Doubly Constrained Fair Clustering

The remarkable attention which fair clustering has received in the last ...
research
06/14/2021

Fair Clustering Under a Bounded Cost

Clustering is a fundamental unsupervised learning problem where a datase...
research
02/08/2021

Learning to Generate Fair Clusters from Demonstrations

Fair clustering is the process of grouping similar entities together, wh...
research
06/08/2020

A Notion of Individual Fairness for Clustering

A common distinction in fair machine learning, in particular in fair cla...
research
05/09/2019

Proportionally Fair Clustering

We extend the fair machine learning literature by considering the proble...
research
06/19/2020

Fair clustering via equitable group representations

What does it mean for a clustering to be fair? One popular approach seek...
research
06/19/2020

Probabilistic Fair Clustering

In clustering problems, a central decision-maker is given a complete met...

Please sign up or login with your details

Forgot password? Click here to reset