KFC: A Scalable Approximation Algorithm for k-center Fair Clustering

10/26/2020
by   Elfarouk Harb, et al.
0

In this paper, we study the problem of fair clustering on the k-center objective. In fair clustering, the input is N points, each belonging to at least one of l protected groups, e.g. male, female, Asian, Hispanic. The objective is to cluster the N points into k clusters to minimize a classical clustering objective function. However, there is an additional constraint that each cluster needs to be fair, under some notion of fairness. This ensures that no group is either "over-represented" or "under-represented" in any cluster. Our work builds on the work of Chierichetti et al. (NIPS 2017), Bera et al. (NeurIPS 2019), Ahmadian et al. (KDD 2019), and Bercea et al. (APPROX 2019). We obtain a randomized 3-approximation algorithm for the k-center objective function, beating the previous state of the art (4-approximation). We test our algorithm on real datasets, and show that our algorithm is effective in finding good clusters without over-representation or under-representation, surpassing the current state of the art in runtime speed, clustering cost, while achieving similar fairness violations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/26/2021

Improved Approximation Algorithms for Individually Fair Clustering

We consider the k-clustering problem with ℓ_p-norm cost, which includes ...
research
01/24/2019

Fair k-Center Clustering for Data Summarization

In data summarization we want to choose k prototypes in order to summari...
research
06/09/2022

Improved Approximation for Fair Correlation Clustering

Correlation clustering is a ubiquitous paradigm in unsupervised machine ...
research
09/02/2021

Efficient Algorithms For Fair Clustering with a New Fairness Notion

We revisit the problem of fair clustering, first introduced by Chieriche...
research
05/27/2022

Generalized Reductions: Making any Hierarchical Clustering Fair and Balanced with Low Cost

Clustering is a fundamental building block of modern statistical analysi...
research
05/29/2019

Clustering without Over-Representation

In this paper we consider clustering problems in which each point is end...
research
02/10/2019

Scalable Fair Clustering

We study the fair variant of the classic k-median problem introduced by ...

Please sign up or login with your details

Forgot password? Click here to reset