Fair k-Center Clustering for Data Summarization

01/24/2019
by   Matthäus Kleindessner, et al.

In data summarization, we want to choose k prototypes in order to summarize a data set. We study a setting where the data set comprises several demographic groups and we are restricted to choosing k_i prototypes belonging to group i. A common approach to the problem without the fairness constraint is to optimize a centroid-based clustering objective such as k-center. A natural extension, then, is to incorporate the fairness constraint into the clustering objective. Existing algorithms for doing so run in time super-quadratic in the size of the data set. This is in contrast to the standard k-center objective, which can be approximately optimized in linear time. In this paper, we resolve this gap by providing a simple approximation algorithm for the k-center problem under the fairness constraint with running time linear in the size of the data set and k. If the number of demographic groups is small, the approximation guarantee of our algorithm only incurs a constant-factor overhead. We demonstrate the applicability of our algorithm on both synthetic and real data sets.
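To make the setting concrete, here is a minimal Python sketch of the unconstrained baseline the abstract refers to: the classic farthest-first traversal for k-center, which uses O(nk) distance evaluations and is known to give a 2-approximation. It is not the paper's fair algorithm. The helper names (greedy_k_center, k_center_cost, satisfies_quota) and the toy data are assumptions made for illustration only. The last helper merely checks whether a chosen set of prototypes meets per-group quotas k_i.

```python
import math
from collections import Counter


def greedy_k_center(points, k, first=0):
    """Farthest-first traversal for unconstrained k-center.

    Returns indices of k chosen centers. Illustrative baseline only;
    it ignores the group quotas entirely.
    """
    centers = [first]
    # nearest[j] = distance from point j to its closest chosen center
    nearest = [math.dist(p, points[first]) for p in points]
    while len(centers) < k:
        # Pick the point that is currently worst served.
        nxt = max(range(len(points)), key=nearest.__getitem__)
        centers.append(nxt)
        for j, p in enumerate(points):
            nearest[j] = min(nearest[j], math.dist(p, points[nxt]))
    return centers


def k_center_cost(points, centers):
    """k-center objective: largest distance from a point to its nearest center."""
    return max(min(math.dist(p, points[c]) for c in centers) for p in points)


def satisfies_quota(centers, groups, quota):
    """True if exactly quota[g] centers come from each group g (hypothetical helper)."""
    counts = Counter(groups[c] for c in centers)
    return (len(centers) == sum(quota.values())
            and all(counts.get(g, 0) == need for g, need in quota.items()))


# Toy data (made up for this sketch): two groups on a line.
points = [(0, 0), (1, 0), (10, 0), (11, 0)]
groups = ["a", "a", "b", "b"]

centers = greedy_k_center(points, k=2)
cost = k_center_cost(points, centers)
fair = satisfies_quota(centers, groups, {"a": 1, "b": 1})
```

On this toy input the greedy choice happens to meet the quota of one prototype per group, but in general the unconstrained solution can violate it, which is the gap the fairness-constrained variant addresses.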


