
Diversity-aware k-median: Clustering with fair center representation

06/22/2021
by Suhas Thejaswi, et al.

We introduce a novel problem for diversity-aware clustering. We assume that the potential cluster centers belong to a set of groups defined by protected attributes, such as ethnicity or gender. We then seek a minimum-cost clustering of the data into k clusters such that a specified minimum number of cluster centers is chosen from each group. We thus require that all groups are represented among the cluster centers of the solution, according to the specified requirements. More precisely, we are given a set of clients C, a set of facilities F, a collection ℱ = {F_1, …, F_t} of facility groups F_i ⊆ F, a budget k, and a set of lower-bound thresholds R = {r_1, …, r_t}, one for each group in ℱ. The diversity-aware k-median problem asks for a set S of k facilities in F such that |S ∩ F_i| ≥ r_i for every i, that is, at least r_i centers in S are from group F_i, and such that the k-median cost ∑_{c ∈ C} min_{s ∈ S} d(c,s) is minimized. We show that in the general case, where the facility groups may overlap, the diversity-aware k-median problem is NP-hard, fixed-parameter intractable, and inapproximable to within any multiplicative factor. On the other hand, when the facility groups are disjoint, approximation algorithms can be obtained by reduction to the matroid median and red-blue median problems. Experimentally, we evaluate our approximation methods for the tractable cases, and present a relaxation-based heuristic for the theoretically intractable case, which can provide high-quality and efficient solutions for real-world datasets.
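
To make the problem definition concrete, the following is a minimal brute-force sketch of the objective and the group constraints. It is not one of the paper's algorithms: it enumerates every size-k subset of F, so it is only usable on toy instances. All function names and the small one-dimensional instance are assumptions made for illustration.

```python
from itertools import combinations
from math import inf


def kmedian_cost(clients, centers, dist):
    """k-median cost: each client pays the distance to its nearest center."""
    return sum(min(dist(c, s) for s in centers) for c in clients)


def is_diverse(centers, groups, thresholds):
    """Check |S ∩ F_i| >= r_i for every facility group F_i."""
    chosen = set(centers)
    return all(len(chosen & g) >= r for g, r in zip(groups, thresholds))


def diversity_aware_kmedian_bruteforce(clients, facilities, groups,
                                       thresholds, k, dist):
    """Exhaustive search over all k-subsets of facilities (toy sizes only).

    Returns (best_set, best_cost), or (None, inf) if no subset of size k
    satisfies all lower-bound thresholds.
    """
    best_set, best_cost = None, inf
    for candidate in combinations(facilities, k):
        if not is_diverse(candidate, groups, thresholds):
            continue
        cost = kmedian_cost(clients, candidate, dist)
        if cost < best_cost:
            best_set, best_cost = candidate, cost
    return best_set, best_cost


# Hypothetical toy instance: points on a line, absolute difference as d(c, s).
clients = [0, 1, 9, 10]
facilities = [0, 1, 9, 10]
groups = [{0, 1}, {9, 10}]          # F_1 and F_2 (disjoint here)
dist = lambda c, s: abs(c - s)

# No representation requirement: one center lands near each end of the line.
free_set, free_cost = diversity_aware_kmedian_bruteforce(
    clients, facilities, groups, [0, 0], 2, dist)

# Require both centers from F_1: the far clients now pay a large distance.
fair_set, fair_cost = diversity_aware_kmedian_bruteforce(
    clients, facilities, groups, [2, 0], 2, dist)
```

On this instance the unconstrained cost is 2, while requiring r_1 = 2 forces S = {0, 1} and raises the cost to 17, which shows how the lower-bound thresholds can trade clustering cost for representation. If the thresholds cannot be met with k centers (for example r_1 = 2 and r_2 = 1 with k = 2), the sketch returns no solution.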


06/12/2021

FPT Approximation for Socially Fair Clustering

In this work, we study the socially fair k-median/k-means problem. We ar...
02/03/2022

Fair Representation Clustering with Several Protected Classes

We study the problem of fair k-median where each cluster is required to ...
06/22/2022

Constant-Factor Approximation Algorithms for Socially Fair k-Clustering

We study approximation algorithms for the socially fair (ℓ_p, k)-cluster...
02/15/2018

Fair Clustering Through Fairlets

We study the question of fair clustering under the disparate impact doc...
12/30/2022

A Global Optimization Algorithm for K-Center Clustering of One Billion Samples

This paper presents a practical global optimization algorithm for the K-...
12/09/2020

Participatory Budgeting with Project Groups

We study a generalization of the standard approval-based model of partic...
06/19/2020

Fair clustering via equitable group representations

What does it mean for a clustering to be fair? One popular approach seek...