Fair k-Center: a Coreset Approach in Low Dimensions

02/20/2023
by Jinxiang Gan, et al.

Center-based clustering techniques are fundamental to areas of machine learning such as data summarization. Generic k-center algorithms can produce biased cluster representatives, so there has been recent interest in fair k-center clustering. Our main theoretical contributions are two new (3+ϵ)-approximation algorithms for the fair k-center problem in (1) the dynamic incremental, i.e., one-pass streaming, model and (2) the MapReduce model. Our dynamic incremental algorithm is the first such algorithm for this problem (previous streaming algorithms required two passes), and our MapReduce algorithm improves upon the previous approximation factor of (17+ϵ). Both algorithms work by maintaining a small coreset that represents the full point set, and their analysis requires that the underlying metric have finite doubling dimension. We also provide related heuristics for higher-dimensional data and experimental results that compare the performance of our algorithms to existing ones.
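
To make the problem setting concrete, the following is a minimal sketch, not the coreset algorithms described above. It assumes the common formulation in which each point carries a group label and each group has a cap on how many centers may be chosen from it. It runs the classical greedy farthest-first traversal (a generic, fairness-unaware k-center heuristic), computes the resulting clustering radius, and checks whether the chosen centers happen to respect the caps. All function names, the toy points, and the cap values are hypothetical and chosen for illustration only.

```python
import math
from collections import Counter


def greedy_k_center(points, k):
    """Farthest-first traversal: return indices of up to k centers.

    This is the generic (fairness-unaware) greedy heuristic, used here
    only as a baseline illustration.
    """
    centers = [0]  # start from the first point (arbitrary choice)
    # d[i] = distance from point i to its nearest chosen center
    d = [math.dist(p, points[0]) for p in points]
    while len(centers) < min(k, len(points)):
        nxt = max(range(len(points)), key=d.__getitem__)  # farthest point
        centers.append(nxt)
        d = [min(d[i], math.dist(points[i], points[nxt]))
             for i in range(len(points))]
    return centers


def radius(points, centers):
    """k-center cost: largest distance from any point to its nearest center."""
    return max(min(math.dist(p, points[c]) for c in centers) for p in points)


def is_fair(groups, centers, caps):
    """Assumed fairness notion: at most caps[g] centers from each group g."""
    used = Counter(groups[c] for c in centers)
    return all(used[g] <= caps.get(g, 0) for g in used)


# Toy data (hypothetical): four points on a line, two groups.
points = [(0, 0), (1, 0), (10, 0), (11, 0)]
groups = ["a", "a", "a", "b"]

centers = greedy_k_center(points, k=2)
cost = radius(points, centers)
fair_balanced = is_fair(groups, centers, {"a": 1, "b": 1})
fair_only_a = is_fair(groups, centers, {"a": 2, "b": 0})
```

On this toy input the greedy centers satisfy a cap of one center per group but violate a cap that allows centers from group "a" only, which is the kind of mismatch a fair k-center algorithm has to resolve while keeping the radius small.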
