Coreset-based Strategies for Robust Center-type Problems

02/18/2020
by   Andrea Pietracaprina, et al.
0

Given a dataset V of points from some metric space, the popular k-center problem requires to identify a subset of k points (centers) in V minimizing the maximum distance of any point of V from its closest center. The robust formulation of the problem features a further parameter z and allows up to z points of V (outliers) to be disregarded when computing the maximum distance from the centers. In this paper, we focus on two important constrained variants of the robust k-center problem, namely, the Robust Matroid Center (RMC) problem, where the set of returned centers are constrained to be an independent set of a matroid of rank k built on V, and the Robust Knapsack Center (RKC) problem, where each element i∈ V is given a positive weight w_i<1 and the aggregate weight of the returned centers must be at most 1. We devise coreset-based strategies for the two problems which yield efficient sequential, MapReduce, and Streaming algorithms. More specifically, for any fixed ϵ>0, the algorithms return solutions featuring a (3+ϵ)-approximation ratio, which is a mere additive term ϵ away from the 3-approximations achievable by the best known polynomial-time sequential algorithms for the two problems. Moreover, the algorithms obliviously adapt to the intrinsic complexity of the dataset, captured by its doubling dimension D. For wide ranges of the parameters k,z,ϵ, D, we obtain a sequential algorithm with running time linear in |V|, and MapReduce/Streaming algorithms with few rounds/passes and substantially sublinear local/working memory.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/16/2022

Distributed k-Means with Outliers in General Metrics

Center-based clustering is a pivotal primitive for unsupervised learning...
research
04/29/2019

Accurate MapReduce Algorithms for k-median and k-means in General Metric Spaces

Center-based clustering is a fundamental primitive for data analysis and...
research
10/27/2019

Computing a Geodesic Two-Center of Points in a Simple Polygon

Given a simple polygon P and a set Q of points contained in P, we consid...
research
05/06/2018

Generalized Center Problems with Outliers

We study the F-center problem with outliers: given a metric space (X,d),...
research
09/16/2019

Streaming PTAS for Constrained k-Means

We generalise the results of Bhattacharya et al. (Journal of Computing S...
research
02/24/2023

k-Center Clustering with Outliers in the MPC and Streaming Model

Given a point set P ⊆ X of size n in a metric space (X,dist) of doubling...
research
01/13/2022

Approximate the individually fair k-center with outliers

In this paper, we propose and investigate the individually fair k-center...

Please sign up or login with your details

Forgot password? Click here to reset