Clustering under Perturbation Stability in Near-Linear Time

09/30/2020
by   Pankaj K. Agarwal, et al.
0

We consider the problem of center-based clustering in low-dimensional Euclidean spaces under the perturbation stability assumption. An instance is α-stable if the underlying optimal clustering continues to remain optimal even when all pairwise distances are arbitrarily perturbed by a factor of at most α. Our main contribution is in presenting efficient exact algorithms for α-stable clustering instances whose running times depend near-linearly on the size of the data set when α≥ 2 + √(3). For k-center and k-means problems, our algorithms also achieve polynomial dependence on the number of clusters, k, when α≥ 2 + √(3) + ϵ for any constant ϵ > 0 in any fixed dimension. For k-median, our algorithms have polynomial dependence on k for α > 5 in any fixed dimension; and for α≥ 2 + √(3) in two dimensions. Our algorithms are simple, and only require applying techniques such as local search or dynamic programming to a suitably modified metric space, combined with careful choice of data structures.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/14/2018

Exact Algorithms and Lower Bounds for Stable Instances of Euclidean k-Means

We investigate the complexity of solving stable or perturbation-resilien...
research
04/28/2018

Clustering Perturbation Resilient Instances

Euclidean k-means is a problem that is NP-hard in the worst-case but oft...
research
06/11/2018

Perturbation Resilient Clustering for k-Center and Related Problems via LP Relaxations

We consider clustering in the perturbation resilience model that has bee...
research
12/04/2017

Clustering Stable Instances of Euclidean k-means

The Euclidean k-means problem is arguably the most widely-studied cluste...
research
12/20/2018

Near-Linear Time Approximation Schemes for Clustering in Doubling Metrics

We consider the classic Facility Location, k-Median, and k-Means problem...
research
10/19/2022

Near-optimal Coresets for Robust Clustering

We consider robust clustering problems in ℝ^d, specifically k-clustering...
research
11/08/2018

Performance of Johnson-Lindenstrauss Transform for k-Means and k-Medians Clustering

Consider an instance of Euclidean k-means or k-medians clustering. We sh...

Please sign up or login with your details

Forgot password? Click here to reset