Performance of Johnson-Lindenstrauss Transform for k-Means and k-Medians Clustering

11/08/2018
by   Konstantin Makarychev, et al.
0

Consider an instance of Euclidean k-means or k-medians clustering. We show that the cost of the optimal solution is preserved up to a factor of (1+ε) under a projection onto a random O((k / ε) / ε^2)-dimensional subspace. Further, the cost of every clustering is preserved within (1+ε). More generally, our result applies to any dimension reduction map satisfying a mild sub-Gaussian-tail condition. Our bound on the dimension is nearly optimal. Additionally, our result applies to Euclidean k-clustering with the distances raised to the p-th power for any constant p. For k-means, our result resolves an open problem posed by Cohen, Elder, Musco, Musco, and Persu (STOC 2015); for k-medians, it answers a question raised by Kannan.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/01/2022

The Johnson-Lindenstrauss Lemma for Clustering and Subspace Approximation: From Coresets to Dimension Reduction

We study the effect of Johnson-Lindenstrauss transforms in various Eucli...
research
11/21/2010

Random Projections for k-means Clustering

This paper discusses the topic of dimensionality reduction for k-means c...
research
12/13/2017

Greedy spanners are optimal in doubling metrics

We show that the greedy spanner algorithm constructs a (1+ϵ)-spanner of ...
research
02/01/2018

Sensitivity Sampling Over Dynamic Geometric Data Streams with Applications to k-Clustering

Sensitivity based sampling is crucial for constructing nearly-optimal co...
research
10/26/2018

Hanson-Wright inequality in Hilbert spaces with application to K-means clustering for non-Euclidean data

We derive a dimensional-free Hanson-Wright inequality for quadratic form...
research
09/30/2020

Clustering under Perturbation Stability in Near-Linear Time

We consider the problem of center-based clustering in low-dimensional Eu...
research
12/11/2018

Robust Bregman Clustering

Using a trimming approach, we investigate a k-means type method based on...

Please sign up or login with your details

Forgot password? Click here to reset