Random Projections for k-means Clustering

11/21/2010
by   Christos Boutsidis, et al.
0

This paper discusses the topic of dimensionality reduction for k-means clustering. We prove that any set of n points in d dimensions (rows in a matrix A ∈^n × d) can be projected into t = Ω(k / ^2) dimensions, for any ∈ (0,1/3), in O(n d ^-2 k/ (d) ) time, such that with constant probability the optimal k-partition of the point set is preserved within a factor of 2+. The projection is done by post-multiplying A with a d × t random matrix R having entries +1/√(t) or -1/√(t) with equal probability. A numerical implementation of our technique and experiments on a large face images dataset verify the speed and the accuracy of our theoretical results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/11/2015

Isometric sketching of any set via the Restricted Isometry Property

In this paper we show that for the purposes of dimensionality reduction ...
research
07/26/2020

Dimensionality Reduction for k-means Clustering

We present a study on how to effectively reduce the dimensions of the k-...
research
11/08/2018

Performance of Johnson-Lindenstrauss Transform for k-Means and k-Medians Clustering

Consider an instance of Euclidean k-means or k-medians clustering. We sh...
research
10/12/2021

Dimensionality Reduction for k-Distance Applied to Persistent Homology

Given a set P of n points and a constant k, we are interested in computi...
research
06/18/2012

Is margin preserved after random projection?

Random projections have been applied in many machine learning algorithms...
research
01/16/2013

Experiments with Random Projection

Recent theoretical work has identified random projection as a promising ...
research
12/30/2022

Hashing-like Johnson–Lindenstrauss transforms and their extreme singular values

The Johnson–Lindenstrauss (JL) lemma is a powerful tool for dimensionali...

Please sign up or login with your details

Forgot password? Click here to reset