Subspace Clustering via Thresholding and Spectral Clustering

03/15/2013
by   Reinhard Heckel, et al.
0

We consider the problem of clustering a set of high-dimensional data points into sets of low-dimensional linear subspaces. The number of subspaces, their dimensions, and their orientations are unknown. We propose a simple and low-complexity clustering algorithm based on thresholding the correlations between the data points followed by spectral clustering. A probabilistic performance analysis shows that this algorithm succeeds even when the subspaces intersect, and when the dimensions of the subspaces scale (up to a log-factor) linearly in the ambient dimension. Moreover, we prove that the algorithm also succeeds for data points that are subject to erasures with the number of erasures scaling (up to a log-factor) linearly in the ambient dimension. Finally, we propose a simple scheme that provably detects outliers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/18/2013

Robust Subspace Clustering via Thresholding

The problem of clustering noisy and incompletely observed high-dimension...
research
05/15/2013

Noisy Subspace Clustering via Thresholding

We consider the problem of clustering noisy high-dimensional data points...
research
03/13/2014

Neighborhood Selection for Thresholding-based Subspace Clustering

Subspace clustering refers to the problem of clustering high-dimensional...
research
12/19/2011

A geometric analysis of subspace clustering with outliers

This paper considers the problem of clustering a collection of unlabeled...
research
06/27/2021

Linear-Time Approximation Scheme for k-Means Clustering of Affine Subspaces

In this paper, we present a linear-time approximation scheme for k-means...
research
06/23/2023

On the Convergence Rate of Gaussianization with Random Rotations

Gaussianization is a simple generative model that can be trained without...
research
04/04/2017

Polynomial Time and Sample Complexity for Non-Gaussian Component Analysis: Spectral Methods

The problem of Non-Gaussian Component Analysis (NGCA) is about finding a...

Please sign up or login with your details

Forgot password? Click here to reset