Sparse Subspace Clustering: Algorithm, Theory, and Applications

03/05/2012
by   Ehsan Elhamifar, et al.
0

In many real-world problems, we are dealing with collections of high-dimensional data, such as images, videos, text and web documents, DNA microarray data, and more. Often, high-dimensional data lie close to low-dimensional structures corresponding to several classes or categories the data belongs to. In this paper, we propose and study an algorithm, called Sparse Subspace Clustering (SSC), to cluster data points that lie in a union of low-dimensional subspaces. The key idea is that, among infinitely many possible representations of a data point in terms of other points, a sparse representation corresponds to selecting a few points from the same subspace. This motivates solving a sparse optimization program whose solution is used in a spectral clustering framework to infer the clustering of data into subspaces. Since solving the sparse optimization program is in general NP-hard, we consider a convex relaxation and show that, under appropriate conditions on the arrangement of subspaces and the distribution of data, the proposed minimization program succeeds in recovering the desired sparse representations. The proposed algorithm can be solved efficiently and can handle data points near the intersections of subspaces. Another key advantage of the proposed algorithm with respect to the state of the art is that it can deal with data nuisances, such as noise, sparse outlying entries, and missing entries, directly by incorporating the model of the data into the sparse optimization program. We demonstrate the effectiveness of the proposed algorithm through experiments on synthetic data as well as the two real-world problems of motion segmentation and face clustering.

READ FULL TEXT
research
12/23/2014

Approximate Subspace-Sparse Recovery with Corrupted Data via Constrained ℓ_1-Minimization

High-dimensional data often lie in low-dimensional subspaces correspondi...
research
04/09/2020

Learnable Subspace Clustering

This paper studies the large-scale subspace clustering (LSSC) problem wi...
research
01/25/2019

Subspace Clustering of Very Sparse High-Dimensional Data

In this paper we consider the problem of clustering collections of very ...
research
08/16/2017

Active Orthogonal Matching Pursuit for Sparse Subspace Clustering

Sparse Subspace Clustering (SSC) is a state-of-the-art method for cluste...
research
10/19/2019

LSTM-Assisted Evolutionary Self-Expressive Subspace Clustering

Massive volumes of high-dimensional data that evolves over time is conti...
research
01/23/2019

Simultaneous Subspace Clustering and Cluster Number Estimating based on Triplet Relationship

In this paper we propose a unified framework to simultaneously discover ...
research
02/18/2012

Robust computation of linear models by convex relaxation

Consider a dataset of vector-valued observations that consists of noisy ...

Please sign up or login with your details

Forgot password? Click here to reset