Fast greedy algorithm for subspace clustering from corrupted and incomplete data

06/07/2013
by Alexander Petukhov, et al.

We describe the Fast Greedy Sparse Subspace Clustering (FGSSC) algorithm, an efficient method for clustering data that belong to a few low-dimensional linear or affine subspaces. The main difference between our algorithm and its predecessors is its ability to work with noisy data that have a high rate of erasures (missing entries at known coordinates) and errors (corrupted entries at unknown coordinates). We discuss how to implement the fast version of the greedy algorithm with maximum efficiency by incorporating the greedy strategy into the iterations of the basic algorithm. We provide numerical evidence that, in subspace clustering capability, the fast greedy algorithm outperforms not only the existing state-of-the-art SSC algorithm, which the authors take as the basic algorithm, but also the recent GSSC algorithm. At the same time, its computational cost is only slightly higher than that of SSC. Numerical evidence of the algorithm's significant advantage is presented for a few synthetic models as well as for the Extended Yale B dataset of facial images. In particular, the face recognition misclassification rate turned out to be 6-20 times lower than for the SSC algorithm. We also provide numerical evidence that the FGSSC algorithm is able to cluster corrupted data efficiently even when the sum of the subspace dimensions significantly exceeds the dimension of the ambient space.
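
The data model described in the abstract (points on a few low-dimensional subspaces, with erasures at known coordinates and errors at unknown coordinates) can be illustrated with a small sketch. This is not the authors' experimental setup and does not implement FGSSC, SSC, or GSSC; the function name `make_corrupted_subspace_data`, all parameter values, the Gaussian sampling, and the choice to zero out erased entries are assumptions made only for illustration.

```python
import random


def make_corrupted_subspace_data(n_subspaces=3, dim=2, ambient=10,
                                 points_per_subspace=5,
                                 erasure_rate=0.2, error_rate=0.1, seed=0):
    """Hypothetical synthetic model (an assumption, not the paper's setup).

    Returns (data, labels, mask):
      data   - list of points in R^ambient, erased entries set to 0.0
      labels - index of the subspace each point was drawn from
      mask   - per-entry booleans, True where the entry was observed
    """
    rng = random.Random(seed)
    data, labels, mask = [], [], []
    for k in range(n_subspaces):
        # A random linear subspace spanned by `dim` Gaussian vectors.
        basis = [[rng.gauss(0, 1) for _ in range(ambient)]
                 for _ in range(dim)]
        for _ in range(points_per_subspace):
            coeffs = [rng.gauss(0, 1) for _ in range(dim)]
            point = [sum(c * b[i] for c, b in zip(coeffs, basis))
                     for i in range(ambient)]
            observed = []
            for i in range(ambient):
                # Error: the entry is overwritten and its position is
                # NOT recorded anywhere (unknown coordinates).
                if rng.random() < error_rate:
                    point[i] = rng.gauss(0, 1)
                # Erasure: the entry is lost, but its position is kept
                # in the mask (known coordinates).
                observed.append(rng.random() >= erasure_rate)
            point = [x if seen else 0.0 for x, seen in zip(point, observed)]
            data.append(point)
            labels.append(k)
            mask.append(observed)
    return data, labels, mask


data, labels, mask = make_corrupted_subspace_data()
```

A clustering method for this setting would receive `data` and `mask` and try to recover `labels`; the error positions are deliberately not returned, which is what distinguishes errors from erasures in this sketch.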


