Provable Data Clustering via Innovation Search

08/16/2021
by   Weiwei Li, et al.
6

This paper studies the subspace clustering problem in which data points collected from high-dimensional ambient space lie in a union of linear subspaces. Subspace clustering becomes challenging when the dimension of intersection between subspaces is large and most of the self-representation based methods are sensitive to the intersection between the span of clusters. In sharp contrast to the self-representation based methods, a recently proposed clustering method termed Innovation Pursuit, computed a set of optimal directions (directions of innovation) to build the adjacency matrix. This paper focuses on the Innovation Pursuit Algorithm to shed light on its impressive performance when the subspaces are heavily intersected. It is shown that in contrast to most of the existing methods which require the subspaces to be sufficiently incoherent with each other, Innovation Pursuit only requires the innovative components of the subspaces to be sufficiently incoherent with each other. These new sufficient conditions allow the clusters to be strongly close to each other. Motivated by the presented theoretical analysis, a simple yet effective projection based technique is proposed which we show with both numerical and theoretical results that it can boost the performance of Innovation Pursuit.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/08/2022

Provable Clustering of a Union of Linear Manifolds Using Optimal Directions

This paper focuses on the Matrix Factorization based Clustering (MFC) me...
research
12/02/2015

Innovation Pursuit: A New Approach to Subspace Clustering

In subspace clustering, a group of data points belonging to a union of s...
research
12/30/2019

Outlier Detection and Data Clustering via Innovation Search

The idea of Innovation Search was proposed as a data clustering method i...
research
08/16/2017

Active Orthogonal Matching Pursuit for Sparse Subspace Clustering

Sparse Subspace Clustering (SSC) is a state-of-the-art method for cluste...
research
07/18/2013

Robust Subspace Clustering via Thresholding

The problem of clustering noisy and incompletely observed high-dimension...
research
02/02/2020

Provable Noisy Sparse Subspace Clustering using Greedy Neighbor Selection: A Coherence-Based Perspective

Sparse subspace clustering (SSC) using greedy-based neighbor selection, ...
research
06/23/2021

Closed-Form, Provable, and Robust PCA via Leverage Statistics and Innovation Search

The idea of Innovation Search, which was initially proposed for data clu...

Please sign up or login with your details

Forgot password? Click here to reset