Efficient Manifold and Subspace Approximations with Spherelets

06/26/2017
by Didong Li, et al.

Data lying in a high-dimensional ambient space are commonly thought to have a much lower intrinsic dimension. In particular, the data may be concentrated near a lower-dimensional subspace or manifold. There is an immense literature focused on approximating the unknown subspace and on exploiting such approximations in clustering, data compression, and building predictive models. Most of the literature relies on approximating subspaces using a locally linear, and potentially multiscale, dictionary. In this article, we propose a simple and general alternative, which instead uses pieces of spheres, or spherelets, to locally approximate the unknown subspace. Building on this idea, we develop a simple and computationally efficient algorithm for subspace learning and clustering. Results relative to state-of-the-art competitors show dramatic gains in the ability to accurately approximate the subspace with orders of magnitude fewer components. This leads to substantial gains in data compressibility, fewer clusters and hence better interpretability, and much lower MSE based on small to moderate sample sizes. Basic theory on approximation accuracy is presented, and the methods are applied to multiple examples.
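
To make the idea of approximating data by a piece of a sphere concrete, the sketch below fits a single sphere to a set of points by algebraic least squares. It is an illustration under stated assumptions and is not claimed to be the estimator used in the article: the function names `solve_linear` and `fit_sphere` are hypothetical, the fit is a standard textbook technique, and the demo data are noise-free points on a circular arc. The code is kept dependency-free for readability rather than speed.

```python
import math


def solve_linear(A, b):
    """Solve the square system A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix [A | b], copied so the inputs are not modified.
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            raise ValueError("degenerate point set: no unique sphere")
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for k in range(col, n + 1):
                M[r][k] -= factor * M[col][k]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def fit_sphere(points):
    """Algebraic least-squares sphere fit (hypothetical helper, not from the article).

    The sphere equation ||x - c||^2 = r^2 is rewritten as
    2 x.c + b = ||x||^2 with b = r^2 - ||c||^2, which is linear in (c, b).
    """
    dim = len(points[0])
    rows = [[2.0 * v for v in p] + [1.0] for p in points]
    targets = [sum(v * v for v in p) for p in points]
    m = dim + 1
    # Normal equations (A^T A) z = A^T y for the unknowns z = (c, b).
    AtA = [[sum(r[i] * r[j] for r in rows) for j in range(m)] for i in range(m)]
    Aty = [sum(r[i] * t for r, t in zip(rows, targets)) for i in range(m)]
    z = solve_linear(AtA, Aty)
    center = z[:dim]
    radius = math.sqrt(z[dim] + sum(c * c for c in center))
    return center, radius


# Demo: 20 noise-free points on a 60-degree arc of a circle
# with center (1, -2) and radius 3 (values chosen for illustration).
angles = [i * (math.pi / 3) / 19 for i in range(20)]
arc = [(1.0 + 3.0 * math.cos(t), -2.0 + 3.0 * math.sin(t)) for t in angles]
center, radius = fit_sphere(arc)
print(center, radius)
```

In a locally spherical approximation, a fit of this kind would be applied separately within each local neighborhood of the data. How neighborhoods are chosen is not described in the abstract and is left out of the sketch.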

research
06/29/2019

Geodesic Distance Estimation with Spherelets

Many statistical and machine learning approaches rely on pairwise distan...
research
06/07/2013

Fast greedy algorithm for subspace clustering from corrupted and incomplete data

We describe the Fast Greedy Sparse Subspace Clustering (FGSSC) algorithm...
research
11/07/2018

Scalable Bottom-up Subspace Clustering using FP-Trees for High Dimensional Data

Subspace clustering aims to find groups of similar objects (clusters) th...
research
04/09/2020

Learnable Subspace Clustering

This paper studies the large-scale subspace clustering (LSSC) problem wi...
research
12/10/2019

Transformed Subspace Clustering

Subspace clustering assumes that the data is separable into separate su...
research
01/11/2013

Robust subspace clustering

Subspace clustering refers to the task of finding a multi-subspace repre...
research
12/14/2022

Learning Invariant Subspaces of Koopman Operators–Part 2: Heterogeneous Dictionary Mixing to Approximate Subspace Invariance

This work builds on the models and concepts presented in part 1 to learn...
