Large sample spectral analysis of graph-based multi-manifold clustering

07/28/2021
by   Nicolas Garcia Trillos, et al.
0

In this work we study statistical properties of graph-based algorithms for multi-manifold clustering (MMC). In MMC the goal is to retrieve the multi-manifold structure underlying a given Euclidean data set when this one is assumed to be obtained by sampling a distribution on a union of manifolds ℳ = ℳ_1 ∪…∪ℳ_N that may intersect with each other and that may have different dimensions. We investigate sufficient conditions that similarity graphs on data sets must satisfy in order for their corresponding graph Laplacians to capture the right geometric information to solve the MMC problem. Precisely, we provide high probability error bounds for the spectral approximation of a tensorized Laplacian on ℳ with a suitable graph Laplacian built from the observations; the recovered tensorized Laplacian contains all geometric information of all the individual underlying manifolds. We provide an example of a family of similarity graphs, which we call annular proximity graphs with angle constraints, satisfying these sufficient conditions. We contrast our family of graphs with other constructions in the literature based on the alignment of tangent planes. Extensive numerical experiments expand the insights that our theory provides on the MMC problem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/26/2021

A Graph-based approach to derive the geodesic distance on Statistical manifolds: Application to Multimedia Information Retrieval

In this paper, we leverage the properties of non-Euclidean Geometry to d...
research
08/20/2019

On spectral analysis and extrapolation for processes on branched 1-manifolds

The paper studies processes defined on time domains structured as orient...
research
07/06/2016

Graphons, mergeons, and so on!

In this work we develop a theory of hierarchical clustering for graphs. ...
research
04/20/2020

From graph cuts to isoperimetric inequalities: Convergence rates of Cheeger cuts on data clouds

In this work we study statistical properties of graph-based clustering a...
research
06/05/2017

The Geometry of Nodal Sets and Outlier Detection

Let (M,g) be a compact manifold and let -Δϕ_k = λ_k ϕ_k be the sequence ...
research
04/06/2019

Local Regularization of Noisy Point Clouds: Improved Global Geometric Estimates and Data Analysis

Several data analysis techniques employ similarity relationships between...
research
03/19/2023

Skeleton Regression: A Graph-Based Approach to Estimation with Manifold Structure

We introduce a new regression framework designed to deal with large-scal...

Please sign up or login with your details

Forgot password? Click here to reset