Minimum Spectral Connectivity Projection Pursuit

09/04/2015
by   David P. Hofmeyr, et al.
0

We study the problem of determining the optimal low dimensional projection for maximising the separability of a binary partition of an unlabelled dataset, as measured by spectral graph theory. This is achieved by finding projections which minimise the second eigenvalue of the graph Laplacian of the projected data, which corresponds to a non-convex, non-smooth optimisation problem. We show that the optimal univariate projection based on spectral connectivity converges to the vector normal to the maximum margin hyperplane through the data, as the scaling parameter is reduced to zero. This establishes a connection between connectivity as measured by spectral graph theory and maximal Euclidean separation. The computational cost associated with each eigen-problem is quadratic in the number of data. To mitigate this issue, we propose an approximation method using microclusters with provable approximation error bounds. Combining multiple binary partitions within a divisive hierarchical model allows us to construct clustering solutions admitting clusters with varying scales and lying within different subspaces. We evaluate the performance of the proposed method on a large collection of benchmark datasets and find that it compares favourably with existing methods for projection pursuit and dimension reduction for data clustering.

READ FULL TEXT
research
04/08/2021

Visual Diagnostics for Constrained Optimisation with Application to Guided Tours

A guided tour helps to visualise high-dimensional data by showing low-di...
research
12/16/2018

Connecting Spectral Clustering to Maximum Margins and Level Sets

We study the connections between spectral clustering and the problems of...
research
12/16/2019

Projection pursuit with applications to scRNA sequencing data

In this paper, we explore the limitations of PCA as a dimension reductio...
research
06/09/2021

Large-scale optimal transport map estimation using projection pursuit

This paper studies the estimation of large-scale optimal transport maps ...
research
10/28/2019

The spectral dimension of simplicial complexes: a renormalization group theory

Simplicial complexes are increasingly used to study complex system struc...
research
11/24/2022

Projection pursuit adaptation on polynomial chaos expansions

The present work addresses the issue of accurate stochastic approximatio...
research
06/14/2023

Multi-class Graph Clustering via Approximated Effective p-Resistance

This paper develops an approximation to the (effective) p-resistance and...

Please sign up or login with your details

Forgot password? Click here to reset