Geodesic Learning via Unsupervised Decision Forests

07/05/2019
by   Meghana Madhyastha, et al.
4

Geodesic distance is the shortest path between two points in a Riemannian manifold. Manifold learning algorithms, such as Isomap, seek to learn a manifold that preserves geodesic distances. However, such methods operate on the ambient dimensionality, and are therefore fragile to noise dimensions. We developed an unsupervised random forest method (URerF) to approximately learn geodesic distances in linear and nonlinear manifolds with noise. URerF operates on low-dimensional sparse linear combinations of features, rather than the full observed dimensionality. To choose the optimal split in a computationally efficient fashion, we developed a fast Bayesian Information Criterion statistic for Gaussian mixture models. We introduce geodesic precision-recall curves which quantify performance relative to the true latent manifold. Empirical results on simulated and real data demonstrate that URerF is robust to high-dimensional noise, where as other methods, such as Isomap, UMAP, and FLANN, quickly deteriorate in such settings. In particular, URerF is able to estimate geodesic distances on a real connectome dataset better than other approaches.

READ FULL TEXT

page 1

page 8

page 9

page 10

research
06/18/2020

Rehabilitating Isomap: Euclidean Representation of Geodesic Structure

Manifold learning techniques for nonlinear dimension reduction assume th...
research
07/03/2023

Supervised Manifold Learning via Random Forest Geometry-Preserving Proximities

Manifold learning approaches seek the intrinsic, low-dimensional data st...
research
06/29/2019

Geodesic Distance Estimation with Spherelets

Many statistical and machine learning approaches rely on pairwise distan...
research
03/27/2023

Manifold Learning by Mixture Models of VAEs for Inverse Problems

Representing a manifold of very high-dimensional data with generative mo...
research
05/30/2019

Power Weighted Shortest Paths for Unsupervised Learning

We study the use of power weighted shortest path distance functions for ...
research
12/17/2020

Balancing Geometry and Density: Path Distances on High-Dimensional Data

New geometric and computational analyses of power-weighted shortest-path...
research
09/22/2021

The Curse Revisited: a Newly Quantified Concept of Meaningful Distances for Learning from High-Dimensional Noisy Data

Distances between data points are widely used in point cloud representat...

Please sign up or login with your details

Forgot password? Click here to reset