megaman: Manifold Learning with Millions of points

03/09/2016
by James McQueen, et al.

Manifold learning is a class of algorithms that seek a low-dimensional non-linear representation of high-dimensional data. Manifold learning algorithms are therefore, at least in theory, most applicable to high-dimensional data and to sample sizes large enough to enable accurate estimation of the manifold. Despite this, most existing manifold learning implementations are not particularly scalable. Here we present a Python package that implements a variety of manifold learning algorithms in a modular and scalable fashion, using fast approximate nearest-neighbor searches and fast sparse eigendecompositions. The package incorporates theoretical advances in manifold learning, such as the unbiased Laplacian estimator and the estimation of embedding distortion by the Riemannian metric method. In benchmarks, even on a single-core desktop computer, our code embeds millions of data points in minutes, and takes just 200 minutes to embed the main sample of galaxy spectra from the Sloan Digital Sky Survey (0.6 million samples in 3750 dimensions), a task which has not previously been possible.
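
To make the two scalability ingredients named in the abstract concrete, here is a minimal sketch of a spectral embedding built from a sparse neighbor graph and a sparse eigensolver. It is not the megaman API: the function name `spectral_embedding_sketch` and its parameters are hypothetical. It also makes three simplifying substitutions: an exact KD-tree search from SciPy stands in for an approximate neighbor search, the bandwidth is set by an assumed heuristic (the median neighbor distance), and a plain symmetric normalized Laplacian is used rather than the unbiased Laplacian estimator the package incorporates.

```python
import numpy as np
from scipy.sparse import csr_matrix, diags, eye
from scipy.sparse.linalg import eigsh
from scipy.spatial import cKDTree


def spectral_embedding_sketch(X, n_components=2, n_neighbors=10):
    """Embed the rows of X via a sparse kNN graph and a sparse eigensolver.

    Illustrative only; hypothetical helper, not part of any package.
    """
    n = X.shape[0]

    # Neighbor search. An exact KD-tree query is used here as a stand-in
    # for the approximate neighbor search mentioned in the abstract.
    tree = cKDTree(X)
    dist, idx = tree.query(X, k=n_neighbors + 1)
    dist, idx = dist[:, 1:], idx[:, 1:]  # drop the first hit (the query point itself)

    # Sparse Gaussian affinity matrix. The bandwidth choice is an assumption.
    bandwidth = np.median(dist)
    rows = np.repeat(np.arange(n), n_neighbors)
    weights = np.exp(-(dist.ravel() ** 2) / bandwidth ** 2)
    W = csr_matrix((weights, (rows, idx.ravel())), shape=(n, n))
    W = W.maximum(W.T)  # symmetrize the kNN graph

    # Symmetric normalized Laplacian: L = I - D^{-1/2} W D^{-1/2}.
    deg = np.asarray(W.sum(axis=1)).ravel()
    d_inv_sqrt = diags(1.0 / np.sqrt(deg))
    L = (eye(n) - d_inv_sqrt @ W @ d_inv_sqrt).tocsc()

    # Sparse eigendecomposition: shift-invert around a small negative shift
    # returns the eigenpairs closest to zero, i.e. the smallest ones.
    evals, evecs = eigsh(L, k=n_components + 1, sigma=-1e-3, which="LM")
    order = np.argsort(evals)

    # Skip the first (trivial) eigenvector and keep the next n_components.
    return evecs[:, order[1:n_components + 1]]
```

Calling `spectral_embedding_sketch(X)` on an array of shape `(n_samples, n_features)` returns an array of shape `(n_samples, n_components)`. Both the affinity matrix and the Laplacian stay sparse throughout, which is the property that lets this style of computation scale to large sample sizes.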


