Linearized Wasserstein dimensionality reduction with approximation guarantees

02/14/2023
by   Alexander Cloninger, et al.
0

We introduce LOT Wassmap, a computationally feasible algorithm to uncover low-dimensional structures in the Wasserstein space. The algorithm is motivated by the observation that many datasets are naturally interpreted as probability measures rather than points in ℝ^n, and that finding low-dimensional descriptions of such datasets requires manifold learning algorithms in the Wasserstein space. Most available algorithms are based on computing the pairwise Wasserstein distance matrix, which can be computationally challenging for large datasets in high dimensions. Our algorithm leverages approximation schemes such as Sinkhorn distances and linearized optimal transport to speed-up computations, and in particular, avoids computing a pairwise distance matrix. We provide guarantees on the embedding quality under such approximations, including when explicit descriptions of the probability measures are not available and one must deal with finite samples instead. Experiments demonstrate that LOT Wassmap attains correct embeddings and that the quality improves with increased sample size. We also show how LOT Wassmap significantly reduces the computational cost when compared to algorithms that depend on pairwise distance computations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/13/2022

Wassmap: Wasserstein Isometric Mapping for Image Manifold Learning

In this paper, we propose Wasserstein Isometric Mapping (Wassmap), a par...
research
10/12/2020

Permutation invariant networks to learn Wasserstein metrics

Understanding the space of probability measures on a metric space equipp...
research
08/20/2020

Linear Optimal Transport Embedding: Provable fast Wasserstein distance computation and classification for nonlinear problems

Discriminating between distributions is an important problem in a number...
research
10/20/2017

Learning Wasserstein Embeddings

The Wasserstein distance received a lot of attention recently in the com...
research
06/21/2020

The Gaussian Transform

We introduce the Gaussian transform (GT), an optimal transport inspired ...
research
11/25/2020

Wasserstein k-means with sparse simplex projection

This paper presents a proposal of a faster Wasserstein k-means algorithm...
research
08/02/2023

Computing the Distance between unbalanced Distributions – The flat Metric

We provide an implementation to compute the flat metric in any dimension...

Please sign up or login with your details

Forgot password? Click here to reset