Principal Geodesic Analysis for Probability Measures under the Optimal Transport Metric

06/26/2015
by   Vivien Seguy, et al.
0

Given a family of probability measures in P(X), the space of probability measures on a Hilbert space X, our goal in this paper is to highlight one ore more curves in P(X) that summarize efficiently that family. We propose to study this problem under the optimal transport (Wasserstein) geometry, using curves that are restricted to be geodesic segments under that metric. We show that concepts that play a key role in Euclidean PCA, such as data centering or orthogonality of principal directions, find a natural equivalent in the optimal transport geometry, using Wasserstein means and differential geometry. The implementation of these ideas is, however, computationally challenging. To achieve scalable algorithms that can handle thousands of measures, we propose to use a relaxed definition for geodesics and regularized optimal transport distances. The interest of our approach is demonstrated on images seen either as shapes or color histograms.

READ FULL TEXT
research
07/19/2019

Statistical data analysis in the Wasserstein space

This paper is concerned by statistical inference problems from a data se...
research
10/14/2019

Quantitative stability of optimal transport maps and linearization of the 2-Wasserstein space

This work studies an explicit embedding of the set of probability measur...
research
10/22/2020

Fast and Smooth Interpolation on Wasserstein Space

We propose a new method for smoothly interpolating probability measures ...
research
11/10/2020

Unbalanced Optimal Transport using Integral Probability Metric Regularization

Unbalanced Optimal Transport (UOT) is the generalization of classical op...
research
09/21/2022

Quantitative Stability of Barycenters in the Wasserstein Space

Wasserstein barycenters define averages of probability measures in a geo...
research
04/05/2023

Wasserstein Principal Component Analysis for Circular Measures

We consider the 2-Wasserstein space of probability measures supported on...
research
05/22/2018

Large Scale computation of Means and Clusters for Persistence Diagrams using Optimal Transport

Persistence diagrams (PDs) are now routinely used to summarize the under...

Please sign up or login with your details

Forgot password? Click here to reset