Statistical data analysis in the Wasserstein space

07/19/2019
by Jérémie Bigot, et al.

This paper is concerned with statistical inference problems for data sets whose elements may be modeled as random probability measures, such as multiple histograms or point clouds. We review recent contributions in statistics on the use of Wasserstein distances and tools from optimal transport to analyse such data. In particular, we highlight the benefits of using the notions of barycenter and geodesic PCA in the Wasserstein space for learning the principal modes of geometric variation in a dataset. In this setting, we discuss existing works and present some research perspectives related to the emerging field of statistical optimal transport.
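
As a rough illustration of the barycenter notion mentioned in the abstract, the sketch below handles only the simplest case: point clouds on the real line with the same number of equally weighted points. In that one-dimensional setting the 2-Wasserstein distance reduces to comparing sorted samples, and the barycenter is obtained by averaging sorted samples. The function names `wasserstein2_1d` and `barycenter_1d` and the toy data are assumptions made for this example, not code from the paper, and the approach does not carry over directly to higher dimensions.

```python
import math


def wasserstein2_1d(xs, ys):
    """2-Wasserstein distance between two 1D empirical measures.

    Assumes both samples have the same size and uniform weights, so the
    optimal coupling simply matches the sorted points in order.
    """
    if len(xs) != len(ys):
        raise ValueError("samples must have the same number of points")
    a, b = sorted(xs), sorted(ys)
    return math.sqrt(sum((u - v) ** 2 for u, v in zip(a, b)) / len(a))


def barycenter_1d(samples):
    """Equal-weight Wasserstein barycenter of several 1D empirical measures.

    Assumes every sample has the same size; the barycenter is then the
    coordinate-wise mean of the sorted samples (average of quantiles).
    """
    ordered = [sorted(s) for s in samples]
    n = len(ordered[0])
    if any(len(s) != n for s in ordered):
        raise ValueError("samples must have the same number of points")
    return [sum(s[i] for s in ordered) / len(ordered) for i in range(n)]


# Toy data (hypothetical): two small point clouds on the real line.
clouds = [[0.0, 1.0, 2.0], [4.0, 6.0, 5.0]]
bary = barycenter_1d(clouds)              # [2.0, 3.0, 4.0]
dist = wasserstein2_1d(clouds[0], bary)   # 2.0
```

With two measures, the barycenter sits halfway along the Wasserstein geodesic between them, which is why its distance to each cloud here is half of the distance between the clouds.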

research
06/26/2015

Principal Geodesic Analysis for Probability Measures under the Optimal Transport Metric

Given a family of probability measures in P(X), the space of probability...
research
01/28/2022

Optimal Transport Tools (OTT): A JAX Toolbox for all things Wasserstein

Optimal transport tools (OTT-JAX) is a Python toolbox that can solve opt...
research
03/17/2017

PSF field learning based on Optimal Transport Distances

Context: in astronomy, observing large fractions of the sky within a rea...
research
10/19/2022

Stability of Entropic Wasserstein Barycenters and application to random geometric graphs

As interest in graph data has grown in recent years, the computation of ...
research
08/29/2018

Wasserstein is all you need

We propose a unified framework for building unsupervised representations...
research
12/15/2018

Mapper Comparison with Wasserstein Metrics

The challenge of describing model drift is an open question in unsupervi...
research
05/18/2018

Wasserstein Coresets for Lipschitz Costs

Sparsification is becoming more and more relevant with the proliferation...
