Projected Statistical Methods for Distributional Data on the Real Line with the Wasserstein Metric

01/22/2021
by Matteo Pegoraro, et al.

We present a novel class of projected methods for the statistical analysis of data sets of probability distributions on the real line, equipped with the 2-Wasserstein metric. We focus in particular on Principal Component Analysis (PCA) and regression. To define these models, we exploit a representation of the Wasserstein space closely related to its weak Riemannian structure: we map the data to a suitable linear space and use a metric projection operator to constrain the results to the Wasserstein space. By carefully choosing the tangent point, we derive fast empirical methods that exploit a constrained B-spline approximation. As a byproduct of our approach, we also derive faster routines for previous work on PCA for distributions. In simulation studies, we compare our approaches to previously proposed methods, showing that our projected PCA achieves similar performance at a fraction of the computational cost and that the projected regression is extremely flexible even under misspecification. Several theoretical properties of the models are investigated and asymptotic consistency is proven. Two real-world applications, to Covid-19 mortality in the US and to wind speed forecasting, are discussed.
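
To make the "map to a linear space, then project back" idea concrete, here is a minimal Python sketch; it is an illustration under stated assumptions, not the authors' implementation. It relies on the standard fact that, on the real line, the 2-Wasserstein distance between two distributions equals the L2 distance between their quantile functions. The function names (`quantile_on_grid`, `w2_distance`, `project_monotone`) are hypothetical, and the projection uses a pool-adjacent-violators pass on a uniform grid as a simple stand-in for the constrained B-spline approximation mentioned in the abstract.

```python
import numpy as np


def quantile_on_grid(samples, m=200):
    """Empirical quantile function evaluated at m midpoints of (0, 1)."""
    grid = (np.arange(m) + 0.5) / m
    return np.quantile(samples, grid)


def w2_distance(q1, q2):
    """Approximate 2-Wasserstein distance on the real line as the L2 distance
    between two quantile functions sampled on the same uniform grid."""
    return float(np.sqrt(np.mean((q1 - q2) ** 2)))


def project_monotone(y):
    """L2 projection of a grid function onto non-decreasing sequences
    (pool-adjacent-violators, equal weights). Assumption: this replaces the
    constrained B-spline step of the paper with a simpler grid-based one."""
    blocks = []  # each block stores [sum of values, number of values]
    for v in y:
        blocks.append([float(v), 1])
        # Merge backwards while the previous block mean exceeds the current one.
        while len(blocks) > 1 and (
            blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]
        ):
            s, c = blocks.pop()
            blocks[-1][0] += s
            blocks[-1][1] += c
    return np.concatenate([np.full(c, s / c) for s, c in blocks])


# Illustrative data (synthetic, not from the paper).
rng = np.random.default_rng(0)
qa = quantile_on_grid(rng.normal(0.0, 1.0, size=500))
qb = quantile_on_grid(rng.normal(2.0, 0.5, size=500))
print("W2(a, b) approx:", w2_distance(qa, qb))

# A linear operation (here, extrapolating past qb) can leave the set of valid
# quantile functions; the projection maps the result back to a monotone one.
extrapolated = qa + 3.0 * (qb - qa)
projected = project_monotone(extrapolated)
print("monotone before:", bool(np.all(np.diff(extrapolated) >= 0)))
print("monotone after: ", bool(np.all(np.diff(projected) >= 0)))
```

The design choice here is to do all arithmetic on quantile functions, where averaging and extrapolation are ordinary vector operations, and to restore validity only at the end with a single projection.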

research
10/22/2020

Two-sample Test using Projected Wasserstein Distance: Breaking the Curse of Dimensionality

We develop a projected Wasserstein distance for the two-sample test, a f...
research
06/17/2020

Wasserstein Regression

The analysis of samples of random objects that do not lie in a vector sp...
research
04/05/2023

Wasserstein Principal Component Analysis for Circular Measures

We consider the 2-Wasserstein space of probability measures supported on...
research
11/05/2022

Efficient Convex PCA with applications to Wasserstein geodesic PCA and ranked data

Convex PCA, which was introduced by Bigot et al., is a dimension reducti...
research
10/10/2019

Gromov-Wasserstein Averaging in a Riemannian Framework

We introduce a theoretical framework for performing statistical tasks—in...
research
02/12/2021

Two-sample Test with Kernel Projected Wasserstein Distance

We develop a kernel projected Wasserstein distance for the two-sample te...
research
01/12/2023

Self-Attention Amortized Distributional Projection Optimization for Sliced Wasserstein Point-Cloud Reconstruction

Max sliced Wasserstein (Max-SW) distance has been widely known as a solu...
