Conditional Wasserstein Barycenters and Interpolation/Extrapolation of Distributions

07/20/2021
by   Jianing Fan, et al.
0

Increasingly complex data analysis tasks motivate the study of the dependency of distributions of multivariate continuous random variables on scalar or vector predictors. Statistical regression models for distributional responses so far have primarily been investigated for the case of one-dimensional response distributions. We investigate here the case of multivariate response distributions while adopting the 2-Wasserstein metric in the distribution space. The challenge is that unlike the situation in the univariate case, the optimal transports that correspond to geodesics in the space of distributions with the 2-Wasserstein metric do not have an explicit representation for multivariate distributions. We show that under some regularity assumptions the conditional Wasserstein barycenters constructed for a geodesic in the Euclidean predictor space form a corresponding geodesic in the Wasserstein distribution space and demonstrate how the notion of conditional barycenters can be harnessed to interpolate as well as extrapolate multivariate distributions. The utility of distributional inter- and extrapolation is explored in simulations and examples. We study both global parametric-like and local smoothing-like models to implement conditional Wasserstein barycenters and establish asymptotic convergence properties for the corresponding estimates. For algorithmic implementation we make use of a Sinkhorn entropy-penalized algorithm. Conditional Wasserstein barycenters and distribution extrapolation are illustrated with applications in climate science and studies of aging.

READ FULL TEXT

page 4

page 18

page 19

page 20

page 22

page 25

page 41

page 42

research
06/18/2023

Sliced Wasserstein Regression

While statistical modeling of distributional data has gained increased a...
research
06/17/2020

Wasserstein Regression

The analysis of samples of random objects that do not lie in a vector sp...
research
07/12/2023

Distribution-on-Distribution Regression with Wasserstein Metric: Multivariate Gaussian Case

Distribution data refers to a data set where each sample is represented ...
research
08/13/2021

Fréchet single index models for object response regression

With the availability of more non-euclidean data objects, statisticians ...
research
08/24/2023

Wasserstein Regression with Empirical Measures and Density Estimation for Sparse Data

The problem of modeling the relationship between univariate distribution...
research
09/12/2022

Wasserstein Distributional Learning

Learning conditional densities and identifying factors that influence th...
research
10/29/2019

Wasserstein F-tests and Confidence Bands for the Frèchet Regression of Density Response Curves

Data consisting of samples of probability density functions are increasi...

Please sign up or login with your details

Forgot password? Click here to reset