Wasserstein Distributional Learning

09/12/2022
by   Chengliang Tang, et al.
43

Learning conditional densities and identifying factors that influence the entire distribution are vital tasks in data-driven applications. Conventional approaches work mostly with summary statistics, and are hence inadequate for a comprehensive investigation. Recently, there have been developments on functional regression methods to model density curves as functional outcomes. A major challenge for developing such models lies in the inherent constraint of non-negativity and unit integral for the functional space of density outcomes. To overcome this fundamental issue, we propose Wasserstein Distributional Learning (WDL), a flexible density-on-scalar regression modeling framework that starts with the Wasserstein distance W_2 as a proper metric for the space of density outcomes. We then introduce a heterogeneous and flexible class of Semi-parametric Conditional Gaussian Mixture Models (SCGMM) as the model class 𝔉⊗𝒯. The resulting metric space (𝔉⊗𝒯, W_2) satisfies the required constraints and offers a dense and closed functional subspace. For fitting the proposed model, we further develop an efficient algorithm based on Majorization-Minimization optimization with boosted trees. Compared with methods in the previous literature, WDL better characterizes and uncovers the nonlinear dependence of the conditional densities, and their derived summary statistics. We demonstrate the effectiveness of the WDL framework through simulations and real-world applications.

READ FULL TEXT
research
10/29/2019

Wasserstein F-tests and Confidence Bands for the Frèchet Regression of Density Response Curves

Data consisting of samples of probability density functions are increasi...
research
08/24/2023

Wasserstein Regression with Empirical Measures and Density Estimation for Sparse Data

The problem of modeling the relationship between univariate distribution...
research
12/18/2018

Wasserstein Covariance for Multiple Random Densities

A common feature of methods for analyzing samples of probability density...
research
07/20/2021

Conditional Wasserstein Barycenters and Interpolation/Extrapolation of Distributions

Increasingly complex data analysis tasks motivate the study of the depen...
research
06/22/2020

Wasserstein Autoregressive Models for Density Time Series

Data consisting of time-indexed distributions of cross-sectional or intr...
research
06/27/2023

Wasserstein Generative Regression

In this paper, we propose a new and unified approach for nonparametric r...
research
11/06/2021

Metric Distributional Discrepancy in Metric Space

Independence analysis is an indispensable step before regression analysi...

Please sign up or login with your details

Forgot password? Click here to reset