Intrinsic Sliced Wasserstein Distances for Comparing Collections of Probability Distributions on Manifolds and Graphs

10/28/2020
by   Raif M. Rustamov, et al.

Collections of probability distributions arise in a variety of statistical applications ranging from user activity pattern analysis to brain connectomics. In practice these distributions are represented by histograms over diverse domain types including finite intervals, circles, cylinders, spheres, other manifolds, and graphs. This paper introduces an approach for detecting differences between two collections of histograms over such general domains. To this end, we introduce the intrinsic slicing construction that yields a novel class of Wasserstein distances on manifolds and graphs. These distances are Hilbert embeddable, which allows us to reduce the histogram collection comparison problem to the comparison of means in a high-dimensional Euclidean space. We develop a hypothesis testing procedure based on conducting t-tests on each dimension of this embedding, then combining the resulting p-values using recently proposed p-value combination techniques. Our numerical experiments in a variety of data settings show that the resulting tests are powerful and the p-values are well-calibrated. Example applications to user activity patterns, spatial data, and brain connectomics are provided.
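The testing step described above (t-tests on each embedding dimension, followed by p-value combination) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. It takes the Hilbert embedding as already computed, so each histogram is a row vector. The function names `cauchy_combination` and `embedding_mean_test` are hypothetical. The abstract does not name the combination technique, so the equal-weight Cauchy combination rule is used here as one possible choice, and the Welch (unequal-variance) form of the t-test is likewise an assumption.

```python
import numpy as np
from scipy import stats


def cauchy_combination(p_values):
    """Combine p-values with an equal-weight Cauchy combination rule.

    Assumption: this rule stands in for the "recently proposed p-value
    combination techniques" that the abstract mentions without naming.
    """
    # Clip away exact 0 and 1 so the tangent transform stays finite.
    p = np.clip(np.asarray(p_values, dtype=float), 1e-15, 1.0 - 1e-15)
    # Map each p-value to a Cauchy-distributed variate and average them.
    statistic = np.mean(np.tan((0.5 - p) * np.pi))
    # Upper tail of the standard Cauchy distribution gives the combined p-value.
    return float(stats.cauchy.sf(statistic))


def embedding_mean_test(emb_a, emb_b):
    """Run a Welch t-test on each embedding dimension, then combine.

    emb_a, emb_b: arrays of shape (n_a, d) and (n_b, d), one embedded
    histogram per row (the embedding itself is assumed to be given).
    Returns the combined p-value and the per-dimension p-values.
    """
    emb_a = np.asarray(emb_a, dtype=float)
    emb_b = np.asarray(emb_b, dtype=float)
    _, p_per_dim = stats.ttest_ind(emb_a, emb_b, axis=0, equal_var=False)
    # Dimensions that are constant in both groups yield NaN; skip them.
    usable = p_per_dim[np.isfinite(p_per_dim)]
    return cauchy_combination(usable), p_per_dim


# Usage on synthetic data (hypothetical: 16-dimensional embeddings).
rng = np.random.default_rng(0)
emb_a = rng.normal(0.0, 1.0, size=(40, 16))
emb_b = rng.normal(0.0, 1.0, size=(50, 16))
emb_b[:, 0] += 3.0  # shift the mean of one dimension in the second collection
p_combined, p_per_dim = embedding_mean_test(emb_a, emb_b)
print(p_combined)
```

Combining per-dimension p-values, rather than running a single multivariate test, keeps each test one-dimensional, and the Cauchy rule is chosen in this sketch because it does not assume the per-dimension tests are independent.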


