Geometry of Sample Spaces

10/15/2020
by Philipp Harms, et al.

In statistics, independent, identically distributed random samples do not carry a natural ordering, and their statistics are typically invariant with respect to permutations of their order. Thus, an n-sample in a space M can be considered as an element of the quotient space of M^n modulo the permutation group. The present paper takes this definition of sample space and the related concept of orbit types as a starting point for developing a geometric perspective on statistics. We aim at deriving a general mathematical setting for studying the behavior of empirical and population means in spaces ranging from smooth Riemannian manifolds to general stratified spaces. We fully describe the orbifold and path-metric structure of the sample space when M is a manifold or path-metric space, respectively. These results are non-trivial even when M is Euclidean. We show that the infinite sample space exists in a Gromov-Hausdorff type sense and coincides with the Wasserstein space of probability distributions on M. We exhibit Fréchet means and k-means as metric projections onto 1-skeleta or k-skeleta in Wasserstein space, and we define a new and more general notion of polymeans. This geometric characterization via metric projections applies equally to sample and population means, and we use it to establish asymptotic properties of polymeans such as consistency and asymptotic normality.
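
The quotient construction and the projection view of the mean can be illustrated in the simplest case, M = the real line. The sketch below is not taken from the paper. It assumes the sample space carries the metric obtained by minimizing the normalized Euclidean distance over all reorderings, and it reads the "1-skeleton" as the set of samples whose n points all coincide; both readings are assumptions made for illustration. The function names `quotient_distance`, `sorted_distance`, and `distance_to_diagonal` are hypothetical.

```python
import itertools
import math


def quotient_distance(x, y):
    """Distance between two unordered n-samples of real numbers.

    Minimizes the normalized Euclidean distance over all reorderings of y
    (brute force, so only suitable for small n).
    """
    n = len(x)
    best = min(
        sum((a - b) ** 2 for a, b in zip(x, perm))
        for perm in itertools.permutations(y)
    )
    return math.sqrt(best / n)


def sorted_distance(x, y):
    """Same distance on the real line, computed by matching sorted samples."""
    n = len(x)
    return math.sqrt(
        sum((a - b) ** 2 for a, b in zip(sorted(x), sorted(y))) / n
    )


def distance_to_diagonal(x, m):
    """Distance from the sample x to the constant sample (m, ..., m)."""
    return quotient_distance(x, [m] * len(x))


sample = [0.0, 1.0, 4.0, 7.0]
other = [6.5, 0.5, 3.0, 1.5]
mean = sum(sample) / len(sample)

# Candidate constant samples on a grid from 0.0 to 7.0 in steps of 0.25.
grid = [k / 4 for k in range(0, 29)]
closest = min(grid, key=lambda m: distance_to_diagonal(sample, m))
```

On the real line the brute-force minimum over reorderings agrees with the distance between sorted samples, which is the 2-Wasserstein distance between the two empirical measures. Among the constant samples on the grid, the one closest to `sample` sits at the arithmetic mean, which is the sense in which the mean acts as a metric projection in this toy setting.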

research
03/16/2018

Natural gradient via optimal transport I

We study a natural Wasserstein gradient flow on manifolds of probability...
research
10/19/2020

On the Consistency of Metric and Non-Metric K-medoids

We establish the consistency of K-medoids in the context of metric space...
research
07/07/2021

Geometric averages of partitioned datasets

We introduce a method for jointly registering ensembles of partitioned d...
research
11/23/2020

Level sets of depth measures and central dispersion in abstract spaces

The lens depth of a point has recently been extended to general metric ...
research
12/01/2021

Diffusion Mean Estimation on the Diagonal of Product Manifolds

Computing sample means on Riemannian manifolds is typically computationa...
research
09/07/2020

Ensemble Riemannian Data Assimilation over the Wasserstein Space

In this paper, we present a new ensemble data assimilation paradigm over...
research
07/29/2022

Tangential Wasserstein Projections

We develop a notion of projections between sets of probability measures ...
