Schoenberg-Rao distances: Entropy-based and geometry-aware statistical Hilbert distances

02/19/2020
by   Gaëtan Hadjeres, et al.
22

Distances between probability distributions that take into account the geometry of their sample space,like the Wasserstein or the Maximum Mean Discrepancy (MMD) distances have received a lot of attention in machine learning as they can, for instance, be used to compare probability distributions with disjoint supports. In this paper, we study a class of statistical Hilbert distances that we term the Schoenberg-Rao distances, a generalization of the MMD that allows one to consider a broader class of kernels, namely the conditionally negative semi-definite kernels. In particular, we introduce a principled way to construct such kernels and derive novel closed-form distances between mixtures of Gaussian distributions, among others. These distances, derived from the concave Rao's quadratic entropy, enjoy nice theoretical properties and possess interpretable hyperparameters which can be tuned for specific applications. Our method constitutes a practical alternative to Wasserstein distances and we illustrate its efficiency on a broad range of machine learning tasks such as density estimation, generative modeling and mixture simplification.

READ FULL TEXT
research
11/10/2015

Sliced Wasserstein Kernels for Probability Distributions

Optimal transport distances, otherwise known as Wasserstein distances, h...
research
02/28/2020

Generalized Sliced Distances for Probability Distributions

Probability metrics have become an indispensable part of modern statisti...
research
01/31/2017

Gaussian Process Regression Model for Distribution Inputs

Monge-Kantorovich distances, otherwise known as Wasserstein distances, h...
research
10/28/2020

Intrinsic Sliced Wasserstein Distances for Comparing Collections of Probability Distributions on Manifolds and Graphs

Collections of probability distributions arise in a variety of statistic...
research
10/14/2020

Measuring the originality of intellectual property assets based on machine learning outputs

Originality criteria are frequently used to assess the validity of intel...
research
06/19/2019

GEAR: Geometry-Aware Rényi Information

Shannon's seminal theory of information has been of paramount importance...
research
12/22/2021

Robust learning of data anomalies with analytically-solvable entropic outlier sparsification

Entropic Outlier Sparsification (EOS) is proposed as a robust computatio...

Please sign up or login with your details

Forgot password? Click here to reset