Distribution Regression with Sliced Wasserstein Kernels

02/08/2022
by Dimitri Meunier, et al.

The problem of learning functions over spaces of probabilities, known as distribution regression, is gaining significant interest in the machine learning community. A key challenge is to identify a suitable representation that captures all relevant properties of the underlying functional mapping. A principled approach to distribution regression is provided by kernel mean embeddings, which lift kernel-induced similarity on the input domain to the probability level. This strategy effectively tackles the two-stage sampling nature of the problem, enabling one to derive estimators with strong statistical guarantees, such as universal consistency and excess risk bounds. However, kernel mean embeddings implicitly hinge on the maximum mean discrepancy (MMD), a metric on probabilities that may fail to capture key geometrical relations between distributions. In contrast, optimal transport (OT) metrics are potentially more appealing, as documented by the recent literature on the topic. In this work, we propose the first OT-based estimator for distribution regression. We build on the Sliced Wasserstein distance to obtain an OT-based representation. We study the theoretical properties of a kernel ridge regression estimator based on this representation, for which we prove universal consistency and excess risk bounds. Preliminary experiments complement our theoretical findings by showing the effectiveness of the proposed approach and comparing it with MMD-based estimators.
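To make the pipeline described above concrete, here is a minimal sketch in Python (NumPy) of kernel ridge regression on bags of samples with a kernel built from the Sliced Wasserstein distance. It is an illustration under stated assumptions, not the authors' implementation: the Gaussian-type kernel exp(-gamma * SW_2^2), the Monte Carlo approximation with random directions and a fixed quantile grid, the toy data, and all names (`sliced_w2_squared`, `sw_gram`, `krr_fit`, `gamma`, `lam`) are hypothetical choices made for this example.

```python
import numpy as np

def random_directions(num, dim, rng):
    """Draw `num` directions uniformly on the unit sphere in R^dim."""
    theta = rng.normal(size=(num, dim))
    return theta / np.linalg.norm(theta, axis=1, keepdims=True)

def sliced_w2_squared(x, y, directions, levels):
    """Monte Carlo estimate of the squared Sliced Wasserstein-2 distance.

    x: (n, d) and y: (m, d) are samples from two distributions.
    Each sample is projected on every direction, and the 1D squared
    Wasserstein-2 distance is approximated by comparing empirical
    quantiles on the grid `levels`.
    """
    qx = np.quantile(x @ directions.T, levels, axis=0)  # (num_levels, num_dirs)
    qy = np.quantile(y @ directions.T, levels, axis=0)
    return np.mean((qx - qy) ** 2)

def sw_gram(bags_a, bags_b, directions, levels, gamma):
    """Gram matrix of the assumed kernel exp(-gamma * SW_2^2) between bags."""
    d2 = np.array([[sliced_w2_squared(a, b, directions, levels)
                    for b in bags_b] for a in bags_a])
    return np.exp(-gamma * d2)

def krr_fit(K, y, lam):
    """Kernel ridge regression coefficients: solve (K + n * lam * I) alpha = y."""
    n = K.shape[0]
    return np.linalg.solve(K + n * lam * np.eye(n), y)

rng = np.random.default_rng(0)
directions = random_directions(50, 2, rng)
levels = np.linspace(0.05, 0.95, 19)

def make_bags(num_bags, bag_size):
    """Toy two-stage sampling: draw a mean per bag, then points around it.

    The regression target is the first coordinate of the (unobserved) mean.
    """
    means = rng.uniform(-2, 2, size=(num_bags, 2))
    bags = [m + rng.normal(size=(bag_size, 2)) for m in means]
    return bags, means[:, 0]

train_bags, y_train = make_bags(40, 100)
test_bags, y_test = make_bags(10, 100)

K_train = sw_gram(train_bags, train_bags, directions, levels, gamma=0.5)
alpha = krr_fit(K_train, y_train, lam=1e-3)
y_pred = sw_gram(test_bags, train_bags, directions, levels, gamma=0.5) @ alpha
print("test RMSE:", np.sqrt(np.mean((y_pred - y_test) ** 2)))
```

The learner never sees the underlying distributions, only finite bags drawn from them, which mirrors the two-stage sampling setting mentioned in the abstract. Swapping `sw_gram` for an MMD-based Gram matrix would give a baseline of the kind the abstract compares against.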

research
03/01/2018

Wasserstein Distance Measure Machines

This paper presents a distance-based discriminative framework for learni...
research
10/15/2021

Kernel Minimum Divergence Portfolios

Portfolio optimization is a key challenge in finance with the aim of cre...
research
08/28/2023

Improved learning theory for kernel distribution regression with two-stage sampling

The distribution regression problem encompasses many important statistic...
research
02/08/2020

Statistical Optimal Transport posed as Learning Kernel Embedding

This work takes the novel approach of posing the statistical Optimal Tra...
research
11/08/2014

Learning Theory for Distribution Regression

We focus on the distribution regression problem: regressing to vector-va...
research
06/17/2020

Kernel Alignment Risk Estimator: Risk Prediction from Training Data

We study the risk (i.e. generalization error) of Kernel Ridge Regression...
research
02/07/2014

Two-stage Sampled Learning Theory on Distributions

We focus on the distribution regression problem: regressing to a real-va...
