MONK -- Outlier-Robust Mean Embedding Estimation by Median-of-Means

02/13/2018
by   Matthieu Lerasle, et al.
0

Mean embeddings provide an extremely flexible and powerful tool in machine learning and statistics to represent probability distributions and define a semi-metric (MMD, maximum mean discrepancy; also called N-distance or energy distance), with numerous successful applications. The representation is constructed as the expectation of the feature map defined by a kernel. As a mean, its classical empirical estimator, however, can be arbitrary severely affected even by a single outlier in case of unbounded features. To the best of our knowledge, unfortunately even the consistency of the existing few techniques trying to alleviate this serious sensitivity bottleneck is unknown. In this paper, we show how the recently emerged principle of median-of-means can be used to design minimax-optimal estimators for kernel mean embedding and MMD, with finite-sample strong outlier-robustness guarantees.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/31/2019

Quantum Mean Embedding of Probability Distributions

The kernel mean embedding of probability distributions is commonly used ...
research
06/30/2020

Robust Kernel Density Estimation with Median-of-Means principle

In this paper, we introduce a robust nonparametric density estimator com...
research
05/21/2014

Kernel Mean Shrinkage Estimators

A mean function in a reproducing kernel Hilbert space (RKHS), or a kerne...
research
05/05/2021

Non-asymptotic analysis and inference for an outlyingness induced winsorized mean

Robust estimation of a mean vector, a topic regarded as obsolete in the ...
research
03/01/2015

Sparse Approximation of a Kernel Mean

Kernel means are frequently used to represent probability distributions ...
research
12/12/2019

Finite sample properties of parametric MMD estimation: robustness to misspecification and dependence

Many works in statistics aim at designing a universal estimation procedu...
research
03/12/2020

Asymptotic normality of a generalized maximum mean discrepancy estimator

In this paper, we propose an estimator of the generalized maximum mean d...

Please sign up or login with your details

Forgot password? Click here to reset