The Geometric Median and Applications to Robust Mean Estimation

07/06/2023
by   Stanislav Minsker, et al.
0

This paper is devoted to the statistical and numerical properties of the geometric median, and its applications to the problem of robust mean estimation via the median of means principle. Our main theoretical results include (a) an upper bound for the distance between the mean and the median for general absolutely continuous distributions in R^d, and examples of specific classes of distributions for which these bounds do not depend on the ambient dimension d; (b) exponential deviation inequalities for the distance between the sample and the population versions of the geometric median, which again depend only on the trace-type quantities and not on the ambient dimension. As a corollary, we deduce improved bounds for the (geometric) median of means estimator that hold for large classes of heavy-tailed distributions. Finally, we address the error of numerical approximation, which is an important practical aspect of any statistical estimation procedure. We demonstrate that the objective function minimized by the geometric median satisfies a "local quadratic growth" condition that allows one to translate suboptimality bounds for the objective function to the corresponding bounds for the numerical approximation to the median itself, and propose a simple stopping rule applicable to any optimization method which yields explicit error guarantees. We conclude with the numerical experiments including the application to estimation of mean values of log-returns for S P 500 data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/24/2020

Robust subgaussian estimation with VC-dimension

Median-of-means (MOM) based procedures provide non-asymptotic and strong...
research
04/09/2017

Distributed Statistical Estimation and Rates of Convergence in Normal Approximation

This paper presents new algorithms for distributed statistical estimatio...
research
06/09/2020

How Robust is the Median-of-Means? Concentration Bounds in Presence of Outliers

In contrast to the empirical mean, the Median-of-Means (MoM) is an estim...
research
06/11/2020

Robust Optimization and Inference on Manifolds

We propose a robust and scalable procedure for general optimization and ...
research
03/27/2013

Appropriate and Inappropriate Estimation Techniques

Mode also called MAP estimation, mean estimation and median estimation a...
research
01/25/2023

Robust non-parametric regression via median-of-means

In this paper, we apply the median-of-means principle to derive robust v...
research
04/03/2023

Online stochastic Newton methods for estimating the geometric median and applications

In the context of large samples, a small number of individuals might spo...

Please sign up or login with your details

Forgot password? Click here to reset