U-statistics of growing order and sub-Gaussian mean estimators with sharp constants

02/24/2022
by   Stanislav Minsker, et al.
0

This paper addresses the following question: given a sample of i.i.d. random variables with finite variance, can one construct an estimator of the unknown mean that performs nearly as well as if the data were normally distributed? One of the most popular examples achieving this goal is the median of means estimator. However, it is inefficient in a sense that the constants in the resulting bounds are suboptimal. We show that a permutation-invariant modification of the median of means estimator admits deviation guarantees that are sharp up to 1+o(1) factor if the underlying distribution possesses 3+p moments for some p>0 and is absolutely continuous with respect to the Lebesgue measure. This result yields potential improvements for a variety of algorithms that rely on the median of means estimator as a building block. At the core of our argument is a new deviation inequality for the U-statistics of order that is allowed to grow with the sample size, a result that could be of independent interest. Finally, we demonstrate that a hybrid of the median of means and Catoni's estimator is capable of achieving sub-Gaussian deviation guarantees with nearly optimal constants assuming just the existence of the second moment.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/30/2023

Efficient median of means estimator

The goal of this note is to present a modification of the popular median...
research
02/01/2017

Sub-Gaussian estimators of the mean of a random vector

We study the problem of estimating the mean of a random vector X given a...
research
01/22/2021

On the robustness to adversarial corruption and to heavy-tailed data of the Stahel-Donoho median of means

We consider median of means (MOM) versions of the Stahel-Donoho outlying...
research
06/25/2019

Distribution-robust mean estimation via smoothed random perturbations

We consider the problem of mean estimation assuming only finite variance...
research
06/02/2020

Robust and efficient mean estimation: approach based on the properties of self-normalized sums

Let X be a random variable with unknown mean and finite variance. We pre...
research
04/24/2020

Robust subgaussian estimation with VC-dimension

Median-of-means (MOM) based procedures provide non-asymptotic and strong...
research
02/28/2018

Bahadur representations for the bootstrap median absolute deviation and the application to projection depth weighted mean

Median absolute deviation (hereafter MAD) is known as a robust alternati...

Please sign up or login with your details

Forgot password? Click here to reset