Optimal Sub-Gaussian Mean Estimation in ℝ

11/17/2020
by Jasper C. H. Lee, et al.

We revisit the problem of estimating the mean of a real-valued distribution, presenting a novel estimator with sub-Gaussian convergence: intuitively, "our estimator, on any distribution, is as accurate as the sample mean is for the Gaussian distribution of matching variance." Crucially, in contrast to prior works, our estimator does not require prior knowledge of the variance, and works across the entire gamut of distributions with bounded variance, including those without any higher moments. Parameterized by the sample size n, the failure probability δ, and the variance σ^2, our estimator is accurate to within σ·(1+o(1))·√(2 log(1/δ)/n), tight up to the 1+o(1) factor. Our estimator construction and analysis give a framework generalizable to other problems, tightly analyzing a sum of dependent random variables by viewing the sum implicitly as a 2-parameter ψ-estimator, and constructing bounds using mathematical programming and duality techniques.
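To give a feel for the scale of the stated accuracy guarantee, the leading-order term σ·√(2 log(1/δ)/n) can be evaluated numerically. The sketch below is only an illustration of that formula, not the paper's estimator: it drops the 1+o(1) factor, assumes the natural logarithm, and the function name `sub_gaussian_radius` is hypothetical.

```python
import math


def sub_gaussian_radius(sigma: float, n: int, delta: float) -> float:
    """Leading-order error radius sigma * sqrt(2 * ln(1/delta) / n).

    Assumptions (not taken from the paper's text): the logarithm is
    natural, and the 1 + o(1) factor is ignored.
    """
    if sigma < 0:
        raise ValueError("sigma must be non-negative")
    if n <= 0:
        raise ValueError("n must be a positive sample size")
    if not 0.0 < delta < 1.0:
        raise ValueError("delta must lie strictly between 0 and 1")
    return sigma * math.sqrt(2.0 * math.log(1.0 / delta) / n)


if __name__ == "__main__":
    # Example: unit variance, 10,000 samples, 1% failure probability.
    print(sub_gaussian_radius(sigma=1.0, n=10_000, delta=0.01))
```

Under these assumptions the radius scales linearly in σ, shrinks like 1/√n, and grows only like √(log(1/δ)) as the failure probability decreases.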

research
08/27/2021

Quantum Sub-Gaussian Mean Estimator

We present a new quantum algorithm for estimating the mean of a real-val...
research
02/06/2019

Fast Mean Estimation with Sub-Gaussian Rates

We propose an estimator for the mean of a random vector in R^d that can ...
research
06/02/2020

Robust and efficient mean estimation: approach based on the properties of self-normalized sums

Let X be a random variable with unknown mean and finite variance. We pre...
research
06/12/2020

Stochastic Analysis of Collision Estimator

We prove a strong concentration result about the collision estimator, wh...
research
01/26/2019

On strict sub-Gaussianity, optimal proxy variance and symmetry for bounded random variables

We investigate the sub-Gaussian property for almost surely bounded rando...
research
02/16/2021

Sample variance of rounded variables

If the rounding errors are assumed to be distributed independently from ...
research
02/03/2021

CountSketches, Feature Hashing and the Median of Three

In this paper, we revisit the classic CountSketch method, which is a spa...
