Catoni-style confidence sequences for heavy-tailed mean estimation

02/02/2022
by   Hongjian Wang, et al.

A confidence sequence (CS) is a sequence of confidence intervals that is valid at arbitrary data-dependent stopping times. Confidence sequences are used in a growing range of applications involving sequential experimentation, such as A/B testing, multi-armed bandits, off-policy evaluation, and election auditing. In this paper, we present three approaches to constructing a confidence sequence for the population mean under the very weak assumption that only an upper bound on the variance is known. Whereas previous works all rely on stringent light-tail assumptions such as boundedness or sub-Gaussianity (under which all moments of the distribution exist), the confidence sequences in our work handle data from a wide range of heavy-tailed distributions (where no moment beyond the second is required to exist). Moreover, we show that even under this weak assumption, the best of our three methods, the Catoni-style confidence sequence, performs remarkably well in terms of tightness, essentially matching the best methods for sub-Gaussian data. Our findings have important practical implications for experiments with unbounded observations, since the finite-variance assumption is often more realistic and easier to verify than sub-Gaussianity.
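
The abstract does not spell out the construction, so the following is only a rough sketch of how a Catoni-style interval could be computed, not the paper's reference implementation. It assumes the standard Catoni influence function phi(x) = log(1 + x + x^2/2) for x >= 0 (and -log(1 - x + x^2/2) otherwise), a known variance bound `sigma2`, and a deterministic tuning sequence `lams` that is a hypothetical choice made here for illustration. The interval at time t is taken to be the set of m for which the sum of phi(lam_i * (x_i - m)) stays within plus or minus (sigma2 * sum(lam_i^2) / 2 + log(2/alpha)); because that sum is strictly decreasing in m, both endpoints can be found by bisection. The function names `phi` and `catoni_interval` are illustrative.

```python
import math
import random


def phi(x):
    """Catoni-type influence function; grows only logarithmically in |x|."""
    if x >= 0:
        return math.log(1.0 + x + 0.5 * x * x)
    return -math.log(1.0 - x + 0.5 * x * x)


def catoni_interval(xs, sigma2, alpha=0.05):
    """Sketch of a Catoni-style interval at time t = len(xs).

    Assumes Var(X) <= sigma2. The tuning sequence below is an assumed,
    illustrative choice, not necessarily the one used in the paper.
    """
    t = len(xs)
    log_term = math.log(2.0 / alpha)
    lams = [math.sqrt(2.0 * log_term / (sigma2 * i * math.log(i + 1.0)))
            for i in range(1, t + 1)]
    budget = 0.5 * sigma2 * sum(l * l for l in lams) + log_term

    def f(m):
        # Continuous and strictly decreasing in m.
        return sum(phi(l * (x - m)) for l, x in zip(lams, xs))

    def root(target):
        # Bracket the unique solution of f(m) = target, then bisect.
        lo, hi = min(xs) - 1.0, max(xs) + 1.0
        step = 1.0
        while f(lo) < target:
            lo -= step
            step *= 2.0
        step = 1.0
        while f(hi) > target:
            hi += step
            step *= 2.0
        for _ in range(200):
            mid = 0.5 * (lo + hi)
            if f(mid) > target:
                lo = mid
            else:
                hi = mid
        return 0.5 * (lo + hi)

    # f(m) <= budget gives the lower endpoint; f(m) >= -budget the upper one.
    return root(budget), root(-budget)


# Usage: Pareto data with shape 3 has mean 1.5 and variance 0.75,
# but no finite third moment, so it is heavy-tailed in the abstract's sense.
random.seed(0)
sample = [random.paretovariate(3.0) for _ in range(1000)]
lower, upper = catoni_interval(sample, sigma2=0.75, alpha=0.05)
print(f"interval after {len(sample)} observations: [{lower:.3f}, {upper:.3f}]")
```

Calling `catoni_interval` on each growing prefix of the data (and, optionally, intersecting the results over time) would give the sequence of intervals. Bisection is used only because the defining sum is monotone in m; any one-dimensional root finder would serve equally well.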


research
08/05/2022

Catoni-style Confidence Sequences under Infinite Variance

In this paper, we provide an extension of confidence sequences for setti...
research
10/20/2022

A lower confidence sequence for the changing mean of non-negative right heavy-tailed observations with bounded mean

A confidence sequence (CS) is an anytime-valid sequential inference prim...
research
01/23/2023

Huber-Robust Confidence Sequences

Confidence sequences are confidence intervals that can be sequentially t...
research
07/06/2021

Distributed Adaptive Huber Regression

Distributed data naturally arise in scenarios involving multiple sources...
research
09/19/2018

Mean Estimation with Sub-Gaussian Rates in Polynomial Time

We study polynomial time algorithms for estimating the mean of a heavy-t...
research
10/18/2018

Uniform, nonparametric, non-asymptotic confidence sequences

A confidence sequence is a sequence of confidence intervals that is unif...
research
02/14/2018

ICA based on Split Generalized Gaussian

Independent Component Analysis (ICA) - one of the basic tools in data an...
