Online Heavy-tailed Change-point detection

06/15/2023
by   Abishek Sankararaman, et al.
0

We study algorithms for online change-point detection (OCPD), where samples that are potentially heavy-tailed, are presented one at a time and a change in the underlying mean must be detected as early as possible. We present an algorithm based on clipped Stochastic Gradient Descent (SGD), that works even if we only assume that the second moment of the data generating process is bounded. We derive guarantees on worst-case, finite-sample false-positive rate (FPR) over the family of all distributions with bounded second moment. Thus, our method is the first OCPD algorithm that guarantees finite-sample FPR, even if the data is high dimensional and the underlying distributions are heavy-tailed. The technical contribution of our paper is to show that clipped-SGD can estimate the mean of a random vector and simultaneously provide confidence bounds at all confidence values. We combine this robust estimate with a union bound argument and construct a sequential change-point algorithm with finite-sample FPR guarantees. We show empirically that our algorithm works well in a variety of situations, whether the underlying data are heavy-tailed, light-tailed, high dimensional or discrete. No other algorithm achieves bounded FPR theoretically or empirically, over all settings we study simultaneously.

READ FULL TEXT
research
05/15/2019

Robust change point tests by bounded transformations

Classical moment based change point tests like the cusum test are very p...
research
04/06/2019

A robust bootstrap change point test for high-dimensional location parameter

We consider the problem of change point detection for high-dimensional d...
research
11/29/2022

Outlier-Robust Sparse Mean Estimation for Heavy-Tailed Distributions

We study the fundamental task of outlier-robust mean estimation for heav...
research
03/18/2019

Robust Inference via Multiplier Bootstrap

This paper investigates the theoretical underpinnings of two fundamental...
research
11/02/2021

Inference in high-dimensional online changepoint detection

We introduce and study two new inferential challenges associated with th...
research
01/26/2021

Ethereum ECCPoW

The error-correction code based proof-of-work (ECCPoW) algorithm is base...
research
10/20/2022

A lower confidence sequence for the changing mean of non-negative right heavy-tailed observations with bounded mean

A confidence sequence (CS) is an anytime-valid sequential inference prim...

Please sign up or login with your details

Forgot password? Click here to reset