Data segmentation algorithms: Univariate mean change and beyond

12/23/2020
by   Haeran Cho, et al.
0

Data segmentation a.k.a. multiple change point analysis has received considerable attention due to its importance in time series analysis and signal processing, with applications in a variety of fields including natural and social sciences, medicine, engineering and finance. In the first part of this survey, we review the existing literature on the canonical data segmentation problem which aims at detecting and localising multiple change points in the mean of univariate time series. We provide an overview of popular methodologies on their computational complexity and theoretical properties. In particular, our theoretical discussion focuses on the separation rate relating to which change points are detectable by a given procedure, and the localisation rate quantifying the precision of corresponding change point estimators, and we distinguish between whether a homogeneous or multiscale viewpoint has been adopted in their derivation. We further highlight that the latter viewpoint provides the most general setting for investigating the optimality of data segmentation algorithms. Arguably, the canonical segmentation problem has been the most popular framework to propose new data segmentation algorithms and study their efficiency in the last decades. In the second part of this survey, we motivate the importance of attaining an in-depth understanding of strengths and weaknesses of methodologies for the change point problem in a simpler, univariate setting, as a stepping stone for the development of methodologies for more complex problems. We illustrate this with a range of examples showcasing the connections between complex distributional changes and those in the mean. We also discuss extensions towards high-dimensional change point problems where we demonstrate that the challenges arising from high dimensionality are orthogonal to those in dealing with multiple change points.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/28/2019

Localised pruning for data segmentation based on multiscale change point procedures

The segmentation of a time series into piecewise stationary segments is ...
research
10/22/2018

Univariate Mean Change Point Detection: Penalization, CUSUM and Optimality

The problem of univariate mean change point detection and localization b...
research
05/24/2019

Optimal nonparametric change point detection and localization

We study change point detection and localization for univariate data in ...
research
01/12/2021

Moving sum data segmentation for stochastics processes based on invariance

The segmentation of data into stationary stretches also known as multipl...
research
02/16/2020

Seeded Binary Segmentation: A general methodology for fast and optimal change point detection

In recent years, there has been an increasing demand on efficient algori...
research
05/23/2022

Robust multiscale estimation of time-average variance for time series segmentation

There exist several methods developed for the canonical change point pro...
research
06/15/2023

Geometric-Based Pruning Rules For Change Point Detection in Multiple Independent Time Series

We consider the problem of detecting multiple changes in multiple indepe...

Please sign up or login with your details

Forgot password? Click here to reset