A Comparison of Single and Multiple Changepoint Techniques for Time Series Data

01/06/2021
by   Xueheng Shi, et al.
0

This paper describes and compares several prominent single and multiple changepoint techniques for time series data. Due to their importance in inferential matters, changepoint research on correlated data has accelerated recently. Unfortunately, small perturbations in model assumptions can drastically alter changepoint conclusions; for example, heavy positive correlation in a time series can be misattributed to a mean shift should correlation be ignored. This paper considers both single and multiple changepoint techniques. The paper begins by examining cumulative sum (CUSUM) and likelihood ratio tests and their variants for the single changepoint problem; here, various statistics, boundary cropping scenarios, and scaling methods (e.g., scaling to an extreme value or Brownian Bridge limit) are compared. A recently developed test based on summing squared CUSUM statistics over all times is shown to have realistic Type I errors and superior detection power. The paper then turns to the multiple changepoint setting. Here, penalized likelihoods drive the discourse, with AIC, BIC, mBIC, and MDL penalties being considered. Binary and wild binary segmentation techniques are also compared. We introduce a new distance metric specifically designed to compare two multiple changepoint segmentations. Algorithmic and computational concerns are discussed and simulations are provided to support all conclusions. In the end, the multiple changepoint setting admits no clear methodological winner, performance depending on the particular scenario. Nonetheless, some practical guidance will emerge.

READ FULL TEXT

page 9

page 10

research
12/06/2022

Good Practices and Common Pitfalls in Climate Time Series Changepoint Techniques: A Review

Climate changepoint (homogenization) methods abound today, with a myriad...
research
08/20/2020

Exact Tests for Offline Changepoint Detection in Multichannel Binary and Count Data with Application to Networks

We consider offline detection of a single changepoint in binary and coun...
research
02/26/2018

Partial Distance Correlation Screening for High Dimensional Time Series

High dimensional time series datasets are becoming increasingly common i...
research
05/30/2023

Learning Perturbations to Explain Time Series Predictions

Explaining predictions based on multivariate time series data carries th...
research
08/18/2020

A Formally Robust Time Series Distance Metric

Distance-based classification is among the most competitive classificati...
research
01/06/2023

A Robust Data-driven Process Modeling Applied to Time-series Stochastic Power Flow

In this paper, we propose a robust data-driven process model whose hyper...

Please sign up or login with your details

Forgot password? Click here to reset