Asymptotic distribution-free change-point detection for data with repeated observations

06/18/2020
by   Hoseung Song, et al.
0

In the regime of change-point detection, a nonparametric framework based on scan statistics utilizing graphs representing similarities among observations is gaining attention due to its flexibility and good performances for high-dimensional and non-Euclidean data sequences, which are ubiquitous in this big data era. However, this graph-based framework encounters problems when there are repeated observations in the sequence, which often happens for discrete data, such as network data. In this work, we extend the graph-based framework to solve this problem by averaging or taking union of all possible "optimal" graphs resulted from repeated observations. We consider both the single change-point alternative and the changed-interval alternative, and derive analytic formulas to control the type I error for the new methods, making them fast applicable to large data sets. The extended methods are illustrated on an application in detecting changes in a sequence of dynamic networks over time.

READ FULL TEXT
research
03/03/2021

Weighted-Graph-Based Change Point Detection

We consider the detection and localization of change points in the distr...
research
11/12/2017

Graph-Based Two-Sample Tests for Discrete Data

In the regime of two-sample comparison, tests based on a graph construct...
research
06/07/2022

RING-CPD: Asymptotic Distribution-free Change-point Detection for Multivariate and Non-Euclidean Data

Change-point detection (CPD) concerns detecting distributional changes i...
research
06/24/2020

A Fast and Efficient Change-point Detection Framework for Modern Data

Change-point analysis is thriving in this big data era to address proble...
research
03/05/2019

Change-point detection for multivariate and non-Euclidean data with local dependency

In a sequence of multivariate observations or non-Euclidean data objects...
research
10/14/2018

Sequential Change-point Detection for High-dimensional and non-Euclidean Data

In many modern applications, high-dimensional/non-Euclidean data sequenc...
research
06/03/2022

New kernel-based change-point detection

Change-point analysis plays a significant role in various fields to reve...

Please sign up or login with your details

Forgot password? Click here to reset