New efficient algorithms for multiple change-point detection with kernels

10/12/2017
by   Alain Celisse, et al.
0

Several statistical approaches based on reproducing kernels have been proposed to detect abrupt changes arising in the full distribution of the observations and not only in the mean or variance. Some of these approaches enjoy good statistical properties (oracle inequality, ...). Nonetheless, they have a high computational cost both in terms of time and memory. This makes their application difficult even for small and medium sample sizes (n< 10^4). This computational issue is addressed by first describing a new efficient and exact algorithm for kernel multiple change-point detection with an improved worst-case complexity that is quadratic in time and linear in space. It allows dealing with medium size signals (up to n ≈ 10^5). Second, a faster but approximation algorithm is described. It is based on a low-rank approximation to the Gram matrix. It is linear in time and space. This approximation algorithm can be applied to large-scale signals (n ≥ 10^6). These exact and approximation algorithms have been implemented in R and C for various kernels. The computational and statistical performances of these new algorithms have been assessed through empirical experiments. The runtime of the new algorithms is observed to be faster than that of other considered procedures. Finally, simulations confirmed the higher statistical accuracy of kernel-based approaches to detect changes that are not only in the mean. These simulations also illustrate the flexibility of kernel-based approaches to analyze complex biological profiles made of DNA copy number and allele B frequencies. An R package implementing the approach will be made available on github.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/25/2022

Variance change point detection with credible sets

This paper introduces a novel Bayesian approach to detect changes in the...
research
10/11/2022

Sequential change-point detection: Computation versus statistical performance

Change-point detection studies the problem of detecting the changes in t...
research
07/16/2019

Alternating Dynamic Programming for Multiple Epidemic Change-Point Estimation

In this paper, we study the problem of multiple change-point detection f...
research
08/02/2016

Hierarchically Compositional Kernels for Scalable Nonparametric Learning

We propose a novel class of kernels to alleviate the high computational ...
research
04/29/2018

A linear time algorithm for multiscale quantile simulation

Change-point problems have appeared in a great many applications for exa...
research
01/14/2023

Compress Then Test: Powerful Kernel Testing in Near-linear Time

Kernel two-sample testing provides a powerful framework for distinguishi...
research
05/01/2012

A Randomized Mirror Descent Algorithm for Large Scale Multiple Kernel Learning

We consider the problem of simultaneously learning to linearly combine a...

Please sign up or login with your details

Forgot password? Click here to reset