A log-linear time algorithm for constrained changepoint detection

03/09/2017
by   Toby Dylan Hocking, et al.
0

Changepoint detection is a central problem in time series and genomic data. For some applications, it is natural to impose constraints on the directions of changes. One example is ChIP-seq data, for which adding an up-down constraint improves peak detection accuracy, but makes the optimization problem more complicated. We show how a recently proposed functional pruning technique can be adapted to solve such constrained changepoint detection problems. This leads to a new algorithm which can solve problems with arbitrary affine constraints on adjacent segment means, and which has empirical time complexity that is log-linear in the amount of data. This algorithm achieves state-of-the-art accuracy in a benchmark of several genomic data sets, and is orders of magnitude faster than existing algorithms that have similar accuracy. Our implementation is available as the PeakSegPDPA function in the coseg R package, https://github.com/tdhock/coseg

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/29/2018

Generalized Functional Pruning Optimal Partitioning (GFPOP) for Constrained Changepoint Detection in Genomic Data

We describe a new algorithm and R package for peak detection in genomic ...
research
06/15/2023

Geometric-Based Pruning Rules For Change Point Detection in Multiple Independent Time Series

We consider the problem of detecting multiple changes in multiple indepe...
research
07/08/2020

Linear-Time Algorithms for Adaptive Submodular Maximization

In this paper, we develop fast algorithms for two stochastic submodular ...
research
06/03/2015

PeakSegJoint: fast supervised peak detection via joint segmentation of multiple count data samples

Joint peak detection is a central problem when comparing samples in geno...
research
10/11/2017

Maximum Margin Interval Trees

Learning a regression function using censored or interval-valued output ...
research
08/13/2018

Speeding Up Constrained k-Means Through 2-Means

For the constrained 2-means problem, we present a O(dn+d(1ϵ)^O(1ϵ) n) ti...
research
10/05/2022

Functional Labeled Optimal Partitioning

Peak detection is a problem in sequential data analysis that involves di...

Please sign up or login with your details

Forgot password? Click here to reset