Post-Selection Inference for Changepoint Detection Algorithms with Application to Copy Number Variation Data

12/10/2018
by Sangwon Hyun, et al.

Changepoint detection methods are used in many areas of science and engineering, e.g., in the analysis of copy number variation data, to detect abnormalities in copy numbers along the genome. Despite the broad array of available tools, methodology for quantifying our uncertainty in the strength (or presence) of given changepoints, post-detection, is lacking. Post-selection inference offers a framework to fill this gap, but the most straightforward application of these methods results in low-powered tests and leaves open several important questions about practical usability. In this work, we carefully tailor post-selection inference methods towards changepoint detection, focusing as our main scientific application on copy number variation data. As for changepoint algorithms, we study binary segmentation and two of its most popular variants, wild and circular binary segmentation, as well as the fused lasso. We implement some of the latest developments in post-selection inference theory: we use auxiliary randomization to improve power, which requires implementations of MCMC algorithms (importance sampling and hit-and-run sampling) to carry out our tests. We also provide recommendations for improving practical usability, detailed simulations, and an example analysis on array comparative genomic hybridization (CGH) data.
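For readers unfamiliar with binary segmentation, the following is a minimal sketch of the standard detection step only, assuming a piecewise-constant mean, a CUSUM split statistic, and a fixed number of steps. It is not the paper's inference procedure, and the function names (`cusum_split`, `binary_segmentation`) are hypothetical names chosen for illustration.

```python
import numpy as np


def cusum_split(y, s, e):
    """Best split of the half-open segment y[s:e] by the CUSUM statistic.

    Returns (split, stat), where split is the index of the first point of
    the right-hand piece, or (None, 0.0) if the segment cannot be split.
    """
    n = e - s
    if n < 2:
        return None, 0.0
    csum = np.cumsum(y[s:e])
    left_n = np.arange(1, n)              # sizes of the left piece
    right_n = n - left_n                  # sizes of the right piece
    left_mean = csum[:-1] / left_n
    right_mean = (csum[-1] - csum[:-1]) / right_n
    stats = np.sqrt(left_n * right_n / n) * np.abs(right_mean - left_mean)
    i = int(np.argmax(stats))
    return s + int(left_n[i]), float(stats[i])


def binary_segmentation(y, num_steps):
    """Greedy binary segmentation run for a fixed number of steps (assumed)."""
    y = np.asarray(y, dtype=float)
    segments = [(0, len(y))]
    changepoints = []
    for _ in range(num_steps):
        # Score the best split inside every current segment.
        candidates = [(cusum_split(y, s, e), (s, e)) for s, e in segments]
        candidates = [c for c in candidates if c[0][0] is not None]
        if not candidates:
            break
        # Take the split with the largest statistic across all segments.
        (split, _stat), (s, e) = max(candidates, key=lambda c: c[0][1])
        changepoints.append(split)
        segments.remove((s, e))
        segments += [(s, split), (split, e)]
    return changepoints
```

Called as `binary_segmentation(y, 2)`, the sketch returns up to two estimated changepoint locations in the order they were found. Because those locations are chosen from the same data, a naive test at each location is biased, which is the gap the abstract says post-selection inference is meant to fill.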


research
10/09/2019

Testing for a Change in Mean After Changepoint Detection

While many methods are available to detect structural changes in a time ...
research
01/13/2023

Improving Power by Conditioning on Less in Post-selection Inference for Changepoints

Post-selection inference has recently been proposed as a way of quantify...
research
08/17/2012

An Evaluation of Popular Copy-Move Forgery Detection Approaches

A copy-move forgery is created by copying and pasting content within the...
research
11/18/2020

Post-Selection Inference via Algorithmic Stability

Modern approaches to data analysis make extensive use of data-driven mod...
research
12/29/2021

Exact Post-selection Inference For Tracking S&P500

The problem that is solved in this paper is known as index tracking. The...
research
05/21/2023

A parametric distribution for exact post-selection inference with data carving

Post-selection inference (PoSI) is a statistical technique for obtaining...
research
12/05/2019

Almost-monochromatic sets and the chromatic number of the plane

In a colouring of R^d a pair (S,s_0) with S⊆R^d and with s_0∈ S is almos...
