A central limit theorem for the Benjamini-Hochberg false discovery proportion under a factor model

04/18/2021
by   Dan M. Kluger, et al.
0

The Benjamini-Hochberg (BH) procedure remains widely popular despite having limited theoretical guarantees in the commonly encountered scenario of correlated test statistics. Of particular concern is the possibility that the method could exhibit bursty behavior, meaning that it might typically yield no false discoveries while occasionally yielding both a large number of false discoveries and a false discovery proportion (FDP) that far exceeds its own well controlled mean. In this paper, we investigate which test statistic correlation structures lead to bursty behavior and which ones lead to well controlled FDPs. To this end, we develop a central limit theorem for the FDP in a multiple testing setup where the test statistic correlations can be either short-range or long-range as well as either weak or strong. The theorem and our simulations from a data-driven factor model suggest that the BH procedure exhibits severe burstiness when the test statistics have many strong, long-range correlations, but does not otherwise.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/19/2020

A factor-adjusted multiple testing of general alternatives

Factor-adjusted multiple testing is used for handling strong correlated ...
research
07/03/2022

Asymptotic Uncertainty of False Discovery Proportion

Multiple testing has been a popular topic in statistical research. Altho...
research
07/17/2018

Knockoffs for the mass: new feature importance statistics with false discovery guarantees

An important problem in machine learning and statistics is to identify f...
research
06/18/2019

Multiple Testing Embedded in an Aggregation Tree to Identify where Two Distributions Differ

A key goal of flow cytometry data analysis is to identify the subpopulat...
research
02/14/2023

Large-scale Multiple Testing: Fundamental Limits of False Discovery Rate Control and Compound Oracle

The false discovery rate (FDR) and the false non-discovery rate (FNR), d...
research
08/21/2019

Paired Test of Matrix Graphs and Brain Connectivity Analysis

Inferring brain connectivity network and quantifying the significance of...
research
02/27/2017

Statistical Anomaly Detection via Composite Hypothesis Testing for Markov Models

Under Markovian assumptions, we leverage a Central Limit Theorem (CLT) f...

Please sign up or login with your details

Forgot password? Click here to reset