Optimal FDR control in the two-group model

02/03/2019
by   Ruth Heller, et al.
0

The highly influential two group model in testing a large number of statistical hypothesis assumes that the test statistics come from a mixture of a high probability null distribution and a low probability alternative. Optimal control of the marginal false discovery rate (mFDR), in the sense that it provides maximal power (expected true discoveries) subject to mFDR control, is achieved by thresholding the local false discovery rate (locFDR) with a fixed threshold. In this paper we address the challenge of controlling the popular false discovery rate (FDR) rather than mFDR in the two group model. Since FDR is less conservative, this results in more rejections. We derive the optimal multiple testing (OMT) policy for this task, which turns out to be thresholding the locFDR with a threshold that is a function of the entire set of statistics. We show how to evaluate this threshold in time that is linear in the number of hypotheses, leading to an efficient algorithm for finding this policy. Thus, we can easily derive and apply the optimal procedure for problems with thousands of hypotheses. We show that for K=5000 hypotheses there can be significant power gain in OMT with FDR versus mFDR control.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/26/2018

Optimal Procedures for Multiple Testing Problems

Multiple testing problems are a staple of modern statistical analysis. T...
research
11/06/2021

Empirical Bayes Control of the False Discovery Exceedance

In sparse large-scale testing problems where the false discovery proport...
research
12/17/2022

Inference with approximate local false discovery rates

Efron's two-group model is widely used in large scale multiple testing. ...
research
06/17/2015

Communication-Efficient False Discovery Rate Control via Knockoff Aggregation

The false discovery rate (FDR)---the expected fraction of spurious disco...
research
09/06/2018

Controlling FDR while highlighting distinct discoveries

Often modern scientific investigations start by testing a very large num...
research
02/14/2023

Large-scale Multiple Testing: Fundamental Limits of False Discovery Rate Control and Compound Oracle

The false discovery rate (FDR) and the false non-discovery rate (FNR), d...
research
11/03/2017

NeuralFDR: Learning Discovery Thresholds from Hypothesis Features

As datasets grow richer, an important challenge is to leverage the full ...

Please sign up or login with your details

Forgot password? Click here to reset