Large-scale Multiple Testing: Fundamental Limits of False Discovery Rate Control and Compound Oracle

02/14/2023
by   Yutong Nie, et al.
0

The false discovery rate (FDR) and the false non-discovery rate (FNR), defined as the expected false discovery proportion (FDP) and the false non-discovery proportion (FNP), are the most popular benchmarks for multiple testing. Despite the theoretical and algorithmic advances in recent years, the optimal tradeoff between the FDR and the FNR has been largely unknown except for certain restricted class of decision rules, e.g., separable rules, or for other performance metrics, e.g., the marginal FDR and the marginal FNR (mFDR and mFNR). In this paper we determine the asymptotically optimal FDR-FNR tradeoff under the two-group random mixture model when the number of hypotheses tends to infinity. Distinct from the optimal mFDR-mFNR tradeoff, which is achieved by separable decision rules, the optimal FDR-FNR tradeoff requires compound rules and randomization even in the large-sample limit. A data-driven version of the oracle rule is proposed and shown to outperform existing methodologies on simulated data for models as simple as the normal mean model. Finally, to address the limitation of the FDR and FNR which only control the expectations but not the fluctuations of the FDP and FNP, we also determine the optimal tradeoff when the FDP and FNP are controlled with high probability and show it coincides with that of the mFDR and the mFNR.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/06/2021

Empirical Bayes Control of the False Discovery Exceedance

In sparse large-scale testing problems where the false discovery proport...
research
02/03/2019

Optimal FDR control in the two-group model

The highly influential two group model in testing a large number of stat...
research
02/28/2020

A New Procedure for Controlling False Discovery Rate in Large-Scale t-tests

This paper is concerned with false discovery rate (FDR) control in large...
research
07/15/2022

The edge of discovery: Controlling the local false discovery rate at the margin

Despite the popularity of the false discovery rate (FDR) as an error con...
research
12/13/2017

Local False Discovery Rate Based Methods for Multiple Testing of One-Way Classified Hypotheses

This paper continues the line of research initiated in Liu, Sarkar and Z...
research
04/18/2021

A central limit theorem for the Benjamini-Hochberg false discovery proportion under a factor model

The Benjamini-Hochberg (BH) procedure remains widely popular despite hav...

Please sign up or login with your details

Forgot password? Click here to reset