Inference with approximate local false discovery rates

12/17/2022
by Rajesh Karmakar, et al.

Efron's two-group model is widely used in large-scale multiple testing. The model assumes that the test statistics are mutually independent; in realistic settings, however, they are typically dependent, and taking the dependence into account can boost power. The general two-group model accounts for the dependence between the test statistics. Optimal policies in the general two-group model require calculating, for each hypothesis, the probability that it is a true null given all test statistics, known as the local false discovery rate (locFDR). Unfortunately, calculating locFDRs under realistic dependence structures can be computationally prohibitive. We propose calculating approximate locFDRs based on a properly defined N-neighborhood for each hypothesis. We prove that thresholding the approximate locFDRs at a fixed threshold controls the marginal false discovery rate for any dependence structure. Furthermore, we prove that this is the optimal procedure in a restricted class of decision rules, in which the decision for each hypothesis is guided only by its N-neighborhood. We show through extensive simulations that our proposed method achieves substantial power gains compared to alternative practical approaches, while maintaining conceptual simplicity and computational feasibility. We demonstrate the utility of our method on a genome-wide association study of height.
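
To make the idea of a neighborhood-based locFDR concrete, the following is a minimal sketch and not the authors' implementation. It assumes a toy setting that the abstract does not specify: each hypothesis is null independently with probability `pi0`, the test statistics are Gaussian with unit variance and mean `mu` under the alternative, and the neighborhood of hypothesis i consists of itself and a single neighbor j with known correlation `rho`. All function and parameter names are hypothetical.

```python
import math
from itertools import product


def normal_pdf(z, mean):
    """Density of a unit-variance normal distribution."""
    return math.exp(-0.5 * (z - mean) ** 2) / math.sqrt(2 * math.pi)


def bvn_pdf(x, y, mx, my, rho):
    """Density of a bivariate normal with unit variances and correlation rho."""
    dx, dy = x - mx, y - my
    q = (dx * dx - 2 * rho * dx * dy + dy * dy) / (1 - rho * rho)
    return math.exp(-0.5 * q) / (2 * math.pi * math.sqrt(1 - rho * rho))


def marginal_locfdr(z, pi0, mu):
    """P(h = 0 | z) in the independent two-group model (assumed Gaussian)."""
    null = pi0 * normal_pdf(z, 0.0)
    alt = (1 - pi0) * normal_pdf(z, mu)
    return null / (null + alt)


def approx_locfdr(z_i, z_j, pi0, mu, rho):
    """P(h_i = 0 | z_i, z_j): locFDR restricted to the neighborhood {i, j}.

    Sums over the four joint configurations of (h_i, h_j); this toy
    model is an assumption made for illustration only.
    """
    num = den = 0.0
    for h_i, h_j in product((0, 1), repeat=2):
        prior = (pi0 if h_i == 0 else 1 - pi0) * (pi0 if h_j == 0 else 1 - pi0)
        weight = prior * bvn_pdf(z_i, z_j, mu * h_i, mu * h_j, rho)
        den += weight
        if h_i == 0:
            num += weight
    return num / den


def reject(locfdrs, threshold):
    """Fixed-threshold rule: reject when the (approximate) locFDR is small."""
    return [value <= threshold for value in locfdrs]


if __name__ == "__main__":
    # Same z_i, but the correlated neighbor changes the evidence against the null.
    print(marginal_locfdr(2.0, pi0=0.8, mu=3.0))
    print(approx_locfdr(2.0, 0.0, pi0=0.8, mu=3.0, rho=0.5))
```

With `rho = 0` the neighbor carries no information and the result reduces to the marginal locFDR of the independent two-group model. Larger neighborhoods would require summing over more configurations, which illustrates why exact locFDRs given all test statistics become expensive.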


research
02/03/2019

Optimal FDR control in the two-group model

The highly influential two group model in testing a large number of stat...
research
07/20/2020

Conditional calibration for false discovery rate control under dependence

We introduce a new class of methods for finite-sample false discovery ra...
research
03/29/2021

Optimal False Discovery Rate Control for Large Scale Multiple Testing with Auxiliary Information

Large-scale multiple testing is a fundamental problem in high dimensiona...
research
10/17/2019

Information Loss and Power Distortion from Standardizing in Multiple Hypothesis Testing

Standardization has been a widely adopted practice in multiple testing, ...
research
11/20/2019

Smoothed Nested Testing on Directed Acyclic Graphs

We consider the problem of multiple hypothesis testing when there is a l...
research
09/16/2016

Discovering Relationships and their Structures Across Disparate Data Modalities

Determining whether certain properties are related to other properties i...
research
11/03/2017

NeuralFDR: Learning Discovery Thresholds from Hypothesis Features

As datasets grow richer, an important challenge is to leverage the full ...
