Covariate Adaptive False Discovery Rate Control with Applications to Omics-Wide Multiple Testing

09/11/2019
by   Xianyang Zhang, et al.
0

Conventional multiple testing procedures often assume hypotheses for different features are exchangeable. However, in many scientific applications, additional covariate information regarding the patterns of signals and nulls are available. In this paper, we introduce an FDR control procedure in large-scale inference problem that can incorporate covariate information. We develop a fast algorithm to implement the proposed procedure and prove its asymptotic validity even when the underlying model is misspecified and the p-values are weakly dependent (e.g., strong mixing). Extensive simulations are conducted to study the finite sample performance of the proposed method and we demonstrate that the new approach improves over the state-of-the-art approaches by being flexible, robust, powerful and computationally efficient. We finally apply the method to several omics datasets arising from genomics studies with the aim to identify omics features associated with some clinical and biological phenotypes. We show that the method is overall the most powerful among competing methods, especially when the signal is sparse. The proposed Covariate Adaptive Multiple Testing procedure is implemented in the R package CAMT.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/02/2020

Covariate Adaptive Family-wise Error Rate Control for Genome-Wide Association Studies

The family-wise error rate (FWER) has been widely used in genome-wide as...
research
06/30/2021

AdaPT-GMM: Powerful and robust covariate-assisted multiple testing

We propose a new empirical Bayes method for covariate-assisted multiple ...
research
02/21/2020

Adaptive Covariate Acquisition for Minimizing Total Cost of Classification

In some applications, acquiring covariates comes at a cost which is not ...
research
03/16/2018

False discovery rate control for multiple testing based on p-values with càdlàg distribution functions

For multiple testing based on p-values with càdlàg distribution function...
research
08/28/2021

ZAP: Z-value Adaptive Procedures for False Discovery Rate Control with Side Information

Adaptive multiple testing with covariates is an important research direc...
research
06/26/2018

Flexible Multiple Testing with the FACT Algorithm

Modern high-throughput science often leads to multiple testing problems:...
research
01/25/2022

NAPA: Neighborhood-Assisted and Posterior-Adjusted Two-sample Inference

Two-sample multiple testing problems of sparse spatial data are frequent...

Please sign up or login with your details

Forgot password? Click here to reset