AdaPT-GMM: Powerful and robust covariate-assisted multiple testing

06/30/2021
by   Patrick Chao, et al.
0

We propose a new empirical Bayes method for covariate-assisted multiple testing with false discovery rate (FDR) control, where we model the local false discovery rate for each hypothesis as a function of both its covariates and p-value. Our method refines the adaptive p-value thresholding (AdaPT) procedure by generalizing its masking scheme to reduce the bias and variance of its false discovery proportion estimator, improving the power when the rejection set is small or some null p-values concentrate near 1. We also introduce a Gaussian mixture model for the conditional distribution of the test statistics given covariates, modeling the mixing proportions with a generic user-specified classifier, which we implement using a two-layer neural network. Like AdaPT, our method provably controls the FDR in finite samples even if the classifier or the Gaussian mixture model is misspecified. We show in extensive simulations and real data examples that our new method, which we call AdaPT-GMM, consistently delivers high power relative to competing state-of-the-art methods. In particular, it performs well in scenarios where AdaPT is underpowered, and is especially well-suited for testing composite null hypothesis, such as whether the effect size exceeds a practical significance threshold.

READ FULL TEXT

page 20

page 40

research
07/15/2022

The edge of discovery: Controlling the local false discovery rate at the margin

Despite the popularity of the false discovery rate (FDR) as an error con...
research
02/21/2020

Adaptive Covariate Acquisition for Minimizing Total Cost of Classification

In some applications, acquiring covariates comes at a cost which is not ...
research
09/11/2019

Covariate Adaptive False Discovery Rate Control with Applications to Omics-Wide Multiple Testing

Conventional multiple testing procedures often assume hypotheses for dif...
research
11/15/2020

MixTwice: large-scale hypothesis testing for peptide arrays by variance mixing

Peptide microarrays have emerged as a powerful technology in immunoprote...
research
08/28/2021

ZAP: Z-value Adaptive Procedures for False Discovery Rate Control with Side Information

Adaptive multiple testing with covariates is an important research direc...
research
03/07/2019

Relaxing the Assumptions of Knockoffs by Conditioning

The recent paper Candès et al. (2018) introduced model-X knockoffs, a me...
research
05/11/2018

False Discovery Rate Control Under Reduced Precision Computation

The mitigation of false positives is an important issue when conducting ...

Please sign up or login with your details

Forgot password? Click here to reset