Derandomized knockoffs: leveraging e-values for false discovery rate control

05/30/2022
by   Zhimei Ren, et al.
0

Model-X knockoffs is a flexible wrapper method for high-dimensional regression algorithms, which provides guaranteed control of the false discovery rate (FDR). Due to the randomness inherent to the method, different runs of model-X knockoffs on the same dataset often result in different sets of selected variables, which is undesirable in practice. In this paper, we introduce a methodology for derandomizing model-X knockoffs with provable FDR control. The key insight of our proposed method lies in the discovery that the knockoffs procedure is in essence an e-BH procedure. We make use of this connection, and derandomize model-X knockoffs by aggregating the e-values resulting from multiple knockoff realizations. We prove that the derandomized procedure controls the FDR at the desired level, without any additional conditions (in contrast, previously proposed methods for derandomization are not able to guarantee FDR control). The proposed method is evaluated with numerical experiments, where we find that the derandomized procedure achieves comparable power and dramatically decreased selection variability when compared with model-X knockoffs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/08/2019

Aggregated False Discovery Rate Control

We propose an aggregation scheme for methods that control the false disc...
research
02/14/2023

Derandomized Novelty Detection with FDR Control via Conformal E-values

Conformal prediction and other randomized model-free inference technique...
research
09/06/2020

False discovery rate control with e-values

E-values have gained recent attention as potential alternatives to p-val...
research
11/13/2008

P-values for high-dimensional regression

Assigning significance in high-dimensional regression is challenging. Mo...
research
02/21/2020

Aggregation of Multiple Knockoffs

We develop an extension of the Knockoff Inference procedure, introduced ...
research
12/12/2018

Multiple Model-Free Knockoffs

Model-free knockoffs is a recently proposed technique for identifying co...
research
10/05/2022

Sample-and-Forward: Communication-Efficient Control of the False Discovery Rate in Networks

This work concerns controlling the false discovery rate (FDR) in network...

Please sign up or login with your details

Forgot password? Click here to reset