Controlling the FDR in variable selection via multiple knockoffs

11/21/2019
by Kristen Emery, et al.

Barber and Candes recently introduced knockoff+, a feature selection method that controls the false discovery rate (FDR) among the selected features in the classical linear regression problem. Knockoff+ controls the FDR through a competition between the original features and artificially created knockoff features [1]. We generalize Barber and Candes' knockoff construction to generate multiple knockoffs and use them in conjunction with a recently developed general framework for multiple-competition-based FDR control [9]. We prove that, with our initial multiple-knockoff construction, the combined procedure rigorously controls the FDR in the finite-sample setting. Because this construction has somewhat limited utility, we introduce a heuristic we call "batching", which significantly improves the power of our multiple-knockoff procedures. Finally, we combine the batched knockoffs with a new context-dependent resampling scheme that replaces the generic resampling scheme used in the general multiple-competition setup. We show using simulations that the resulting "multi-knockoff-select" procedure empirically controls the FDR in the finite-sample setting of the variable selection problem while often delivering substantially more power than knockoff+.
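For readers unfamiliar with the competition step, the following is a minimal sketch of the single-knockoff selection rule of knockoff+ as described by Barber and Candes, not of the multi-knockoff procedure proposed here. It assumes that a statistic W_j has already been computed for each feature, where a large positive value means the original feature beat its knockoff and a negative value means the knockoff won. The function name `knockoff_plus_select` and the toy values of `w` are hypothetical and chosen only for illustration.

```python
def knockoff_plus_select(w, q):
    """Return indices of features selected by the knockoff+ rule at target FDR q.

    w : list of competition statistics W_j (assumed precomputed; positive
        means the original feature won, negative means its knockoff won).
    q : target FDR level, e.g. 0.2.
    """
    # Candidate thresholds are the distinct nonzero magnitudes |W_j|.
    candidates = sorted({abs(x) for x in w if x != 0})
    for t in candidates:
        knockoff_wins = sum(1 for x in w if x <= -t)
        original_wins = sum(1 for x in w if x >= t)
        # The "+1" in the numerator is what distinguishes knockoff+ from
        # the plain knockoff threshold.
        if (1 + knockoff_wins) / max(1, original_wins) <= q:
            return [j for j, x in enumerate(w) if x >= t]
    # No threshold meets the bound: select nothing.
    return []


# Toy statistics, made up for illustration only.
w = [5.0, 4.2, -0.3, 3.9, 0.8, -1.1, 2.7, 3.3, 0.0, 4.8]
selected = knockoff_plus_select(w, q=0.2)
print(selected)  # [0, 1, 3, 6, 7, 9]
```

With these toy values the smallest threshold meeting the bound is t = 2.7, where no knockoff wins remain and six originals win, giving an estimated FDR of 1/6. Because of the added 1, no threshold can reach q = 0.1 with only ten features, so the same call with `q=0.1` returns an empty list.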


