Controlling the FDR in variable selection via multiple knockoffs

11/21/2019
by   Kristen Emery, et al.
0

Barber and Candes recently introduced a feature selection method called knockoff+ that controls the false discovery rate (FDR) among the selected features in the classical linear regression problem. Knockoff+ uses the competition between the original features and artificially created knockoff features to control the FDR [1]. We generalize Barber and Candes' knockoff construction to generate multiple knockoffs and use those in conjunction with a recently developed general framework for multiple competition-based FDR control [9]. We prove that using our initial multiple-knockoff construction the combined procedure rigorously controls the FDR in the finite sample setting. Because this construction has a somewhat limited utility we introduce a heuristic we call "batching" which significantly improves the power of our multiple-knockoff procedures. Finally, we combine the batched knockoffs with a new context-dependent resampling scheme that replaces the generic resampling scheme used in the general multiple-competition setup. We show using simulations that the resulting "multi-knockoff-select" procedure empirically controls the FDR in the finite setting of the variable selection problem while often delivering substantially more power than knockoff+.

READ FULL TEXT
POST COMMENT

Comments

There are no comments yet.

Authors

page 37

page 38

page 40

07/02/2019

Multiple competition based FDR control

Competition based FDR control has been commonly used for over a decade i...
11/24/2020

Competition-based control of the false discovery proportion

Target-decoy competition (TDC) is commonly used in the computational mas...
06/03/2021

Normalizing Flows for Knockoff-free Controlled Feature Selection

The goal of controlled feature selection is to discover the features a r...
03/06/2022

Variable Selection with the Knockoffs: Composite Null Hypotheses

The Fixed-X knockoff filter is a flexible framework for variable selecti...
02/21/2020

Aggregation of Multiple Knockoffs

We develop an extension of the Knockoff Inference procedure, introduced ...
08/24/2021

A Generalized Knockoff Procedure for FDR Control in Structural Change Detection

Controlling false discovery rate (FDR) is crucial for variable selection...
03/30/2021

Controlling the False Discovery Rate in Structural Sparsity: Split Knockoffs

Controlling the False Discovery Rate (FDR) in a variable selection proce...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.