Improving the Stability of the Knockoff Procedure: Multiple Simultaneous Knockoffs and Entropy Maximization

10/26/2018
by   Jaime Roquero Gimenez, et al.
0

The Model-X knockoff procedure has recently emerged as a powerful approach for feature selection with statistical guarantees. The advantage of knockoff is that if we have a good model of the features X, then we can identify salient features without knowing anything about how the outcome Y depends on X. An important drawback of knockoffs is its instability: running the procedure twice can result in very different selected features, potentially leading to different conclusions. Addressing this instability is critical for obtaining reproducible and robust results. Here we present a generalization of the knockoff procedure that we call simultaneous multi-knockoffs. We show that multi-knockoff guarantees false discovery rate (FDR) control, and is substantially more stable and powerful compared to the standard (single) knockoff. Moreover we propose a new algorithm based on entropy maximization for generating Gaussian multi-knockoffs. We validate the improved stability and power of multi-knockoffs in systematic experiments. We also illustrate how multi-knockoffs can improve the accuracy of detecting genetic mutations that are causally linked to phenotypes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/29/2019

Discovering Conditionally Salient Features with Statistical Guarantees

The goal of feature selection is to identify important features that are...
research
07/17/2018

Knockoffs for the mass: new feature importance statistics with false discovery guarantees

An important problem in machine learning and statistics is to identify f...
research
04/21/2023

Joint Mirror Procedure: Controlling False Discovery Rate for Identifying Simultaneous Signals

In many applications, identifying a single feature of interest requires ...
research
06/03/2021

Normalizing Flows for Knockoff-free Controlled Feature Selection

The goal of controlled feature selection is to discover the features a r...
research
02/21/2020

Aggregation of Multiple Knockoffs

We develop an extension of the Knockoff Inference procedure, introduced ...
research
01/03/2022

Cluster Stability Selection

Stability selection (Meinshausen and Buhlmann, 2010) makes any feature s...
research
07/10/2023

ARK: Robust Knockoffs Inference with Coupling

We investigate the robustness of the model-X knockoffs framework with re...

Please sign up or login with your details

Forgot password? Click here to reset