Power analysis of knockoff filters for correlated designs

10/28/2019
by   Jingbo Liu, et al.
0

The knockoff filter introduced by Barber and Candès 2016 is an elegant framework for controlling the false discovery rate in variable selection. While empirical results indicate that this methodology is not too conservative, there is no conclusive theoretical result on its power. When the predictors are i.i.d. Gaussian, it is known that as the signal to noise ratio tend to infinity, the knockoff filter is consistent in the sense that one can make FDR go to 0 and power go to 1 simultaneously. In this work we study the case where the predictors have a general covariance matrix . We introduce a simple functional called effective signal deficiency (ESD) of the covariance matrix of the predictors that predicts consistency of various variable selection methods. In particular, ESD reveals that the structure of the precision matrix plays a central role in consistency and therefore, so does the conditional independence structure of the predictors. To leverage this connection, we introduce Conditional Independence knockoff, a simple procedure that is able to compete with the more sophisticated knockoff filters and that is defined when the predictors obey a Gaussian tree graphical models (or when the graph is sufficiently sparse). Our theoretical results are supported by numerical evidence on synthetic data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/20/2020

Joint Bayesian Variable and DAG Selection Consistency for High-dimensional Regression Models with Network-structured Covariates

We consider the joint sparse estimation of regression coefficients and t...
research
12/03/2012

Structure estimation for discrete graphical models: Generalized covariance matrices and their inverses

We investigate the relationship between the structure of a discrete grap...
research
09/12/2021

Differentially Private Variable Selection via the Knockoff Filter

The knockoff filter, recently developed by Barber and Candes, is an effe...
research
10/24/2021

Robust Variable Selection under Cellwise Contamination

Cellwise outliers are widespread in data and traditional robust methods ...
research
03/14/2022

Consistent and scalable Bayesian joint variable and graph selection for disease diagnosis leveraging functional brain network

We consider the joint inference of regression coefficients and the inver...
research
06/30/2021

The geometry of Gaussian double Markovian distributions

Gaussian double Markovian models consist of covariance matrices constrai...
research
10/16/2020

Power of FDR Control Methods: The Impact of Ranking Algorithm, Tampered Design, and Symmetric Statistic

As the power of FDR control methods for high-dimensional variable select...

Please sign up or login with your details

Forgot password? Click here to reset