Power of FDR Control Methods: The Impact of Ranking Algorithm, Tampered Design, and Symmetric Statistic

10/16/2020
by   Zheng Tracy Ke, et al.
0

As the power of FDR control methods for high-dimensional variable selections has been mostly evaluated empirically, we focus here on theoretical power analyses of two recent such methods, the knockoff filter and the Gaussian mirror. We adopt the Rare/Weak signal model, popular in multiple testing and variable selection literature, and characterize the rate of convergence of the number of false positives and the number of false negatives of FDR control methods for particular classes of designs. Our analyses lead to several noteworthy discoveries. First, the choice of the symmetric statistic in FDR control methods crucially affects the power. Second, with a proper symmetric statistic, the operation of adding "noise" to achieve FDR control yields almost no loss of power compared with its prototype, at least for some special classes of designs. Third, the knockoff filter and Gaussian mirror have comparable power for orthogonal designs, but they behave differently for non-orthogonal designs. We study the block-wise diagonal designs and show that the knockoff filter has a higher power when the regression coefficient vector is extremely sparse, and the Gaussian mirror has a higher power when the coefficient vector is moderately sparse.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/23/2022

Rényi Distillation for Global Testing in Sparse Regression Problems

Many modern high-dimensional regression applications involve testing whe...
research
09/04/2011

Variable Selection in High Dimensions with Random Designs and Orthogonal Matching Pursuit

The performance of Orthogonal Matching Pursuit (OMP) for variable select...
research
08/23/2021

StarTrek: Combinatorial Variable Selection with False Discovery Rate Control

Variable selection on the large-scale networks has been extensively stud...
research
07/02/2020

A Scale-free Approach for False Discovery Rate Control in Generalized Linear Models

The generalized linear models (GLM) have been widely used in practice to...
research
02/20/2020

False Discovery Rate Control via Data Splitting

Selecting relevant features associated with a given response variable is...
research
10/28/2019

Power analysis of knockoff filters for correlated designs

The knockoff filter introduced by Barber and Candès 2016 is an elegant f...
research
06/30/2021

Whiteout: when do fixed-X knockoffs fail?

A core strength of knockoff methods is their virtually limitless customi...

Please sign up or login with your details

Forgot password? Click here to reset