Directional FDR Control for Sub-Gaussian Sparse GLMs

05/02/2021
by Chang Cui, et al.

High-dimensional sparse generalized linear models (GLMs) arise when both the number of samples and the number of variables are large, and the number of variables may even grow faster than the number of samples. False discovery rate (FDR) control aims to identify a small number of statistically significant nonzero coefficients after a sparse penalized estimate of the GLM has been obtained. Using the CLIME method for precision matrix estimation, we construct the debiased-Lasso estimator and prove its asymptotic normality via minimax-rate oracle inequalities for sparse GLMs. In practice, it is often necessary to judge accurately whether each regression coefficient is positive or negative, which determines whether the predictor variable is positively or negatively related to the response variable conditionally on the remaining variables. Using the debiased estimator, we establish multiple testing procedures. Under mild conditions, we show that the proposed debiased statistics can asymptotically control the directional (sign) FDR and the directional false discovery variables (FDV) at a pre-specified significance level. Moreover, we show that our multiple testing procedure approximately achieves a statistical power of 1. We also extend our methods to two-sample problems and propose two-sample test statistics. Under suitable conditions, these asymptotically achieve directional FDR control and directional FDV control at the specified significance level for two-sample problems. Numerical simulations confirm the FDR control of the proposed testing procedures, which sometimes outperform the classical knockoff method.
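To make the idea of directional selection concrete, the following is a minimal sketch, not the authors' exact procedure. It assumes that standardized debiased statistics z_j (approximately standard normal under the null) are already available, and it uses a generic Benjamini-Hochberg-style data-driven threshold: the smallest t for which the estimated false discovery proportion p * P(|Z| >= t) / max(R(t), 1) is at most alpha. The function names `two_sided_tail` and `directional_selection` and the example inputs are hypothetical.

```python
import math


def two_sided_tail(t):
    """Return P(|Z| >= t) for a standard normal Z."""
    return math.erfc(t / math.sqrt(2.0))


def directional_selection(z_stats, alpha=0.1):
    """Select coefficients and report their signs.

    z_stats: standardized debiased statistics (assumed ~ N(0, 1) under the null).
    alpha:   target directional FDR level.
    Returns a dict mapping selected index -> +1 or -1.
    """
    p = len(z_stats)
    # Candidate thresholds are the observed |z_j|, scanned from smallest up.
    candidates = sorted(abs(z) for z in z_stats)
    threshold = None
    for t in candidates:
        rejections = sum(1 for z in z_stats if abs(z) >= t)
        # Estimated false discovery proportion at threshold t.
        est_fdp = p * two_sided_tail(t) / max(rejections, 1)
        if est_fdp <= alpha:
            threshold = t
            break
    if threshold is None:
        return {}  # No threshold meets the target level: select nothing.
    # The declared direction is the sign of the statistic.
    return {
        j: (1 if z > 0 else -1)
        for j, z in enumerate(z_stats)
        if abs(z) >= threshold
    }


if __name__ == "__main__":
    z = [5.0, -4.5, 0.3, -0.2, 0.1, 0.8, -0.5, 0.4, 6.0, -0.1]
    print(directional_selection(z, alpha=0.1))
```

A selected coefficient counts as a directional false discovery when its declared sign disagrees with the true sign (or the true coefficient is zero), which is why the sign is returned alongside each selected index.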


