A Decorrelating and Debiasing Approach to Simultaneous Inference for High-Dimensional Confounded Models

08/18/2022
by Yinrui Sun, et al.

Motivated by simultaneous association analysis in the presence of latent confounders, this paper studies the large-scale hypothesis testing problem for high-dimensional confounded linear models with both non-asymptotic and asymptotic false discovery control. Such a model covers a wide range of practical settings in which both the response and the predictors may be confounded. With high-dimensional predictors and unobservable confounders, simultaneous inference with provable guarantees becomes highly challenging, and the unknown strong dependency among the confounded covariates makes the challenge even more pronounced. This paper first introduces a decorrelating procedure that shrinks the confounding effect and weakens the correlations among the predictors, and then performs debiasing under the decorrelated design based on a biased initial estimator. Standardized test statistics are then constructed and their asymptotic normality is established. Furthermore, a simultaneous inference procedure is proposed to identify significant associations, and both finite-sample and asymptotic false discovery bounds are provided. The non-asymptotic result is general and model-free, and is of independent interest. We also prove that, under a minimal signal strength condition, all associations can be successfully detected with probability tending to one. Simulation studies are carried out to evaluate the performance of the proposed approach and to compare it with competing methods. The proposed procedure is further applied to detect gene associations with anti-cancer drug sensitivities.
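
The abstract does not spell out the transform or the thresholding rule, so the following is only a rough sketch of the two ends of such a pipeline under stated assumptions. It uses a spectral "trim" transform (capping the singular values of the design at their median) as a stand-in for the decorrelating step, and the standard Benjamini-Hochberg rule as a stand-in for the simultaneous inference step. Neither is claimed to be the authors' procedure, the debiasing step is not reproduced, and the function names `trim_transform`, `two_sided_pvalues`, and `benjamini_hochberg` are hypothetical.

```python
import math

import numpy as np


def trim_transform(X):
    """Return F such that F @ X has its singular values capped at their median.

    Assumption: this spectral shrinkage stands in for the decorrelating step;
    the paper's actual transform may differ.
    """
    U, d, _ = np.linalg.svd(X, full_matrices=False)
    tau = np.median(d)
    scale = np.ones_like(d)
    pos = d > 0
    scale[pos] = np.minimum(d[pos], tau) / d[pos]  # < 1 only for large singular values
    # F acts as diag(scale) on the column space of X and as the identity elsewhere.
    return np.eye(X.shape[0]) - U @ np.diag(1.0 - scale) @ U.T


def two_sided_pvalues(z):
    """Two-sided p-values for statistics that are approximately N(0, 1) under the null."""
    return np.array([math.erfc(abs(t) / math.sqrt(2.0)) for t in z])


def benjamini_hochberg(pvals, alpha=0.1):
    """Standard Benjamini-Hochberg step-up rule (not the paper's own procedure)."""
    p = np.asarray(pvals, dtype=float)
    m = p.size
    order = np.argsort(p)
    below = p[order] <= alpha * np.arange(1, m + 1) / m
    reject = np.zeros(m, dtype=bool)
    if below.any():
        k = np.nonzero(below)[0].max()  # largest rank passing the step-up bound
        reject[order[: k + 1]] = True
    return reject


# Toy confounded design: two latent factors H load on every predictor.
rng = np.random.default_rng(0)
n, p, q = 50, 20, 2
H = rng.normal(size=(n, q))
X = H @ (3.0 * rng.normal(size=(q, p))) + rng.normal(size=(n, p))
F = trim_transform(X)
X_decorrelated = F @ X  # the same F would also be applied to the response

# Hypothetical standardized statistics, e.g. from a debiased estimator.
z = np.array([5.0, 4.0, 0.1, -0.2, 0.3])
rejected = benjamini_hochberg(two_sided_pvalues(z), alpha=0.1)
```

Capping the leading singular values targets the directions in which latent factors inflate the design, which is the intuition behind shrinking the confounding effect before debiasing. The Benjamini-Hochberg rule is used here only because it is a familiar baseline for false discovery control.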

research
04/02/2020

Integrative High Dimensional Multiple Testing with Heterogeneity under Data Sharing Constraints

Identifying informative predictors in a high dimensional regression mode...
research
09/13/2023

Simultaneous inference for generalized linear models with unmeasured confounders

Tens of thousands of simultaneous hypothesis tests are routinely perform...
research
02/24/2022

Multiple multi-sample testing under arbitrary covariance dependency

Modern high-throughput biomedical devices routinely produce data on a la...
research
10/21/2019

Hypothesis Testing in High-Dimensional Instrumental Variables Regression with an Application to Genomics Data

Gene expression and phenotype association can be affected by potential u...
research
08/18/2021

Multiple two-sample testing under arbitrary covariance dependency with an application in imaging mass spectrometry

Large-scale hypothesis testing has become a ubiquitous problem in high-d...
research
12/23/2019

Simultaneous Inference for Empirical Best Predictors with a Poverty Study in Small Areas

Today, generalized linear mixed models are broadly used in many fields. ...
research
12/21/2021

Efficient Estimation of the Maximal Association between Multiple Predictors and a Survival Outcome

This paper develops a new approach to post-selection inference for scree...
