Inference for High-dimensional Maximin Effects in Heterogeneous Regression Models Using a Sampling Approach

11/15/2020
by   Zijian Guo, et al.
0

Heterogeneity is an important feature of modern data sets and a central task is to extract information from large-scale and heterogeneous data. In this paper, we consider multiple high-dimensional linear models and adopt the definition of maximin effect (Meinshausen, Bühlmann, AoS, 43(4), 1801–1830) to summarize the information contained in this heterogeneous model. We define the maximin effect for a targeted population whose covariate distribution is possibly different from that of the observed data. We further introduce a ridge-type maximin effect to simultaneously account for reward optimality and statistical stability. To identify the high-dimensional maximin effect, we estimate the regression covariance matrix by a debiased estimator and use it to construct the aggregation weights for the maximin effect. A main challenge for statistical inference is that the estimated weights might have a mixture distribution and the resulted maximin effect estimator is not necessarily asymptotic normal. To address this, we devise a novel sampling approach to construct the confidence interval for any linear contrast of high-dimensional maximin effects. The coverage and precision properties of the proposed confidence interval are studied. The proposed method is demonstrated over simulations and a genetic data set on yeast colony growth under different environments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/25/2023

Statistical Inference and Large-scale Multiple Testing for High-dimensional Regression Models

This paper presents a selective survey of recent developments in statist...
research
09/07/2022

High-dimensional Inference for Generalized Linear Models with Hidden Confounding

Statistical inferences for high-dimensional regression models have been ...
research
07/30/2019

Local Inference in Additive Models with Decorrelated Local Linear Estimator

Additive models, as a natural generalization of linear regression, have ...
research
05/28/2023

Statistical Inference in High-Dimensional Generalized Linear Models with Asymmetric Link Functions

We have developed a statistical inference method applicable to a broad r...
research
06/28/2017

Asymptotic Confidence Regions for High-dimensional Structured Sparsity

In the setting of high-dimensional linear regression models, we propose ...
research
07/27/2019

Estimating the Random Effect in Big Data Mixed Models

We consider three problems in high-dimensional Gaussian linear mixed mod...
research
02/24/2023

Cox reduction and confidence sets of models: a theoretical elucidation

For sparse high-dimensional regression problems, Cox and Battey [1, 9] e...

Please sign up or login with your details

Forgot password? Click here to reset