Regression adjustment in randomized experiments with a diverging number of covariates

06/20/2018
by   Lihua Lei, et al.
0

Extending R. A. Fisher and D. A. Freedman's results on the analysis of covariance, Lin [2013] proposed an ordinary least squares adjusted estimator of the average treatment effect in completely randomized experiments. His results are appealing because he uses Neyman's randomization model without imposing any modeling assumptions, and the consistency and asymptotic normality of his estimator hold even if the linear model is misspecified. His results hold for a fixed dimension p of the covariates with the sample size n approaching infinity. However, it is common for practitioners to adjust for a large number of covariates to improve efficiency. Therefore, it is important to consider asymptotic regimes allowing for a diverging number of covariates. We show that Lin [2013]'s estimator is consistent when κ p → 0 and asymptotically normal when κ p → 0 under mild moment conditions, where κ is the maximum leverage score of the covariate matrix. In the favorable case where leverage scores are all close, his estimator is consistent when p = o(n / n) and is asymptotically normal when p = o(n^1/2). In addition, we propose a bias-corrected estimator that is consistent when κ p→ 0 and is asymptotically normal, with the same variance in the fixed-p regime, when κ^2 p p → 0. In the favorable case, the latter condition reduces to p = o(n^2/3 / ( n)^1/3), and our simulation confirms that n^2/3 is the phase-transition threshold for Lin [2013]'s central limit theorem. Our analysis requires novel analytic tools for finite population inference and sampling without replacement, which complement and potentially enrich the theory in other areas such as survey sampling, matrix sketching, and transductive learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/05/2019

Regression-adjusted average treatment effect estimates in stratified and sequentially randomized experiments

Stratified and sequentially randomized experiments are widely used in fi...
research
03/09/2019

Two paradoxical results in linear models: the variance inflation factor and the analysis of covariance

A result from a standard linear model course is that the variance of the...
research
01/31/2022

Regression Adjustments under Covariate-Adaptive Randomizations with Imperfect Compliance

We study regression adjustments with additional covariates in randomized...
research
12/01/2021

On the robustness and precision of mixed-model analysis of covariance in cluster-randomized trials

In the analyses of cluster-randomized trials, a standard approach for co...
research
09/04/2023

Challenges of the inconsistency regime: Novel debiasing methods for missing data models

We study semi-parametric estimation of the population mean when data is ...
research
04/17/2023

Adjustment with Many Regressors Under Covariate-Adaptive Randomizations

Our paper identifies a trade-off when using regression adjustments (RAs)...
research
05/20/2015

Network driven sampling; a critical threshold for design effects

Web crawling, snowball sampling, and respondent-driven sampling (RDS) ar...

Please sign up or login with your details

Forgot password? Click here to reset