Blocking, rerandomization, and regression adjustment in randomized experiments with high-dimensional covariates

09/23/2021
by   Ke Zhu, et al.
0

Blocking, a special case of rerandomization, is routinely implemented in the design stage of randomized experiments to balance baseline covariates. Regression adjustment is highly encouraged in the analysis stage to adjust for the remaining covariate imbalances. Researchers have recommended combining these techniques; however, the research on this combination in a randomization-based inference framework with a large number of covariates is limited. This paper proposes several methods that combine the blocking, rerandomization, and regression adjustment techniques in randomized experiments with high-dimensional covariates. In the design stage, we suggest the implementation of blocking or rerandomization or both techniques to balance a fixed number of covariates most relevant to the outcomes. For the analysis stage, we propose regression adjustment methods based on the Lasso to adjust for the remaining imbalances in the additional high-dimensional covariates. Moreover, we establish the asymptotic properties of the proposed Lasso-adjusted average treatment effect estimators and outline conditions under which these estimators are more efficient than the unadjusted estimators. In addition, we provide conservative variance estimators to facilitate valid inferences. Our analysis is randomization-based, allowing the outcome data generating models to be mis-specified. Simulation studies and two real data analyses demonstrate the advantages of the proposed methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/19/2020

A general theory of regression adjustment for covariate-adaptive randomization: OLS, Lasso, and beyond

We consider the problem of estimating and inferring treatment effects in...
research
06/26/2019

Rerandomization and Regression Adjustment

Randomization is a basis for the statistical inference of treatment effe...
research
07/16/2020

Principled Selection of Baseline Covariates to Account for Censoring in Randomized Trials with a Survival Endpoint

The analysis of randomized trials with time-to-event endpoints is nearly...
research
09/17/2021

Regression Discontinuity Design with Potentially Many Covariates

This paper studies the case of possibly high-dimensional covariates in t...
research
11/10/2020

Rerandomization in stratified randomized experiments

Stratification and rerandomization are two well-known methods used in ra...
research
10/26/2021

Towards Optimal Variance Reduction in Online Controlled Experiments

We study the optimal variance reduction solutions for online controlled ...

Please sign up or login with your details

Forgot password? Click here to reset