Strategy to select most efficient RCT samples based on observational data

11/09/2022
by   Wenqi Shi, et al.
0

Randomized experiments can provide unbiased estimates of sample average treatment effects. However, estimates of population treatment effects can be biased when the experimental sample and the target population differ. In this case, the population average treatment effect can be identified by combining experimental and observational data. A good experiment design trumps all the analyses that come after. While most of the existing literature centers around improving analyses after RCTs, we instead focus on the design stage, fundamentally improving the efficiency of the combined causal estimator through the selection of experimental samples. We explore how the covariate distribution of RCT samples influences the estimation efficiency and derive the optimal covariate allocation that leads to the lowest variance. Our results show that the optimal allocation does not necessarily follow the exact distribution of the target cohort, but adjusted for the conditional variability of potential outcomes. We formulate a metric to compare and choose from candidate RCT sample compositions. We also develop variations of our main results to cater for practical scenarios with various cost constraints and precision requirements. The ultimate goal of this paper is to provide practitioners with a clear and actionable strategy to select RCT samples that will lead to efficient causal inference.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/06/2023

Combining observational and experimental data for causal inference considering data privacy

Combining observational and experimental data for causal inference can i...
research
09/08/2020

Designing Transportable Experiments

We consider the problem of designing a randomized experiment on a source...
research
09/05/2019

Covariate Selection for Generalizing Experimental Results

Scientists are interested in generalizing causal effects estimated in an...
research
01/12/2023

A Framework for Generalization and Transportation of Causal Estimates Under Covariate Shift

Randomized experiments are an excellent tool for estimating internally v...
research
06/09/2023

Using Auxiliary Data to Boost Precision in the Analysis of A/B Tests on an Online Educational Platform: New Data and New Results

Randomized A/B tests within online learning platforms represent an excit...
research
07/05/2020

Robust Causal Inference Under Covariate Shift via Worst-Case Subpopulation Treatment Effects

We propose the worst-case treatment effect (WTE) across all subpopulatio...
research
07/19/2021

Causal Inference Struggles with Agency on Online Platforms

Online platforms regularly conduct randomized experiments to understand ...

Please sign up or login with your details

Forgot password? Click here to reset