RCT Rejection Sampling for Causal Estimation Evaluation

07/27/2023
by   Katherine A. Keith, et al.
0

Confounding is a significant obstacle to unbiased estimation of causal effects from observational data. For settings with high-dimensional covariates – such as text data, genomics, or the behavioral social sciences – researchers have proposed methods to adjust for confounding by adapting machine learning methods to the goal of causal estimation. However, empirical evaluation of these adjustment methods has been challenging and limited. In this work, we build on a promising empirical evaluation strategy that simplifies evaluation design and uses real data: subsampling randomized controlled trials (RCTs) to create confounded observational datasets while using the average causal effects from the RCTs as ground-truth. We contribute a new sampling algorithm, which we call RCT rejection sampling, and provide theoretical guarantees that causal identification holds in the observational data to allow for valid comparisons to the ground-truth RCT. Using synthetic data, we show our algorithm indeed results in low bias when oracle estimators are evaluated on the confounded samples, which is not always the case for a previously proposed algorithm. In addition to this identification result, we highlight several finite data considerations for evaluation designers who plan to use RCT rejection sampling on their own datasets. As a proof of concept, we implement an example evaluation pipeline and walk through these finite data considerations with a novel, real-world RCT – which we release publicly – consisting of approximately 70k observations and text data as high-dimensional covariates. Together, these contributions build towards a broader agenda of improved empirical evaluation for causal estimation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/18/2020

Learning Adjustment Sets from Observational and Limited Experimental Data

Estimating causal effects from observational data is not always possible...
research
09/27/2022

Falsification before Extrapolation in Causal Effect Estimation

Randomized Controlled Trials (RCTs) represent a gold standard when devel...
research
10/06/2020

Using Experimental Data to Evaluate Methods for Observational Causal Inference

Methods that infer causal dependence from observational data are central...
research
09/21/2020

Adjusting for Confounders with Text: Challenges and an Empirical Evaluation Framework for Causal Inference

Leveraging text, such as social media posts, for causal inferences requi...
research
09/06/2020

Discovering Reliable Causal Rules

We study the problem of deriving policies, or rules, that when enacted o...
research
12/05/2018

On High Dimensional Covariate Adjustment for Estimating Causal Effects in Randomized Trials with Survival Outcomes

We study the estimation of the average causal effect (ACE) on the surviv...
research
06/14/2017

Bias and high-dimensional adjustment in observational studies of peer effects

Peer effects, in which the behavior of an individual is affected by the ...

Please sign up or login with your details

Forgot password? Click here to reset