Leveraging Observational Data for Efficient CATE Estimation in Randomized Controlled Trials

06/30/2023
by   Amir Asiaee, et al.
0

Randomized controlled trials (RCTs) are the gold standard for causal inference, but they are often powered only for average effects, making estimation of heterogeneous treatment effects (HTEs) challenging. Conversely, large-scale observational studies (OS) offer a wealth of data but suffer from confounding bias. Our paper presents a novel framework to leverage OS data for enhancing the efficiency in estimating conditional average treatment effects (CATEs) from RCTs while mitigating common biases. We propose an innovative approach to combine RCTs and OS data, expanding the traditionally used control arms from external sources. The framework relaxes the typical assumption of CATE invariance across populations, acknowledging the often unaccounted systematic differences between RCT and OS participants. We demonstrate this through the special case of a linear outcome model, where the CATE is sparsely different between the two populations. The core of our framework relies on learning potential outcome means from OS data and using them as a nuisance parameter in CATE estimation from RCT data. We further illustrate through experiments that using OS findings reduces the variance of the estimated CATE from RCTs and can decrease the required sample size for detecting HTEs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/16/2020

Causal inference methods for combining randomized trials and observational studies: a review

With increasing data availability, treatment causal effects can be evalu...
research
09/27/2022

Falsification before Extrapolation in Causal Effect Estimation

Randomized Controlled Trials (RCTs) represent a gold standard when devel...
research
12/09/2021

CoBWeb: a user-friendly web application to estimate causal treatment effects from observational data using multiple algorithms

Background/aims: While randomized controlled trials are the gold standar...
research
04/05/2023

Many Data: Combine Experimental and Observational Data through a Power Likelihood

Randomized controlled trials are commonly regarded as the gold standard ...
research
02/05/2021

Randomized Controlled Trials with Minimal Data Retention

Amidst rising appreciation for privacy and data usage rights, researcher...
research
12/02/2021

Using Ecometric Data to Explore Sources of Cross-Site Impact Variance in Multi-Site Trials

A new method is proposed to explore sources of cross-site impact varianc...
research
12/08/2021

Non parametric estimation of causal populations in a counterfactual scenario

In causality, estimating the effect of a treatment without confounding i...

Please sign up or login with your details

Forgot password? Click here to reset