Combining observational and experimental data for causal inference considering data privacy

08/06/2023
by Charlotte Z. Mann, et al.

Combining observational and experimental data for causal inference can improve treatment effect estimation. However, many observational data sets cannot be released because of data privacy considerations, so a single researcher may not have access to both experimental and observational data. Nonetheless, a small risk of disclosing sensitive information might be tolerable to organizations that house confidential data. In these cases, organizations can employ data privacy techniques, which decrease disclosure risk, potentially at the expense of data utility. In this paper, we explore disclosure-limiting transformations of observational data, which can be combined with experimental data to estimate the sample and population average treatment effects. We consider leveraging observational data both to improve the generalizability of treatment effect estimates when a randomized controlled trial (RCT) is not representative of the population of interest, and to increase the precision of treatment effect estimates. Through simulation studies, we illustrate the trade-off between privacy and utility when employing different disclosure-limiting transformations. We find that leveraging transformed observational data in treatment effect estimation can still improve on estimation that uses RCT data alone.

