Collaborative causal inference with a distributed data-sharing management

04/02/2022
by   Mengtong Hu, et al.
0

Data sharing barriers are paramount challenges arising from multicenter clinical trials where multiple data sources are stored in a distributed fashion at different local study sites. Merging such data sources into a common data storage for a centralized statistical analysis requires a data use agreement, which is often time-consuming. Data merging may become more burdensome when causal inference is of primary interest because propensity score modeling involves combining many confounding variables, and systematic incorporation of this additional modeling in meta-analysis has not been thoroughly investigated in the literature. We propose a new causal inference framework that avoids the merging of subject-level raw data from multiple sites but needs only the sharing of summary statistics. The proposed collaborative inference enjoys maximal protection of data privacy and minimal sensitivity to unbalanced data distributions across data sources. We show theoretically and numerically that the new distributed causal inference approach has little loss of statistical power compared to the centralized method that requires merging the entire data. We present large-sample properties and algorithms for the proposed method. We illustrate its performance by simulation experiments and a real-world data example on a multicenter clinical trial of basal insulin treatment for reducing the risk of post-transplantation diabetes among kidney-transplant patients.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/01/2023

An Adaptive Kernel Approach to Federated Learning of Heterogeneous Causal Effects

We propose a new causal inference framework to learn causal effects from...
research
07/16/2020

Do-search – a tool for causal inference and study design with multiple data sources

Epidemiological evidence is based on multiple data sources including cli...
research
08/16/2022

Collaborative causal inference on distributed data

The development of technologies for causal inference with the privacy pr...
research
05/31/2021

Federated Estimation of Causal Effects from Observational Data

Many modern applications collect data that comes in federated spirit, wi...
research
01/15/2022

Automated causal inference in application to randomized controlled clinical trials

Randomized controlled trials (RCTs) are considered as the gold standard ...
research
01/02/2023

Robust Inference for Federated Meta-Learning

Synthesizing information from multiple data sources is critical to ensur...
research
05/31/2021

Adaptive Multi-Source Causal Inference

Data scarcity is a tremendous challenge in causal effect estimation. In ...

Please sign up or login with your details

Forgot password? Click here to reset