Efficient Estimation Under Data Fusion

11/29/2021
by   Sijia Li, et al.
0

We aim to make inferences about a smooth, finite-dimensional parameter by fusing data from multiple sources together. Previous works have studied the estimation of a variety of parameters in similar data fusion settings, including in the estimation of the average treatment effect, optimal treatment rule, and average reward, with the majority of them merging one historical data source with covariates, actions, and rewards and one data source of the same covariates. In this work, we consider the general case where one or more data sources align with each part of the distribution of the target population, for example, the conditional distribution of the reward given actions and covariates. We describe potential gains in efficiency that can arise from fusing these data sources together in a single analysis, which we characterize by a reduction in the semiparametric efficiency bound. We also provide a general means to construct estimators that achieve these bounds. In numerical experiments, we show marked improvements in efficiency from using our proposed estimators rather than their natural alternatives. Finally, we illustrate the magnitude of efficiency gains that can be realized in vaccine immunogenicity studies by fusing data from two HIV vaccine trials.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/28/2023

Data fusion using weakly aligned sources

We introduce a new data fusion method that utilizes multiple data source...
research
03/31/2023

Efficiently transporting average treatment effects using a sufficient subset of effect modifiers

We develop flexible and nonparametric estimators of the average treatmen...
research
09/22/2020

The Role of Propensity Score Structure in Asymptotic Efficiency of Estimated Conditional Quantile Treatment Effect

When a strict subset of covariates are given, we propose conditional qua...
research
08/10/2022

Heterogeneity assessment in causal data fusion problems

Previous works have formalized the conditions under which findings from ...
research
11/19/2020

Sharp bounds for variance of treatment effect estimators in the finite population in the presence of covariates

In the completely randomized experiment, the variances of treatment effe...
research
07/02/2013

Data Fusion by Matrix Factorization

For most problems in science and engineering we can obtain data sets tha...
research
11/13/2020

Nonparametric fusion learning: synthesize inferences from diverse sources using depth confidence distribution

Fusion learning refers to synthesizing inferences from multiple sources ...

Please sign up or login with your details

Forgot password? Click here to reset