Calibrated regression estimation using empirical likelihood under data fusion

04/06/2022
by   Wei Li, et al.

Data analysis based on information from several sources is common in economic and biomedical studies. This setting is often referred to as the data fusion problem, which differs from traditional missing data problems in that complete data are not observed for any subject. We consider a regression analysis in which the outcome variable and some covariates are collected from two different sources. By leveraging the common variables observed in both data sets, doubly robust estimation procedures have been proposed in the literature to protect against possible model misspecification. However, they employ only a single propensity score model for the data fusion process and a single imputation model for the covariates available in only one data set. In practice, it may be questionable to assume that either model is correctly specified. We therefore propose an approach, based on empirical likelihood methods, that calibrates multiple propensity score and imputation models to gain more protection. The resulting estimator is consistent when any one of those models is correctly specified and is robust against extreme values of the fitted propensity scores. We also establish its asymptotic normality and discuss semiparametric estimation efficiency. Simulation studies show that the proposed estimator has substantial advantages over existing doubly robust estimators, and an assembled U.S. household expenditure data example is used for illustration.
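To give a rough sense of what calibration by empirical likelihood involves, the sketch below computes empirical likelihood weights under a single moment constraint. It is an illustration under stated assumptions, not the estimator from the paper: the function name `el_weights`, the fitted values `h`, and the value `target` are all hypothetical, and only one constraint is handled. Calibrating several propensity score and imputation models at once would stack one such constraint per model and solve for a vector of multipliers instead.

```python
# Empirical likelihood weights under one calibration constraint.
# Maximise sum(log p_i) subject to sum(p_i) = 1 and sum(p_i * g_i) = 0.
# The solution has the form p_i = 1 / (n * (1 + lam * g_i)), where the
# multiplier lam solves sum(g_i / (1 + lam * g_i)) = 0.

def el_weights(g, tol=1e-12, max_iter=200):
    """Return empirical likelihood weights for constraint values g (hypothetical helper)."""
    n = len(g)
    g_min, g_max = min(g), max(g)
    if not (g_min < 0.0 < g_max):
        raise ValueError("zero must lie strictly inside the range of g")

    # All weights are positive only for lam in (-1/g_max, -1/g_min).
    lo, hi = -1.0 / g_max, -1.0 / g_min
    eps = 1e-10 * (hi - lo)
    lo, hi = lo + eps, hi - eps  # stay strictly inside the open interval

    def score(lam):
        return sum(gi / (1.0 + lam * gi) for gi in g)

    # score is strictly decreasing on the interval, so bisection finds the root.
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        if score(mid) > 0.0:
            lo = mid
        else:
            hi = mid
        if hi - lo < tol:
            break

    lam = 0.5 * (lo + hi)
    return [1.0 / (n * (1.0 + lam * gi)) for gi in g]


# Assumed toy inputs: h holds fitted values of one working model on the
# common variables; target is the mean those values should be calibrated to.
h = [0.2, 0.4, 0.5, 0.9]
target = 0.6
w = el_weights([hi_ - target for hi_ in h])
calibrated_mean = sum(wi * hi_ for wi, hi_ in zip(w, h))
```

In this toy setup the weights stay positive and sum to one by construction, and `calibrated_mean` matches `target` up to the bisection tolerance. Positivity of the weights is one reason empirical likelihood calibration tends to be less sensitive to extreme fitted propensity scores than inverse weighting.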


research
08/22/2018

Doubly Robust Regression Analysis for Data Fusion

This paper investigates the problem of making inference about a parametr...
research
10/02/2019

Combining multiple imputation with raking of weights in the setting of nearly-true models

Raking of weights is one approach to using data from the full cohort in ...
research
01/09/2018

"Robust-squared" Imputation Models Using BART

Examples of "doubly robust" estimator for missing data include augmented...
research
07/14/2021

Survey data integration for regression analysis using model calibration

We consider regression analysis in the context of data integration. To c...
research
02/02/2018

A novel approach to estimate the Cox model with temporal covariates and its application to medical cost data

We propose a novel approach to estimate the Cox model with temporal cova...
research
06/02/2020

Cox regression analysis for distorted covariates with an unknown distortion function

We study inference for censored survival data where some covariates are ...
