Efficient and Robust Semi-supervised Estimation of ATE with Partially Annotated Treatment and Response

10/24/2021
by   Jue Hou, et al.
0

A notable challenge of leveraging Electronic Health Records (EHR) for treatment effect assessment is the lack of precise information on important clinical variables, including the treatment received and the response. Both treatment information and response often cannot be accurately captured by readily available EHR features and require labor intensive manual chart review to precisely annotate, which limits the number of available gold standard labels on these key variables. We consider average treatment effect (ATE) estimation under such a semi-supervised setting with a large number of unlabeled samples containing both confounders and imperfect EHR features for treatment and response. We derive the efficient influence function for ATE and use it to construct a semi-supervised multiple machine learning (SMMAL) estimator. We showcase that our SMMAL estimator is semi-parametric efficient with B-spline regression under low-dimensional smooth models. We develop the adaptive sparsity/model doubly robust estimation under high-dimensional logistic propensity score and outcome regression models. Results from simulation studies support the validity of our SMMAL method and its superiority over supervised benchmarks.

READ FULL TEXT
research
03/31/2018

Efficient and Robust Semi-Supervised Estimation of Average Treatment Effects in Electronic Medical Records Data

There is strong interest in conducting comparative effectiveness researc...
research
03/26/2020

Prior Adaptive Semi-supervised Learning with Application to EHR Phenotyping

Electronic Health Records (EHR) data, a rich source for biomedical resea...
research
03/07/2021

Risk Prediction with Imperfect Survival Outcome Information from Electronic Health Records

Readily available proxies for time of disease onset such as time of the ...
research
03/04/2022

Adaptive Semi-Supervised Inference for Optimal Treatment Decisions with Electronic Medical Record Data

A treatment regime is a rule that assigns a treatment to patients based ...
research
05/11/2020

Counterfactual Propagation for Semi-Supervised Individual Treatment Effect Estimation

Individual treatment effect (ITE) represents the expected improvement in...
research
02/09/2023

Surrogate-Assisted Federated Learning of high dimensional Electronic Health Record Data

Surrogate variables in electronic health records (EHR) play an important...
research
10/18/2021

Semi-supervised Approach to Event Time Annotation Using Longitudinal Electronic Health Records

Large clinical datasets derived from insurance claims and electronic hea...

Please sign up or login with your details

Forgot password? Click here to reset