A Targeted Approach to Confounder Selection for High-Dimensional Data

12/15/2021
by Asad Haris, et al.

We consider the problem of selecting confounders for adjustment from a potentially large set of covariates when estimating a causal effect. Recently, the high-dimensional propensity score (hdPS) method was developed for this task; hdPS ranks potential confounders by estimating an importance score for each variable and selects the top few variables. However, this ranking procedure is limited: it requires all variables to be binary. We propose an extension of hdPS to general types of response and confounder variables. We further develop a group importance score, which allows us to rank groups of potential confounders. The main challenge is that our parameter requires either the propensity score or the response model, both of which are vulnerable to model misspecification. We propose a targeted maximum likelihood estimator (TMLE) that allows the use of nonparametric machine learning tools for fitting these intermediate models. We establish asymptotic normality of our estimator, which in turn allows the construction of confidence intervals. We complement our work with numerical studies on simulated and real data.
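
To make the ranking step concrete, here is a minimal sketch of the kind of binary-only ranking that hdPS performs. It is not the extension proposed in this paper. It assumes the importance score is the absolute log of the Bross bias multiplier, which is the score commonly associated with hdPS, and it omits the usual handling of relative risks below 1 and of zero counts. The function names (`bross_bias`, `rank_covariates`) and the toy data are hypothetical and only for illustration.

```python
import math


def bross_bias(exposure, outcome, covariate):
    """Bross bias multiplier for one binary covariate (all inputs are 0/1 lists).

    Simplified sketch: assumes no empty groups and a nonzero outcome risk
    when the covariate is absent.
    """
    n_exposed = sum(exposure)
    n_unexposed = len(exposure) - n_exposed
    # Prevalence of the covariate among exposed and unexposed units.
    p_c1 = sum(c for a, c in zip(exposure, covariate) if a == 1) / n_exposed
    p_c0 = sum(c for a, c in zip(exposure, covariate) if a == 0) / n_unexposed
    # Relative risk of the outcome, covariate present versus absent.
    n_present = sum(covariate)
    n_absent = len(covariate) - n_present
    risk_present = sum(y for y, c in zip(outcome, covariate) if c == 1) / n_present
    risk_absent = sum(y for y, c in zip(outcome, covariate) if c == 0) / n_absent
    rr = risk_present / risk_absent
    return (p_c1 * (rr - 1) + 1) / (p_c0 * (rr - 1) + 1)


def rank_covariates(exposure, outcome, covariates):
    """Return covariate names sorted by |log(bias multiplier)|, largest first."""
    scores = {
        name: abs(math.log(bross_bias(exposure, outcome, values)))
        for name, values in covariates.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical toy data: 8 units with binary exposure, outcome, and covariates.
exposure = [1, 1, 1, 1, 0, 0, 0, 0]
outcome = [1, 1, 1, 0, 1, 0, 0, 0]
covariates = {
    "c_a": [1, 1, 1, 0, 0, 0, 0, 1],  # imbalanced across exposure groups
    "c_b": [1, 0, 1, 0, 1, 0, 1, 0],  # balanced, so its multiplier is 1
    "c_c": [1, 1, 0, 0, 1, 0, 0, 0],  # mildly imbalanced, strongly tied to outcome
}

ranking = rank_covariates(exposure, outcome, covariates)
top_k = ranking[:2]  # keep the top few variables for adjustment
print(ranking, top_k)
```

Every quantity in this sketch is a proportion of a 0/1 variable, which illustrates why a ranking of this form is tied to binary data and why a more general importance score is needed for other variable types.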

