Outcome model free causal inference with ultra-high dimensional covariates

07/28/2020
by   Dingke Tang, et al.
0

Causal inference has been increasingly reliant on observational studies with rich covariate information. To build tractable causal models, including the propensity score models, it is imperative to first extract important features from high dimensional data. Unlike the familiar task of variable selection for prediction modeling, our feature selection procedure aims to control for confounding while maintaining efficiency in the resulting causal effect estimate. Previous empirical studies imply that one should aim to include all predictors of the outcome, rather than the treatment, in the propensity score model. In this paper, we formalize this intuition through rigorous proofs, and propose the causal ball screening for selecting these variables from modern ultra-high dimensional data sets. A distinctive feature of our proposal is that we do not require any modeling on the outcome regression, thus providing robustness against misspecification of the functional form or violation of smoothness conditions. Our theoretical analyses show that the proposed procedure enjoys a number of oracle properties including model selection consistency, normality and efficiency. Synthetic and real data analyses show that our proposal performs favorably with existing methods in a range of realistic settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/17/2023

A New Covariate Selection Strategy for High Dimensional Data in Causal Effect Estimation with Multivariate Treatments

Selection of covariates is crucial in the estimation of average treatmen...
research
09/11/2021

Propensity Score Adapted Covariate Selection for Causal Inference

In this paper, we propose a propensity score adapted variable selection ...
research
05/08/2019

Consistent Fixed-Effects Selection in Ultra-high dimensional Linear Mixed Models with Error-Covariate Endogeneity

Recently, applied sciences, including longitudinal and clustered studies...
research
06/30/2017

Collaborative-controlled LASSO for Constructing Propensity Score-based Estimators in High-Dimensional Data

Propensity score (PS) based estimators are increasingly used for causal ...
research
12/18/2020

MASSIVE: Tractable and Robust Bayesian Learning of Many-Dimensional Instrumental Variable Models

The recent availability of huge, many-dimensional data sets, like those ...
research
12/26/2016

Generalized Optimal Matching Methods for Causal Inference

We develop an encompassing framework for matching, covariate balancing, ...

Please sign up or login with your details

Forgot password? Click here to reset