High-Dimensional Feature Selection for Sample Efficient Treatment Effect Estimation

11/03/2020
by Kristjan Greenewald, et al.

The estimation of causal treatment effects from observational data is a fundamental problem in causal inference. To avoid bias, the effect estimator must control for all confounders. Practitioners therefore often collect data on as many covariates as possible to raise the chances of including the relevant confounders. While this addresses the bias, the increased dimensionality significantly raises the number of samples required to estimate the effect accurately. In this work, we consider the setting where, out of a large number of covariates X that satisfy strong ignorability, an unknown sparse subset S is sufficient to achieve zero bias, i.e., S is c-equivalent to X. We propose a common objective function that involves the outcomes across treatment cohorts and uses nonconvex joint sparsity regularization. Under a linear outcome model for Y and subgaussian covariates in each treatment cohort, this objective is guaranteed to recover S with high probability. As a result, the sample complexity of effect estimation scales with the cardinality of the sparse subset S and with log |X|, rather than with the cardinality of the full set X. We validate our approach with experiments on treatment effect estimation.
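The idea of selecting one shared sparse subset across treatment cohorts and then estimating the effect on that subset can be illustrated with a minimal sketch. This is not the paper's algorithm: the abstract describes a nonconvex joint sparsity regularizer, whereas the sketch below substitutes a convex group-lasso (l2,1) penalty solved by proximal gradient descent, followed by a per-cohort least-squares refit. The function names `joint_sparse_support` and `regression_adjusted_ate`, the penalty weight `lam`, and the synthetic data are all assumptions made for illustration.

```python
import numpy as np


def joint_sparse_support(X, t, y, lam, n_iter=500):
    """Select covariates shared by both cohorts with a group-lasso penalty.

    Assumption: a convex l2,1 penalty stands in for the nonconvex
    regularizer mentioned in the abstract.
    """
    cohorts = []
    for arm in (0, 1):
        Xa, ya = X[t == arm], y[t == arm]
        # Center each cohort so that no intercept is needed during selection.
        cohorts.append((Xa - Xa.mean(axis=0), ya - ya.mean()))
    d = X.shape[1]
    B = np.zeros((d, 2))  # column k holds the coefficients of cohort k
    # Step size from the largest Lipschitz constant of the cohort losses.
    step = 1.0 / max(np.linalg.norm(Xa, 2) ** 2 / len(ya) for Xa, ya in cohorts)
    for _ in range(n_iter):
        grad = np.column_stack(
            [Xa.T @ (Xa @ B[:, k] - ya) / len(ya) for k, (Xa, ya) in enumerate(cohorts)]
        )
        Z = B - step * grad
        # Row-wise soft thresholding: a covariate is kept or dropped jointly.
        norms = np.linalg.norm(Z, axis=1, keepdims=True)
        B = np.maximum(0.0, 1.0 - step * lam / np.maximum(norms, 1e-12)) * Z
    return np.flatnonzero(np.linalg.norm(B, axis=1) > 0)


def regression_adjusted_ate(X, t, y, support):
    """Refit ordinary least squares per cohort on the selected covariates."""
    Xs = np.column_stack([np.ones(len(y)), X[:, support]])
    preds = []
    for arm in (0, 1):
        mask = t == arm
        coef, *_ = np.linalg.lstsq(Xs[mask], y[mask], rcond=None)
        preds.append(Xs @ coef)  # predicted outcome under this arm for everyone
    return float(np.mean(preds[1] - preds[0]))


# Synthetic illustration (hypothetical data): 50 covariates, 3 of which matter.
rng = np.random.default_rng(0)
n, d = 600, 50
X = rng.standard_normal((n, d))
beta = np.zeros(d)
beta[:3] = [1.5, -2.0, 1.0]
# Treatment assignment depends on X[:, 0], so covariate 0 is a confounder.
t = (rng.random(n) < 1.0 / (1.0 + np.exp(-X[:, 0]))).astype(int)
true_ate = 2.0
y = X @ beta + true_ate * t + 0.5 * rng.standard_normal(n)

support = joint_sparse_support(X, t, y, lam=0.3)
ate = regression_adjusted_ate(X, t, y, support)
```

The row-wise thresholding is what makes the sparsity joint: a covariate is removed only when its coefficients in both cohorts are small together, so both outcome models end up using the same subset. The refit step then runs on the selected columns only, which is where the reduced dimensionality would be expected to help.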


research
08/22/2020

Hi-CI: Deep Causal Inference in High Dimensions

We address the problem of counterfactual regression using causal inferen...
research
07/06/2020

Treatment effect bias from sample snooping: blinding outcomes is neither necessary nor sufficient

Popular guidance on observational data analysis states that outcomes sho...
research
05/10/2021

Model-Assisted Uniformly Honest Inference for Optimal Treatment Regimes in High Dimension

This paper develops new tools to quantify uncertainty in optimal decisio...
research
01/24/2020

Confounder selection strategies targeting stable treatment effect estimators

Propensity score methods are widely adopted in observational studies to ...
research
05/23/2022

An improved neural network model for treatment effect estimation

Nowadays, in many scientific and industrial fields there is an increasin...
research
03/23/2022

Treatment Effect Estimation with Efficient Data Aggregation

Data aggregation, also known as meta analysis, is widely used to synthes...
research
11/03/2021

Causal-BALD: Deep Bayesian Active Learning of Outcomes to Infer Treatment-Effects from Observational Data

Estimating personalized treatment effects from high-dimensional observat...
