Omitted Variable Bias in Machine Learned Causal Models

12/26/2021
by   Victor Chernozhukov, et al.
4

We derive general, yet simple, sharp bounds on the size of the omitted variable bias for a broad class of causal parameters that can be identified as linear functionals of the conditional expectation function of the outcome. Such functionals encompass many of the traditional targets of investigation in causal inference studies, such as, for example, (weighted) average of potential outcomes, average treatment effects (including subgroup effects, such as the effect on the treated), (weighted) average derivatives, and policy effects from shifts in covariate distribution – all for general, nonparametric causal models. Our construction relies on the Riesz-Frechet representation of the target functional. Specifically, we show how the bound on the bias depends only on the additional variation that the latent variables create both in the outcome and in the Riesz representer for the parameter of interest. Moreover, in many important cases (e.g, average treatment effects in partially linear models, or in nonseparable models with a binary treatment) the bound is shown to depend on two easily interpretable quantities: the nonparametric partial R^2 (Pearson's "correlation ratio") of the unobserved variables with the treatment and with the outcome. Therefore, simple plausibility judgments on the maximum explanatory power of omitted variables (in explaining treatment and outcome variation) are sufficient to place overall bounds on the size of the bias. Finally, leveraging debiased machine learning, we provide flexible and efficient statistical inference methods to estimate the components of the bounds that are identifiable from the observed distribution.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/09/2022

Estimating Heterogeneous Bounds for Treatment Effects under Sample Selection and Non-response

In this paper we propose a method for nonparametric estimation and infer...
research
06/05/2019

Measurement errors in the binary instrumental variable model

Instrumental variable methods can identify causal effects even when the ...
research
07/15/2021

Obtaining Causal Information by Merging Datasets with MAXENT

The investigation of the question "which treatment has a causal effect o...
research
10/28/2020

Deep Learning for Individual Heterogeneity

We propose a methodology for effectively modeling individual heterogenei...
research
10/14/2022

Partial Identification of Treatment Effects with Implicit Generative Models

We consider the problem of partial identification, the estimation of bou...
research
10/03/2018

A General Weighted Average Representation of the Ordinary and Two-Stage Least Squares Estimands

It is standard practice in applied work to study the effect of a binary ...
research
12/26/2022

Orthogonal Series Estimation for the Ratio of Conditional Expectation Functions

In various fields of data science, researchers are often interested in e...

Please sign up or login with your details

Forgot password? Click here to reset