On doubly robust inference for double machine learning

07/13/2021
by Oliver Dukes, et al.

Due to concerns about parametric model misspecification, there is interest in using machine learning to adjust for confounding when evaluating the causal effect of an exposure on an outcome. Unfortunately, exposure effect estimators that rely on machine learning predictions are generally subject to so-called plug-in bias, which can render naive p-values and confidence intervals invalid. Progress has been made via proposals like targeted maximum likelihood estimation and more recently double machine learning, which rely on learning the conditional mean of both the outcome and exposure. Valid inference can then be obtained so long as both predictions converge (sufficiently fast) to the truth. Focusing on partially linear regression models, we show that a specific implementation of the machine learning techniques can yield exposure effect estimators that have small bias even when one of the first-stage predictions does not converge to the truth. The resulting tests and confidence intervals are doubly robust. We also show that the proposed estimators may fail to be regular when only one nuisance parameter is consistently estimated; nevertheless, we observe in simulation studies that our proposal leads to reduced bias and improved confidence interval coverage in moderate samples.
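To make the setting concrete, the following is a minimal sketch of the standard cross-fitted "partialling-out" estimator for a partially linear model Y = theta*A + g(L) + error, where A is the exposure and L a confounder. It is not the specific doubly robust implementation proposed in the paper, only the generic double machine learning baseline the abstract refers to. The function names, the histogram-regression learner standing in for a machine learning method, and the simulation design (true theta = 1, exposure mean sin(L), outcome nuisance L) are all illustrative assumptions.

```python
import math
import random


def fit_bin_means(x, y, n_bins=20, lo=-3.0, hi=3.0):
    """Hypothetical stand-in learner: histogram regression of y on scalar x."""
    width = (hi - lo) / n_bins

    def bin_of(v):
        # Clamp so that values on or beyond the boundary fall in an end bin.
        return min(n_bins - 1, max(0, int((v - lo) / width)))

    sums, counts = [0.0] * n_bins, [0] * n_bins
    for xi, yi in zip(x, y):
        b = bin_of(xi)
        sums[b] += yi
        counts[b] += 1
    overall = sum(y) / len(y)  # fallback prediction for empty bins
    means = [s / c if c > 0 else overall for s, c in zip(sums, counts)]
    return lambda v: means[bin_of(v)]


def dml_partially_linear(L, A, Y, n_folds=2, seed=0):
    """Cross-fitted partialling-out estimate of theta in Y = theta*A + g(L) + error."""
    idx = list(range(len(Y)))
    random.Random(seed).shuffle(idx)
    folds = [idx[k::n_folds] for k in range(n_folds)]
    num = den = 0.0
    for k, test in enumerate(folds):
        train = [i for j, f in enumerate(folds) if j != k for i in f]
        # First stage: learn E[A | L] and E[Y | L] on the training folds only.
        m_hat = fit_bin_means([L[i] for i in train], [A[i] for i in train])
        g_hat = fit_bin_means([L[i] for i in train], [Y[i] for i in train])
        # Second stage: regress outcome residuals on exposure residuals
        # using the held-out fold.
        for i in test:
            res_a = A[i] - m_hat(L[i])
            res_y = Y[i] - g_hat(L[i])
            num += res_a * res_y
            den += res_a * res_a
    return num / den


def simulate(n=4000, theta=1.0, seed=1):
    """Assumed data-generating process with confounding through L."""
    rng = random.Random(seed)
    L = [rng.uniform(-3.0, 3.0) for _ in range(n)]
    A = [math.sin(l) + rng.gauss(0.0, 1.0) for l in L]
    Y = [theta * a + l + rng.gauss(0.0, 1.0) for a, l in zip(A, L)]
    return L, A, Y


L, A, Y = simulate()
theta_hat = dml_partially_linear(L, A, Y)

# Unadjusted slope of Y on A, for comparison; it ignores the confounder L.
mean_a, mean_y = sum(A) / len(A), sum(Y) / len(Y)
naive_hat = (sum((a - mean_a) * (y - mean_y) for a, y in zip(A, Y))
             / sum((a - mean_a) ** 2 for a in A))
print("cross-fitted estimate:", theta_hat)
print("unadjusted estimate:", naive_hat)
```

In this sketch both first-stage predictions can approximate the truth, which is the situation in which standard double machine learning inference is valid. The abstract's contribution concerns what happens when one of the two predictions does not converge, which this baseline does not address.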


research
04/21/2020

Machine learning for causal inference: on the use of cross-fit estimators

Modern causal inference methods allow machine learning to be used to wea...
research
01/29/2021

Regularizing Double Machine Learning in Partially Linear Endogenous Models

We estimate the linear coefficient in a partially linear model with conf...
research
11/25/2021

Network regression and supervised centrality estimation

The centrality in a network is a popular metric for agents' network posi...
research
10/18/2021

Double Robust Mass-Imputation with Matching Estimators

This paper proposes using a method named Double Score Matching (DSM) to ...
research
02/15/2023

Cross-Validated Decision Trees with Targeted Maximum Likelihood Estimation for Nonparametric Causal Mixtures Analysis

Exposure to mixtures of chemicals, such as drugs, pollutants, and nutrie...
research
04/08/2019

On assumption-free tests and confidence intervals for causal effects estimated by machine learning

For many causal effect parameters ψ of interest doubly robust machine le...
