Double and Single Descent in Causal Inference with an Application to High-Dimensional Synthetic Control

05/01/2023
by   Jann Spiess, et al.
0

Motivated by a recent literature on the double-descent phenomenon in machine learning, we consider highly over-parametrized models in causal inference, including synthetic control with many control units. In such models, there may be so many free parameters that the model fits the training data perfectly. As a motivating example, we first investigate high-dimensional linear regression for imputing wage data, where we find that models with many more covariates than sample size can outperform simple ones. As our main contribution, we document the performance of high-dimensional synthetic control estimators with many control units. We find that adding control units can help improve imputation performance even beyond the point where the pre-treatment fit is perfect. We then provide a unified theoretical perspective on the performance of these high-dimensional models. Specifically, we show that more complex models can be interpreted as model-averaging estimators over simpler ones, which we link to an improvement in average performance. This perspective yields concrete insights into the use of synthetic control when control units are many relative to the number of pre-treatment periods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/19/2022

Causal Inference from Small High-dimensional Datasets

Many methods have been proposed to estimate treatment effects with obser...
research
04/18/2021

Least Squares with Error in Variables

Error-in-variables regression is a common ingredient in treatment effect...
research
03/01/2022

Neural Score Matching for High-Dimensional Causal Inference

Traditional methods for matching in causal inference are impractical for...
research
05/25/2020

An alternative to synthetic control for models with many covariates under sparsity

The synthetic control method is a an econometric tool to evaluate causal...
research
12/28/2017

Orthogonal Machine Learning for Demand Estimation: High Dimensional Causal Inference in Dynamic Panels

There has been growing interest in how economists can import machine lea...
research
04/17/2020

Causal Inference in Case-Control Studies

We investigate identification of causal parameters in case-control and r...
research
10/10/2022

Uncertainty Quantification in Synthetic Controls with Staggered Treatment Adoption

We propose principled prediction intervals to quantify the uncertainty o...

Please sign up or login with your details

Forgot password? Click here to reset