Causal Regularization: On the trade-off between in-sample risk and out-of-sample risk guarantees

05/03/2022
by   Lucas Kania, et al.
0

In recent decades, a number of ways of dealing with causality in practice, such as propensity score matching, the PC algorithm and invariant causal prediction, have been introduced. Besides its interpretational appeal, the causal model provides the best out-of-sample prediction guarantees. In this paper, we study the identification of causal-like models from in-sample data that provide out-of-sample risk guarantees when predicting a target variable from a set of covariates. Whereas ordinary least squares provides the best in-sample risk with limited out-of-sample guarantees, causal models have the best out-of-sample guarantees but achieve an inferior in-sample risk. By defining a trade-off of these properties, we introduce causal regularization. As the regularization is increased, it provides estimators whose risk is more stable across sub-samples at the cost of increasing their overall in-sample risk. The increased risk stability is shown to lead to out-of-sample risk guarantees. We provide finite sample risk bounds for all models and prove the adequacy of cross-validation for attaining these bounds.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/14/2020

Provable More Data Hurt in High Dimensional Least Squares Estimator

This paper investigates the finite-sample prediction risk of the high-di...
research
05/07/2020

Distributional Robustness of K-class Estimators and the PULSE

In causal settings, such as instrumental variable settings, it is well k...
research
06/11/2021

Shall we count the living or the dead?

In the 1958 paper "Shall we count the living or the dead", Mindel C. She...
research
11/18/2021

Causal Forecasting:Generalization Bounds for Autoregressive Models

Despite the increasing relevance of forecasting methods, the causal impl...
research
06/15/2022

Finite-Sample Guarantees for High-Dimensional DML

Debiased machine learning (DML) offers an attractive way to estimate tre...
research
06/13/2020

Risk Variance Penalization: From Distributional Robustness to Causality

Learning under multi-environments often requires the ability of out-of-d...
research
06/27/2020

Evaluation of Causal Structure Learning Algorithms via Risk Estimation

Recent years have seen many advances in methods for causal structure lea...

Please sign up or login with your details

Forgot password? Click here to reset