Causal bootstrapping

10/21/2019
by   Max A. Little, et al.
0

To draw scientifically meaningful conclusions and build reliable models of quantitative phenomena, cause and effect must be taken into consideration (either implicitly or explicitly). This is particularly challenging when the measurements are not from controlled experimental (interventional) settings, since cause and effect can be obscured by spurious, indirect influences. Modern predictive techniques from machine learning are capable of capturing high-dimensional, nonlinear relationships between variables while relying on few parametric or probabilistic model assumptions. However, since these techniques are associational, applied to observational data they are prone to picking up spurious influences from non-experimental (observational) data, making their predictions unreliable. Techniques from causal inference, such as probabilistic causal diagrams and do-calculus, provide powerful (nonparametric) tools for drawing causal inferences from such observational data. However, these techniques are often incompatible with modern, nonparametric machine learning algorithms since they typically require explicit probabilistic models. Here, we develop causal bootstrapping for augmenting classical nonparametric bootstrap resampling with information on the causal relationship between variables. This makes it possible to resample observational data such that, if it is possible to identify an interventional relationship from that data, new data representing that relationship can be simulated from the original observational data. In this way, we can use modern machine learning algorithms unaltered to make statistically powerful, yet causally-robust, predictions. We develop several causal bootstrapping algorithms for drawing interventional inferences from observational data, for classification and regression problems, and demonstrate, using synthetic and real-world examples, the value of this approach.

READ FULL TEXT
research
05/23/2023

Statistical causal inference methods for observational research in PER: a primer

Recent critiques of Physics Education Research (PER) studies have revoic...
research
12/07/2014

Visual Causal Feature Learning

We provide a rigorous definition of the visual cause of a behavior that ...
research
10/15/2021

Identifying Causal Influences on Publication Trends and Behavior: A Case Study of the Computational Linguistics Community

Drawing causal conclusions from observational real-world data is a very ...
research
02/22/2022

Effect Identification in Cluster Causal Diagrams

One pervasive task found throughout the empirical sciences is to determi...
research
05/10/2021

An introduction to causal reasoning in health analytics

A data science task can be deemed as making sense of the data and/or tes...
research
02/22/2020

Causal Inference in Genetic Trio Studies

We introduce a method to rigorously draw causal inferences—inferences im...
research
08/08/2023

SLEM: Machine Learning for Path Modeling and Causal Inference with Super Learner Equation Modeling

Causal inference is a crucial goal of science, enabling researchers to a...

Please sign up or login with your details

Forgot password? Click here to reset