Causal inference is not just a statistics problem

04/05/2023
by Lucy D'Agostino McGowan, et al.

This paper introduces a collection of four data sets, similar to Anscombe's Quartet, that highlight the challenges of estimating causal effects. Each data set is generated by a distinct causal mechanism: the first involves a collider, the second a confounder, the third a mediator, and the fourth M-bias induced by an included factor. The paper includes a mathematical summary of each data set, as well as directed acyclic graphs that depict the relationships between the variables. Although the statistical summaries and visualizations of the four data sets are identical, the true causal effect differs, and estimating it correctly requires knowledge of the data-generating mechanism. These example data sets can help practitioners better understand the assumptions underlying causal inference methods and emphasize the importance of gathering information beyond what statistical tools alone can provide. The paper also includes R code for reproducing all figures and provides the data sets through an R package named quartets.
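
The point that the same unadjusted association can arise from different causal mechanisms can be sketched with a small simulation. The example below is an illustration only: the coefficients, sample size, seed, and all function names are assumptions chosen for this sketch, and it is written in plain Python rather than with the paper's R code or the quartets package. It simulates one collider mechanism and one confounder mechanism, then compares the slope of the outcome on the exposure with and without adjusting for the third variable.

```python
import random


def slope(x, y):
    """Slope from a simple least-squares regression of y on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    return sxy / sxx


def adjusted_slope(x, y, z):
    """Coefficient on x when y is regressed on both x and z.

    Uses the Frisch-Waugh-Lovell result: remove the linear part of z
    from x and from y, then regress residual on residual.
    """
    b_xz = slope(z, x)  # slope of x on z
    b_yz = slope(z, y)  # slope of y on z
    res_x = [a - b_xz * c for a, c in zip(x, z)]
    res_y = [b - b_yz * c for b, c in zip(y, z)]
    return slope(res_x, res_y)


def simulate_collider(n, rng):
    """Hypothetical collider: x -> y, and both x and y cause z.

    The true effect of x on y is 1.0 (assumed for illustration).
    """
    x = [rng.gauss(0, 1) for _ in range(n)]
    y = [a + rng.gauss(0, 1) for a in x]
    z = [0.45 * a + 0.77 * b + rng.gauss(0, 1) for a, b in zip(x, y)]
    return x, y, z


def simulate_confounder(n, rng):
    """Hypothetical confounder: z causes both x and y, and x -> y.

    The true effect of x on y is 0.5 (assumed for illustration).
    """
    z = [rng.gauss(0, 1) for _ in range(n)]
    x = [c + rng.gauss(0, 1) for c in z]
    y = [0.5 * a + c + rng.gauss(0, 1) for a, c in zip(x, z)]
    return x, y, z


def estimate(simulate, n=20000, seed=1):
    """Return (unadjusted, adjusted) slope estimates for one mechanism."""
    rng = random.Random(seed)
    x, y, z = simulate(n, rng)
    return slope(x, y), adjusted_slope(x, y, z)


for name, sim in [("collider", simulate_collider),
                  ("confounder", simulate_confounder)]:
    unadjusted, adjusted = estimate(sim)
    print(f"{name}: unadjusted={unadjusted:.2f}, adjusted for z={adjusted:.2f}")
```

Under these assumed coefficients, both mechanisms give an unadjusted slope near 1. In the collider case that unadjusted value matches the true effect and adjusting for z biases it downward (to roughly 0.41 in expectation); in the confounder case the unadjusted value is biased and adjusting for z recovers the true effect of 0.5. Nothing in the data alone indicates which analysis is the right one.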

research
12/12/2020

From controlled to undisciplined data: estimating causal effects in the era of data science using a potential outcome framework

This paper discusses the fundamental principles of causal inference - th...
research
11/21/2022

Applications of statistical causal inference in software engineering

This paper reviews existing work in software engineering that applies st...
research
07/17/2023

An R package for parametric estimation of causal effects

This article explains the usage of R package CausalModels, which is publ...
research
10/16/2012

Causal Discovery of Linear Cyclic Models from Multiple Experimental Data Sets with Overlapping Variables

Much of scientific data is collected as randomized experiments interveni...
research
04/19/2021

Everything Has a Cause: Leveraging Causal Inference in Legal Text Analysis

Causal inference is the process of capturing cause-effect relationship a...
research
03/24/2020

Symbolic Computation of Tight Causal Bounds

Causal inference involves making a set of assumptions about the nature o...
research
05/18/2020

Towards Causal Inference for Spatio-Temporal Data: Conflict and Forest Loss in Colombia

In many data scientific problems, we are interested not only in modeling...
