Causal Inference on Multivariate and Mixed-Type Data

02/21/2017
by Alexander Marx, et al.

Given data over the joint distribution of two random variables X and Y, we consider the problem of inferring the most likely causal direction between X and Y. In particular, we consider the general case where both X and Y may be univariate or multivariate, and of the same or mixed data types. We take an information-theoretic approach, based on Kolmogorov complexity, from which it follows that first describing the data over the cause, and then the data over the effect given the cause, yields a shorter description than the reverse direction. The ideal score is not computable, but it can be approximated through the Minimum Description Length (MDL) principle. Based on MDL, we propose two scores, one for when both X and Y are of the same single data type, and one for when they are mixed-type. We model dependencies between X and Y using classification and regression trees. As inferring the optimal model is NP-hard, we propose Crack, a fast greedy algorithm to determine the most likely causal direction directly from the data. Empirical evaluation on a wide range of data shows that Crack reliably, and with high accuracy, infers the correct causal direction on both univariate and multivariate cause-effect pairs over both single and mixed-type data.
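
The decision rule in the abstract, comparing the cost of describing X and then Y given X against the reverse order, can be illustrated with a small toy sketch. This is not Crack and does not reproduce the paper's scores. It assumes univariate numeric data, a Gaussian code at a fixed resolution, and a single-split regression stump in place of full regression trees. All function names (`gaussian_bits`, `conditional_bits`, `infer_direction`) and the cost terms are hypothetical choices made for illustration only.

```python
import math


def gaussian_bits(values, resolution=0.01):
    """Approximate bits to encode `values` with a fitted Gaussian.

    Assumption: data are recorded at a fixed `resolution`, and the two
    parameters (mean, variance) cost 0.5 * log2(n) bits each.
    """
    n = len(values)
    mean = sum(values) / n
    # Floor the variance so constant data does not produce log2(0).
    var = max(sum((v - mean) ** 2 for v in values) / n, resolution ** 2)
    per_sample = 0.5 * math.log2(2 * math.pi * math.e * var) - math.log2(resolution)
    return n * per_sample + math.log2(n)


def conditional_bits(target, given, resolution=0.01):
    """Bits to encode `target` given `given`.

    Uses the cheaper of (a) ignoring `given`, or (b) a single split on
    `given` with a separate Gaussian on each side. One bit signals which
    option was chosen; a split position costs log2(n) bits.
    """
    n = len(target)
    best = 1 + gaussian_bits(target, resolution)
    order = sorted(range(n), key=lambda i: given[i])
    for k in range(2, n - 1):  # keep at least two points on each side
        if given[order[k - 1]] == given[order[k]]:
            continue  # cannot split between identical values
        left = [target[i] for i in order[:k]]
        right = [target[i] for i in order[k:]]
        bits = (1 + math.log2(n)
                + gaussian_bits(left, resolution)
                + gaussian_bits(right, resolution))
        best = min(best, bits)
    return best


def infer_direction(x, y):
    """Return (direction, bits for X->Y, bits for Y->X)."""
    x_to_y = gaussian_bits(x) + conditional_bits(y, x)
    y_to_x = gaussian_bits(y) + conditional_bits(x, y)
    if x_to_y < y_to_x:
        return "X->Y", x_to_y, y_to_x
    if y_to_x < x_to_y:
        return "Y->X", x_to_y, y_to_x
    return "undecided", x_to_y, y_to_x


# Synthetic example: Y is a step function of X plus small deterministic wiggle.
n = 200
x = [i / n for i in range(n)]
y = [(0.0 if xi < 0.5 else 10.0) + 0.1 * math.sin(7 * i) for i, xi in enumerate(x)]
direction, fwd_bits, bwd_bits = infer_direction(x, y)
print(direction, round(fwd_bits), round(bwd_bits))
```

On this synthetic data the step dependency is cheap to describe from X to Y, while the reverse direction must pay for the bimodal Y and for the spread of X on each side of the split, so the two totals differ. With a purely linear fit and Gaussian codes the two directions would cost about the same, which is why the sketch uses a split-based model.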
