Causal Discovery using Compression-Complexity Measures

10/19/2020
by   Pranay SY, et al.
0

Causal inference is one of the most fundamental problems across all domains of science. We address the problem of inferring a causal direction from two observed discrete symbolic sequences X and Y. We present a framework which relies on lossless compressors for inferring context-free grammars (CFGs) from sequence pairs and quantifies the extent to which the grammar inferred from one sequence compresses the other sequence. We infer X causes Y if the grammar inferred from X better compresses Y than in the other direction. To put this notion to practice, we propose three models that use the Compression-Complexity Measures (CCMs) - Lempel-Ziv (LZ) complexity and Effort-To-Compress (ETC) to infer CFGs and discover causal directions. We evaluate these models on synthetic and real-world benchmarks and empirically observe performances competitive with current state-of-the-art methods. Lastly, we present a unique application of the proposed models for causal inference directly from pairs of genome sequences belonging to the SARS-CoV-2 virus. Using a large number of sequences, we show that our models capture directed causal information exchange between sequence pairs, presenting novel opportunities for addressing key issues such as contact-tracing, motif discovery, evolution of virulence and pathogenicity in future applications.

READ FULL TEXT
research
04/12/2018

Causal Inference via Kernel Deviance Measures

Discovering the causal structure among a set of variables is a fundament...
research
03/21/2018

Causal Inference on Discrete Data via Estimating Distance Correlations

In this paper, we deal with the problem of inferring causal directions w...
research
02/22/2017

Causal Inference by Stochastic Complexity

The algorithmic Markov condition states that the most likely causal dire...
research
09/24/2021

Causal Analysis of Carnatic Music: A Preliminary Study

The musicological analysis of Carnatic music is challenging, owing to it...
research
10/31/2019

Causal Inference via Conditional Kolmogorov Complexity using MDL Binning

Recent developments have linked causal inference with Algorithmic Inform...
research
06/05/2020

Parallel ensemble methods for causal direction inference

Inferring the causal direction between two variables from their observat...

Please sign up or login with your details

Forgot password? Click here to reset