Integrating overlapping datasets using bivariate causal discovery

by   Anish Dhir, et al.

Causal knowledge is vital for effective reasoning in science, as causal relations, unlike correlations, allow one to reason about the outcomes of interventions. Algorithms that can discover causal relations from observational data are based on the assumption that all variables have been jointly measured in a single dataset. In many cases this assumption fails. Previous approaches to overcoming this shortcoming devised algorithms that returned all joint causal structures consistent with the conditional independence information contained in each individual dataset. But, as conditional independence tests only determine causal structure up to Markov equivalence, the number of consistent joint structures returned by these approaches can be quite large. The last decade has seen the development of elegant algorithms for discovering causal relations beyond conditional independence, which can distinguish among Markov equivalent structures. In this work we adapt and extend these so-called bivariate causal discovery algorithms to the problem of learning consistent causal structures from multiple datasets with overlapping variables belonging to the same generating process, providing a sound and complete algorithm that outperforms previous approaches on synthetic and real data.



There are no comments yet.


page 1

page 2

page 3

page 4


Causal Generative Neural Networks

We introduce CGNN, a framework to learn functional causal models as gene...

Conditionally-additive-noise Models for Structure Learning

Constraint-based structure learning algorithms infer the causal structur...

Conditional Independences and Causal Relations implied by Sets of Equations

Real-world systems are often modelled by sets of equations with exogenou...

Causal Discovery from Changes

We propose a new method of discovering causal structures, based on the d...

Joint Causal Inference from Observational and Experimental Datasets

We introduce Joint Causal Inference (JCI), a powerful formulation of cau...

On the Sample Complexity of Causal Discovery and the Value of Domain Expertise

Causal discovery methods seek to identify causal relations between rando...

A Recursive Markov Blanket-Based Approach to Causal Structure Learning

One of the main approaches for causal structure learning is constraint-b...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.