On the Sample Complexity of Causal Discovery and the Value of Domain Expertise

02/05/2021
by   Samir Wadhwa, et al.
0

Causal discovery methods seek to identify causal relations between random variables from purely observational data, as opposed to actively collected experimental data where an experimenter intervenes on a subset of correlates. One of the seminal works in this area is the Inferred Causation algorithm, which guarantees successful causal discovery under the assumption of a conditional independence (CI) oracle: an oracle that can states whether two random variables are conditionally independent given another set of random variables. Practical implementations of this algorithm incorporate statistical tests for conditional independence, in place of a CI oracle. In this paper, we analyze the sample complexity of causal discovery algorithms without a CI oracle: given a certain level of confidence, how many data points are needed for a causal discovery algorithm to identify a causal structure? Furthermore, our methods allow us to quantify the value of domain expertise in terms of data samples. Finally, we demonstrate the accuracy of these sample rates with numerical examples, and quantify the benefits of sparsity priors and known causal directions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/17/2020

A Bayesian Nonparametric Conditional Two-sample Test with an Application to Local Causal Discovery

The performance of constraint-based causal discovery algorithms is promi...
research
11/29/2022

Towards Dynamic Causal Discovery with Rare Events: A Nonparametric Conditional Independence Test

Causal phenomena associated with rare events occur across a wide range o...
research
03/03/2014

On the Intersection Property of Conditional Independence and its Application to Causal Discovery

This work investigates the intersection property of conditional independ...
research
07/05/2017

SADA: A General Framework to Support Robust Causation Discovery with Theoretical Guarantee

Causation discovery without manipulation is considered a crucial problem...
research
11/16/2022

Identifying the Causes of Pyrocumulonimbus (PyroCb)

A first causal discovery analysis from observational data of pyroCb (sto...
research
10/06/2020

Recovering Causal Structures from Low-Order Conditional Independencies

One of the common obstacles for learning causal models from data is that...
research
05/11/2023

Reinterpreting causal discovery as the task of predicting unobserved joint statistics

If X,Y,Z denote sets of random variables, two different data sources may...

Please sign up or login with your details

Forgot password? Click here to reset