Simple Sorting Criteria Help Find the Causal Order in Additive Noise Models

03/31/2023
by   Alexander G. Reisach, et al.
0

Additive Noise Models (ANM) encode a popular functional assumption that enables learning causal structure from observational data. Due to a lack of real-world data meeting the assumptions, synthetic ANM data are often used to evaluate causal discovery algorithms. Reisach et al. (2021) show that, for common simulation parameters, a variable ordering by increasing variance is closely aligned with a causal order and introduce var-sortability to quantify the alignment. Here, we show that not only variance, but also the fraction of a variable's variance explained by all others, as captured by the coefficient of determination R^2, tends to increase along the causal order. Simple baseline algorithms can use R^2-sortability to match the performance of established methods. Since R^2-sortability is invariant under data rescaling, these algorithms perform equally well on standardized or rescaled data, addressing a key limitation of algorithms exploiting var-sortability. We characterize and empirically assess R^2-sortability for different simulation parameters. We show that all simulation parameters can affect R^2-sortability and must be chosen deliberately to control the difficulty of the causal discovery task and the real-world plausibility of the simulated data. We provide an implementation of the sortability measures and sortability-based algorithms in our library CausalDisco (https://github.com/CausalDisco/CausalDisco).

READ FULL TEXT

page 12

page 13

page 19

page 23

page 24

page 25

page 26

research
02/26/2021

Beware of the Simulated DAG! Varsortability in Additive Noise Models

Additive noise models are a class of causal models in which each variabl...
research
04/05/2023

A step towards the applicability of algorithms based on invariant causal learning on observational data

Machine learning can benefit from causal discovery for interpretation an...
research
10/12/2022

Diffusion Models for Causal Discovery via Topological Ordering

Discovering causal relations from observational data becomes possible wi...
research
06/14/2022

Tearing Apart NOTEARS: Controlling the Graph Prediction via Variance Manipulation

Simulations are ubiquitous in machine learning. Especially in graph lear...
research
12/11/2014

Distinguishing cause from effect using observational data: methods and benchmarks

The discovery of causal relationships from purely observational data is ...
research
10/18/2018

An Upper Bound for Random Measurement Error in Causal Discovery

Causal discovery algorithms infer causal relations from data based on se...
research
03/06/2023

Boosting Differentiable Causal Discovery via Adaptive Sample Reweighting

Under stringent model type and variable distribution assumptions, differ...

Please sign up or login with your details

Forgot password? Click here to reset