Causal Discovery in Heterogeneous Environments Under the Sparse Mechanism Shift Hypothesis

06/04/2022
by   Ronan Perry, et al.
11

Machine learning approaches commonly rely on the assumption of independent and identically distributed (i.i.d.) data. In reality, however, this assumption is almost always violated due to distribution shifts between environments. Although valuable learning signals can be provided by heterogeneous data from changing distributions, it is also known that learning under arbitrary (adversarial) changes is impossible. Causality provides a useful framework for modeling distribution shifts, since causal models encode both observational and interventional distributions. In this work, we explore the sparse mechanism shift hypothesis, which posits that distribution shifts occur due to a small number of changing causal conditionals. Motivated by this idea, we apply it to learning causal structure from heterogeneous environments, where i.i.d. data only allows for learning an equivalence class of graphs without restrictive assumptions. We propose the Mechanism Shift Score (MSS), a score-based approach amenable to various empirical estimators, which provably identifies the entire causal structure with high probability if the sparse mechanism shift hypothesis holds. Empirically, we verify behavior predicted by the theory and compare multiple estimators and score functions to identify the best approaches in practice. Compared to other methods, we show how MSS bridges a gap by both being nonparametric as well as explicitly leveraging sparse changes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/05/2019

Causal Discovery and Hidden Driving Force Estimation from Nonstationary/Heterogeneous Data

It is commonplace to encounter nonstationary or heterogeneous data. Such...
research
11/05/2022

Adversarial Causal Augmentation for Graph Covariate Shift

Out-of-distribution (OOD) generalization on graphs is drawing widespread...
research
02/10/2022

Learning Latent Causal Dynamics

One critical challenge of time-series modeling is how to learn and quick...
research
01/13/2020

Stake Shift in Major Cryptocurrencies: An Empirical Study

In the proof-of-stake (PoS) paradigm for maintaining decentralized, perm...
research
06/30/2023

iSCAN: Identifying Causal Mechanism Shifts among Nonlinear Additive Noise Models

Structural causal models (SCMs) are widely used in various disciplines t...
research
06/13/2022

Invariant Structure Learning for Better Generalization and Causal Explainability

Learning the causal structure behind data is invaluable for improving ge...
research
07/28/2023

Optimal multi-environment causal regularization

In this manuscript we derive the optimal out-of-sample causal predictor ...

Please sign up or login with your details

Forgot password? Click here to reset