Causal Order Identification to Address Confounding: Binary Variables

08/10/2021
by   Joe Suzuki, et al.
0

This paper considers an extension of the linear non-Gaussian acyclic model (LiNGAM) that determines the causal order among variables from a dataset when the variables are expressed by a set of linear equations, including noise. In particular, we assume that the variables are binary. The existing LiNGAM assumes that no confounding is present, which is restrictive in practice. Based on the concept of independent component analysis (ICA), this paper proposes an extended framework in which the mutual information among the noises is minimized. Another significant contribution is to reduce the realization of the shortest path problem. The distance between each pair of nodes expresses an associated mutual information value, and the path with the minimum sum (KL divergence) is sought. Although p! mutual information values should be compared, this paper dramatically reduces the computation when no confounding is present. The proposed algorithm finds the globally optimal solution, while the existing locally greedily seek the order based on hypothesis testing. We use the best estimator in the sense of Bayes/MDL that correctly detects independence for mutual information estimation. Experiments using artificial and actual data show that the proposed version of LiNGAM achieves significantly better performance, particularly when confounding is present.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/06/2020

Confounding Ghost Channels and Causality: A New Approach to Causal Information Flows

Information theory provides a fundamental framework for the quantificati...
research
01/28/2018

Probability Mass Exclusions and the Directed Components of Pointwise Mutual Information

The pointwise mutual information quantifies the mutual information betwe...
research
11/14/2022

The Best Path Algorithm automatic variables selection via High Dimensional Graphical Models

This paper proposes a new algorithm for an automatic variable selection ...
research
05/21/2018

Multiple Causal Inference with Latent Confounding

Causal inference from observational data requires assumptions. These ass...
research
01/30/2017

Interaction Information for Causal Inference: The Case of Directed Triangle

Interaction information is one of the multivariate generalizations of mu...
research
12/07/2018

Information-Distilling Quantizers

Let X and Y be dependent random variables. This paper considers the prob...
research
09/10/2019

Adversarial Orthogonal Regression: Two non-Linear Regressions for Causal Inference

We propose two nonlinear regression methods, named Adversarial Orthogona...

Please sign up or login with your details

Forgot password? Click here to reset