Identification of Latent Variables From Graphical Model Residuals

01/07/2021
by   Boris Hayete, et al.
0

Graph-based causal discovery methods aim to capture conditional independencies consistent with the observed data and differentiate causal relationships from indirect or induced ones. Successful construction of graphical models of data depends on the assumption of causal sufficiency: that is, that all confounding variables are measured. When this assumption is not met, learned graphical structures may become arbitrarily incorrect and effects implied by such models may be wrongly attributed, carry the wrong magnitude, or mis-represent direction of correlation. Wide application of graphical models to increasingly less curated "big data" draws renewed attention to the unobserved confounder problem. We present a novel method that aims to control for the latent space when estimating a DAG by iteratively deriving proxies for the latent space from the residuals of the inferred model. Under mild assumptions, our method improves structural inference of Gaussian graphical models and enhances identifiability of the causal effect. In addition, when the model is being used to predict outcomes, it un-confounds the coefficients on the parents of the outcomes and leads to improved predictive performance when out-of-sample regime is very different from the training data. We show that any improvement of prediction of an outcome is intrinsically capped and cannot rise beyond a certain limit as compared to the confounded model. We extend our methodology beyond GGMs to ordinal variables and nonlinear cases. Our R package provides both PCA and autoencoder implementations of the methodology, suitable for GGMs with some guarantees and for better performance in general cases but without such guarantees.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/14/2021

Learning Gaussian Graphical Models with Latent Confounders

Gaussian Graphical models (GGM) are widely used to estimate the network ...
research
03/17/2012

Learning loopy graphical models with latent variables: Efficient methods and guarantees

The problem of structure estimation in graphical models with latent vari...
research
07/20/2018

Learning the effect of latent variables in Gaussian Graphical models with unobserved variables

The edge structure of the graph defining an undirected graphical model d...
research
08/15/2023

Nonlinearity, Feedback and Uniform Consistency in Causal Structural Learning

The goal of Causal Discovery is to find automated search methods for lea...
research
07/10/2019

Identifying mediating variables with graphical models: an application to the study of causal pathways in people living with HIV

We empirically demonstrate that graphical models can be a valuable tool ...
research
02/02/2021

Time Adaptive Gaussian Model

Multivariate time series analysis is becoming an integral part of data a...
research
07/20/2020

Moment-Matching Graph-Networks for Causal Inference

In this note we explore a fully unsupervised deep-learning framework for...

Please sign up or login with your details

Forgot password? Click here to reset