Causal Clustering for 1-Factor Measurement Models on Data with Various Types

by   Shuyan Wang, et al.

The tetrad constraint is a condition of which the satisfaction signals a rank reduction of a covariance submatrix and is used to design causal discovery algorithms that detects the existence of latent (unmeasured) variables, such as FOFC. Initially such algorithms only work for cases where the measured and latent variables are all Gaussian and have linear relations (Gaussian-Gaussian Case). It has been shown that a unidimentional latent variable model implies tetrad constraints when the measured and latent variables are all binary (Binary-Binary case). This paper proves that the tetrad constraint can also be entailed when the measured variables are of mixed data types and when the measured variables are discrete and the latent common causes are continuous, which implies that any clustering algorithm relying on this constraint can work on those cases. Each case is shown with an example and a proof. The performance of FOFC on mixed data is shown by simulation studies and is compared with some algorithms with similar functions.


Detecting Causal Relations in the Presence of Unmeasured Variables

The presence of latent variables can greatly complicate inferences about...

High Dimensional Semiparametric Latent Graphical Model for Mixed Data

Graphical models are commonly used tools for modeling multivariate rando...

Invariant Gaussian Process Latent Variable Models and Application in Causal Discovery

In nonlinear latent variable models or dynamic models, if we consider th...

Measurement Dependence Inducing Latent Causal Models

We consider the task of causal structure learning over measurement depen...

ASP-based Discovery of Semi-Markovian Causal Models under Weaker Assumptions

In recent years the possibility of relaxing the so-called Faithfulness a...

Factor copula models for mixed data

We develop factor copula models for analysing the dependence among mixed...

A Graphical Model for Fusing Diverse Microbiome Data

This paper develops a Bayesian graphical model for fusing disparate type...