Bayesian Learning of Random Graphs & Correlation Structure of Multivariate Data, with Distance between Graphs

10/31/2017
by   Kangrui Wang, et al.
0

We present a method for the simultaneous Bayesian learning of the correlation matrix and graphical model of a multivariate dataset, using Metropolis-within-Gibbs inference. Here, the data comprises measurement of a vector-valued observable, that we model using a high-dimensional Gaussian Process (GP), such that, likelihood of GP parameters given the data, is Matrix-Normal, defined by a mean matrix and between-rows and between-columns covariance matrices. We marginalise over the between-row matrices, to achieve a closed-form likelihood of the between-columns correlation matrix, given the data. This correlation matrix is updated in the first block of an iteration, given the data, and the (generalised Binomial) graph is updated in the second block, at the partial correlation matrix that is computed given the updated correlation. We also learn the 95% Highest Probability Density credible regions of the correlation matrix as well as the graphical model of the data. The difference in the acknowledgement of measurement errors in learning the graphical model, is demonstrated on a small simulated dataset, while the large human disease-symptom network--with > 8,000 nodes--is learnt using real data. Data on the vino-chemical attributes of Portugese red and white wine samples are employed to learn the correlation structure and graphical model of each dataset, to then compute the distance between the learnt graphical models.

READ FULL TEXT

page 13

page 28

page 35

research
02/03/2020

Soft Random Graphs in Probabilistic Metric Spaces Inter-graph Distance

We present a new method for learning Soft Random Geometric Graphs (SRGGs...
research
05/26/2019

Gaussian DAGs on network data

The traditional directed acyclic graph (DAG) model assumes data are gene...
research
11/22/2018

Learning in the Absence of Training Data -- a Galactic Application

There are multiple real-world problems in which training data is unavail...
research
08/11/2023

Characterizing Correlation Matrices that Admit a Clustered Factor Representation

The Clustered Factor (CF) model induces a block structure on the correla...
research
09/10/2020

Graphical Gaussian Process Models for Highly Multivariate Spatial Data

For multivariate spatial (Gaussian) process models, common cross-covaria...
research
08/30/2017

Graphical Lasso and Thresholding: Equivalence and Closed-form Solutions

Graphical Lasso (GL) is a popular method for learning the structure of a...

Please sign up or login with your details

Forgot password? Click here to reset