Learning Gene Regulatory Networks with High-Dimensional Heterogeneous Data

05/07/2018
by Bochao Jia, et al.
The Gaussian graphical model is a widely used tool for learning gene regulatory networks from high-dimensional gene expression data. Most existing methods for Gaussian graphical models assume that the data are homogeneous, i.e., that all samples are drawn from a single Gaussian distribution. In many real problems, however, the data are heterogeneous: they may contain subgroups or come from different sources. This paper proposes to model heterogeneous data with a mixture Gaussian graphical model and to apply the imputation-consistency algorithm, combined with the ψ-learning algorithm, to estimate the parameters of the mixture model and cluster the samples into subgroups. An integrated Gaussian graphical network is learned across the subgroups during the iterations of the imputation-consistency algorithm. The proposed method is compared with an existing method for learning mixture Gaussian graphical models and with several methods developed for homogeneous data, such as graphical Lasso, nodewise regression and ψ-learning. The numerical results indicate that the proposed method is superior in parameter estimation, cluster identification and network construction. They also indicate its generality: it can be applied to homogeneous data without significant harm.
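
To make the alternation described in the abstract concrete, the following is a minimal Python sketch of an impute-then-estimate loop for a Gaussian mixture, followed by a partial-correlation matrix of the kind a Gaussian graphical model is read from. It is not the authors' implementation. The function names (`impute_then_estimate`, `partial_correlations`), the `ridge` parameter, and the initialization are all assumptions made for illustration, and a ridge-regularized sample covariance stands in for the ψ-learning step, which the abstract does not specify in detail. The sketch also assumes at least two variables and more samples than would be realistic for truly high-dimensional data.

```python
import numpy as np

def log_gaussian(X, mean, cov):
    """Row-wise log density of X under N(mean, cov)."""
    d = X.shape[1]
    diff = X - mean
    _, logdet = np.linalg.slogdet(cov)
    maha = np.sum(diff * np.linalg.solve(cov, diff.T).T, axis=1)
    return -0.5 * (d * np.log(2.0 * np.pi) + logdet + maha)

def impute_then_estimate(X, k, n_iter=30, ridge=0.1, seed=0):
    """Alternate between imputing cluster labels and re-estimating parameters.

    Illustrative stand-in only: the covariance estimate is a ridge-regularized
    sample covariance, not the psi-learning estimator named in the abstract.
    """
    X = np.asarray(X, dtype=float)
    rng = np.random.default_rng(seed)
    n, d = X.shape
    # Assumed initialization: k random samples as means, identity covariances.
    means = X[rng.choice(n, size=k, replace=False)].copy()
    covs = np.stack([np.eye(d)] * k)
    weights = np.full(k, 1.0 / k)
    labels = np.zeros(n, dtype=int)
    for _ in range(n_iter):
        # Imputation step: draw each label from its current posterior.
        logp = np.stack(
            [np.log(weights[j]) + log_gaussian(X, means[j], covs[j]) for j in range(k)],
            axis=1,
        )
        logp -= logp.max(axis=1, keepdims=True)  # stabilize before exponentiating
        post = np.exp(logp)
        post /= post.sum(axis=1, keepdims=True)
        labels = np.array([rng.choice(k, p=row) for row in post])
        # Estimation step: refit each component on its imputed members.
        for j in range(k):
            members = X[labels == j]
            if len(members) < 2:
                continue  # keep the previous parameters for a near-empty cluster
            means[j] = members.mean(axis=0)
            covs[j] = np.cov(members, rowvar=False) + ridge * np.eye(d)
        counts = np.bincount(labels, minlength=k)
        weights = (counts + 1.0) / (n + k)  # smoothed so no weight is exactly zero
    return labels, weights, means, covs

def partial_correlations(cov):
    """Partial correlations rho_ij = -omega_ij / sqrt(omega_ii * omega_jj)."""
    prec = np.linalg.inv(cov)
    scale = np.sqrt(np.diag(prec))
    pcor = -prec / np.outer(scale, scale)
    np.fill_diagonal(pcor, 1.0)
    return pcor
```

Calling `impute_then_estimate(X, k=2)` on an `n × d` expression matrix returns imputed labels and per-cluster parameters; thresholding the off-diagonal entries of `partial_correlations(covs[j])` would then give one candidate edge set per subgroup. How the subgroup networks are integrated into a single network is left out here, since the abstract does not describe that step.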

research
12/31/2015

Nonparametric mixture of Gaussian graphical models

Graphical model has been widely used to investigate the complex dependen...
research
12/08/2022

A Double Regression Method for Graphical Modeling of High-dimensional Nonlinear and Non-Gaussian Data

Graphical models have long been studied in statistics as a tool for infe...
research
10/28/2015

Robust Gaussian Graphical Modeling with the Trimmed Graphical Lasso

Gaussian Graphical Models (GGMs) are popular tools for studying network ...
research
03/21/2013

Node-Based Learning of Multiple Gaussian Graphical Models

We consider the problem of estimating high-dimensional Gaussian graphica...
research
06/19/2020

Mixture of Conditional Gaussian Graphical Models for unlabelled heterogeneous populations in the presence of co-factors

Conditional correlation networks, within Gaussian Graphical Models (GGM)...
research
04/29/2020

Autoregressive Identification of Kronecker Graphical Models

We address the problem to estimate a Kronecker graphical model correspon...
research
12/07/2022

Network Analysis of Count Data from Mixed Populations

In applications such as gene regulatory network analysis based on single...
