Node-Based Learning of Multiple Gaussian Graphical Models

by   Karthik Mohan, et al.

We consider the problem of estimating high-dimensional Gaussian graphical models corresponding to a single set of variables under several distinct conditions. This problem is motivated by the task of recovering transcriptional regulatory networks on the basis of gene expression data containing heterogeneous samples, such as different disease states, multiple species, or different developmental stages. We assume that most aspects of the conditional dependence networks are shared, but that there are some structured differences between them. Rather than assuming that similarities and differences between networks are driven by individual edges, we take a node-based approach, which in many cases provides a more intuitive interpretation of the network differences. We consider estimation under two distinct assumptions: (1) differences between the K networks are due to individual nodes that are perturbed across conditions, or (2) similarities among the K networks are due to the presence of common hub nodes that are shared across all K networks. Using a row-column overlap norm penalty function, we formulate two convex optimization problems that correspond to these two assumptions. We solve these problems using an alternating direction method of multipliers algorithm, and we derive a set of necessary and sufficient conditions that allows us to decompose the problem into independent subproblems so that our algorithm can be scaled to high-dimensional settings. Our proposal is illustrated on synthetic data, a webpage data set, and a brain cancer gene expression data set.


Learning Graphical Models With Hubs

We consider the problem of learning a high-dimensional graphical model i...

High-Dimensional Joint Estimation of Multiple Directed Gaussian Graphical Models

We consider the problem of jointly estimating multiple related directed ...

Learning Gene Regulatory Networks with High-Dimensional Heterogeneous Data

The Gaussian graphical model is a widely used tool for learning gene reg...

Learning Multiple Gene Regulatory Networks in Type 1 Diabetes through a Fast Bayesian Integrative Method

Accurate inference of Gene Regulatory Networks (GRNs) is pivotal to gain...

Tree-based Node Aggregation in Sparse Graphical Models

High-dimensional graphical models are often estimated using regularizati...

Searching for a source of difference in Gaussian graphical models

In this work, we look at a two-sample problem within the framework of Ga...

CoDiNA: an RPackage for Co-expression Differential Network Analysis in n Dimensions

Biological and Medical science is increasingly acknowledging the use of ...

Please sign up or login with your details

Forgot password? Click here to reset