Variational inference for sparse network reconstruction from count data

06/08/2018
by   Julien Chiquet, et al.
0

In multivariate statistics, the question of finding direct interactions can be formulated as a problem of network inference - or network reconstruction - for which the Gaussian graphical model (GGM) provides a canonical framework. Unfortunately, the Gaussian assumption does not apply to count data which are encountered in domains such as genomics, social sciences or ecology. To circumvent this limitation, state-of-the-art approaches use two-step strategies that first transform counts to pseudo Gaussian observations and then apply a (partial) correlation-based approach from the abundant literature of GGM inference. We adopt a different stance by relying on a latent model where we directly model counts by means of Poisson distributions that are conditional to latent (hidden) Gaussian correlated variables. In this multivariate Poisson lognormal-model, the dependency structure is completely captured by the latent layer. This parametric model enables to account for the effects of covariates on the counts. To perform network inference, we add a sparsity inducing constraint on the inverse covariance matrix of the latent Gaussian vector. Unlike the usual Gaussian setting, the penalized likelihood is generally not tractable, and we resort instead to a variational approach for approximate likelihood maximization. The corresponding optimization problem is solved by alternating a gradient ascent on the variational parameters and a graphical-Lasso step on the covariance matrix. We show that our approach is highly competitive with the existing methods on simulation inspired from microbiological data. We then illustrate on three various data sets how accounting for sampling efforts via offsets and integrating external covariates (which is mostly never done in the existing literature) drastically changes the topology of the inferred network.

READ FULL TEXT
research
08/21/2022

A Graphical Model for Fusing Diverse Microbiome Data

This paper develops a Bayesian graphical model for fusing disparate type...
research
04/15/2020

A parsimonious family of multivariate Poisson-lognormal distributions for clustering multivariate count data

Multivariate count data are commonly encountered through high-throughput...
research
05/07/2019

Tree-based Reconstruction of Ecological Network from Abundance Data

The behavior of ecological systems mainly relies on the interactions bet...
research
06/05/2020

Bayesian Sparse Covariance Structure Analysis for Correlated Count Data

In this paper, we propose a Bayesian Graphical LASSO for correlated coun...
research
11/07/2021

Gene regulatory network in single cells based on the Poisson log-normal model

Gene regulatory network inference is crucial for understanding the compl...
research
12/16/2013

Parametric Modelling of Multivariate Count Data Using Probabilistic Graphical Models

Multivariate count data are defined as the number of items of different ...
research
06/30/2020

A spatial Poisson hurdle model with application to wildfires

Modelling wildfire occurrences is important for disaster management incl...

Please sign up or login with your details

Forgot password? Click here to reset