Latent Network Estimation and Variable Selection for Compositional Data via Variational EM

10/25/2020
by   Nathan Osborne, et al.

Network estimation and variable selection have been extensively studied in the statistical literature, but only recently have these two challenges been addressed simultaneously. In this paper, we develop a novel method to simultaneously estimate network interactions and associations with relevant covariates for count data, and specifically for compositional data, which are subject to a fixed-sum constraint. We use a hierarchical Bayesian model with latent layers and employ spike-and-slab priors for both edge and covariate selection. For posterior inference, we develop a variational inference scheme with an expectation-maximization step to enable efficient estimation. Through simulation studies, we demonstrate that the proposed model outperforms existing methods in the accuracy of network recovery. We show the practical utility of our model via an application to microbiome data. The human microbiome has been shown to contribute to many functions of the human body and to be linked with a number of diseases. In our application, we seek to better understand the associations between microbes and relevant covariates, as well as the interactions of microbes with each other. We provide a Python implementation of our algorithm, called SINC (Simultaneous Inference for Networks and Covariates), available online.
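
To make two of the terms above concrete, the following is a minimal illustrative sketch and not the SINC implementation. It shows what the fixed-sum constraint of compositional data looks like on simulated counts, and what a draw from a spike-and-slab prior looks like. The function names `closure` and `sample_spike_and_slab`, the point-mass form of the spike, and all numeric settings are assumptions chosen for illustration; the paper's actual prior and model may differ.

```python
import numpy as np


def closure(counts):
    """Convert a samples-by-taxa count matrix to proportions (each row sums to 1)."""
    counts = np.asarray(counts, dtype=float)
    return counts / counts.sum(axis=1, keepdims=True)


def sample_spike_and_slab(size, inclusion_prob, slab_sd, rng):
    """Draw coefficients that are exactly 0 (spike) or Normal(0, slab_sd^2) (slab).

    A point-mass spike is assumed here purely for illustration.
    """
    included = rng.random(size) < inclusion_prob
    slab_values = rng.normal(0.0, slab_sd, size=size)
    return np.where(included, slab_values, 0.0), included


rng = np.random.default_rng(0)

# Simulated counts for 5 samples and 4 taxa; adding 1 avoids all-zero rows.
counts = rng.poisson(lam=20.0, size=(5, 4)) + 1
compositions = closure(counts)

# Hypothetical sparse taxa-by-covariate coefficient matrix (4 taxa, 3 covariates).
coefs, included = sample_spike_and_slab((4, 3), inclusion_prob=0.2, slab_sd=1.0, rng=rng)

print(compositions.sum(axis=1))  # every row sums to 1: the fixed-sum constraint
print(coefs)                     # most entries are exactly 0
```

Because each row of `compositions` sums to one, the components cannot vary independently, which is why methods for unconstrained count data do not transfer directly. The zero entries of `coefs` correspond to covariates (or, analogously, edges) excluded by the prior.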


research
11/21/2022

A Variational Inference method for Bayesian variable selection

Variable selection is a classic problem in statistics. In this paper, we...
research
04/30/2020

A Bayesian model of microbiome data for simultaneous identification of covariate associations and prediction of phenotypic outcomes

One of the major research questions regarding human microbiome studies i...
research
02/23/2021

Identifying Gene-environment interactions with robust marginal Bayesian variable selection

In high-throughput genetics studies, an important aim is to identify gen...
research
09/05/2021

Statistical computation methods for microbiome compositional data network inference

Microbes can affect processes from food production to human health. Such...
research
07/11/2022

Sparse Dynamic Factor Models with Loading Selection by Variational Inference

In this paper we develop a novel approach for estimating large and spars...
research
09/17/2022

Bayesian Image-on-Scalar Regression with a Spatial Global-Local Spike-and-Slab Prior

In this article, we propose a novel spatial global-local spike-and-slab ...
research
06/16/2021

Semiparametric count data regression for self-reported mental health

"For how many days during the past 30 days was your mental health not go...
