Bayesian Nonparametric Mixed Effects Models in Microbiome Data Analysis

11/03/2017
by   Boyu Ren, et al.
0

Detecting associations between microbial composition and sample characteristics is one of the most important tasks in microbiome studies. Most of the existing methods apply univariate models to single microbial species separately, with adjustments for multiple hypothesis testing. We propose a Bayesian nonparametric analysis for a generalized mixed effects linear model tailored to this application. The marginal prior on each microbial composition is a Dirichlet Processes, and dependence across compositions is induced through a linear combination of individual covariates, such as disease biomarkers or the subject's age, and latent factors. The latent factors capture residual variability and their dimensionality is learned from the data in a fully Bayesian procedure. We propose an efficient algorithm to sample from the posterior and visualizations of model parameters which reveal associations between covariates and microbial composition. The proposed model is validated in simulation studies and then applied to analyze a microbiome dataset for infants with Type I diabetes.

READ FULL TEXT

page 9

page 19

page 21

page 22

page 24

page 26

page 28

research
12/19/2017

High dimensional Single Index Bayesian Modeling of the Brain Atrophy over time

We study the effects of gender, APOE genes, age, genetic variation and A...
research
10/26/2017

Bayesian Nonparametric Models for Biomedical Data Analysis

In this dissertation, we develop nonparametric Bayesian models for biome...
research
08/13/2018

A Nonparametric Bayesian Method for Clustering of High-Dimensional Mixed Dataset

Motivation: Advances in next-generation sequencing (NGS) methods have en...
research
04/30/2020

A Bayesian model of microbiome data for simultaneous identification of covariate associations and prediction of phenotypic outcomes

One of the major research questions regarding human microbiome studies i...
research
05/05/2021

A Bayesian latent allocation model for clustering compositional data with application to the Great Barrier Reef

Relative abundance is a common metric to estimate the composition of spe...
research
04/01/2022

A Class of Semiparametric Models with Homogeneous Structure for Panel Data Analysis

Stimulated by the analysis of a dataset from China about Covid-19, we pr...

Please sign up or login with your details

Forgot password? Click here to reset