Zero-inflated Poisson Factor Model with Application to Microbiome Absolute Abundance Data

10/26/2019
by Tianchen Xu, et al.

Dimension reduction of high-dimensional microbiome data facilitates subsequent analyses such as regression and clustering. Most existing reduction methods cannot fully accommodate the special features of the data, such as count-valued observations and excess zero reads. In this article, we propose a zero-inflated Poisson factor analysis (ZIPFA) model. The model assumes that microbiome absolute abundance data follow zero-inflated Poisson distributions, with library size as an offset and with Poisson rates negatively related to the probability of inflated zeros. The latent parameters of the model form a low-rank matrix consisting of interpretable loadings and low-dimensional scores, which can be used for further analyses. We develop an efficient and robust expectation-maximization (EM) algorithm for parameter estimation. We demonstrate the efficacy of the proposed method in comprehensive simulation studies. An application to the Oral Infections, Glucose Intolerance and Insulin Resistance Study (ORIGINS) provides valuable insights into the relation between the subgingival microbiome and periodontal disease.
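
To make the data-generating assumptions above concrete, the following is a minimal simulation sketch, not the authors' implementation. The abstract states only that the Poisson rates carry a library-size offset, that the latent parameters form a low-rank matrix, and that the rates are negatively related to the excess zeros. Everything else here is an assumption made for illustration: the function name `simulate_zip_factor`, the log link for the rate, the logistic link `p = sigmoid(-tau * log(rate))` with a hypothetical parameter `tau`, and all numeric settings.

```python
import numpy as np


def simulate_zip_factor(n=50, m=30, rank=3, tau=1.0, seed=0):
    """Simulate counts from a zero-inflated Poisson model with low-rank log rates.

    All names and link functions are illustrative assumptions, not taken
    from the paper.
    """
    rng = np.random.default_rng(seed)

    # Low-rank latent structure: sample scores (n x rank), taxon loadings (m x rank).
    scores = rng.normal(scale=0.5, size=(n, rank))
    loadings = rng.normal(scale=0.5, size=(m, rank))

    # Per-sample library size, entering the log rate as an offset.
    library_size = rng.integers(50, 200, size=(n, 1))

    # Assumed log link: log(rate) = log(library size) - log(m) + low-rank term.
    # The -log(m) term is an arbitrary baseline spreading reads across taxa.
    log_rate = np.log(library_size) - np.log(m) + scores @ loadings.T
    rate = np.exp(log_rate)

    # Assumed logistic link: a higher rate gives a lower excess-zero probability.
    zero_prob = 1.0 / (1.0 + np.exp(tau * log_rate))

    # Draw structural zeros, then Poisson counts everywhere else.
    structural_zero = rng.random((n, m)) < zero_prob
    counts = np.where(structural_zero, 0, rng.poisson(rate))
    return counts, rate, zero_prob, structural_zero


counts, rate, zero_prob, structural_zero = simulate_zip_factor()
print("share of zeros:", (counts == 0).mean())
```

Under these assumptions, entries with small Poisson rates are zero both because the Poisson draw is often zero and because the structural-zero probability is high, which is the negative relationship the model description refers to.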


