Modeling Sparse Data Using MLE with Applications to Microbiome Data

12/27/2021
by Hani Aldirawi, et al.

Modeling sparse data such as microbiome and transcriptomics (RNA-seq) data is challenging because of the excess of zeros and the skewness of the distribution. Many probabilistic models have been used for sparse data, including Poisson, negative binomial, zero-inflated Poisson, and zero-inflated negative binomial models. One way to identify the most appropriate model among zero-inflated and hurdle candidates is to compare the p-values of the Kolmogorov-Smirnov (KS) test. The main challenge in identifying the model is that its parameters are typically unknown in practice. This paper derives the maximum likelihood estimator (MLE) for a general class of zero-inflated and hurdle models. We also derive the corresponding Fisher information matrices to explore the estimator's asymptotic properties. We include new probabilistic models such as zero-inflated beta binomial and zero-inflated beta negative binomial models. Our application to microbiome data shows that the new models are more appropriate for microbiome data than the models commonly used in the literature.
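To make the idea of an MLE for a zero-inflated model concrete, here is a minimal sketch for the simplest special case only, the zero-inflated Poisson. It is not the paper's general derivation. The function names `fit_zip_mle` and `zip_loglik` are hypothetical, and the parameterization (zero-inflation weight `phi`, Poisson rate `lam`) is an assumption made for illustration. The sketch relies on the standard fact that, when the data are zero-inflated, the likelihood equations reduce to matching the mean of the positive counts to the zero-truncated Poisson mean, which can be solved by bisection.

```python
import math


def zip_loglik(counts, phi, lam):
    """Log-likelihood of a zero-inflated Poisson (assumed parameterization):
    P(Y=0) = phi + (1-phi)*exp(-lam); P(Y=k) = (1-phi)*Poisson(k; lam), k >= 1.
    """
    total = 0.0
    for y in counts:
        if y == 0:
            total += math.log(phi + (1.0 - phi) * math.exp(-lam))
        else:
            total += (math.log(1.0 - phi) - lam + y * math.log(lam)
                      - math.lgamma(y + 1))
    return total


def fit_zip_mle(counts, tol=1e-12):
    """Return (phi, lam) maximizing zip_loglik subject to 0 <= phi < 1."""
    n = len(counts)
    positives = [y for y in counts if y > 0]
    if not positives:
        raise ValueError("need at least one positive count")
    m = sum(positives) / len(positives)  # mean of the positive counts
    if m <= 1.0:
        raise ValueError("all positive counts equal 1; lam is on the boundary")

    # Solve lam / (1 - exp(-lam)) = m by bisection; the left side is
    # increasing in lam, is close to 1 near 0, and exceeds m at lam = m.
    lo, hi = 1e-12, m
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if mid / -math.expm1(-mid) < m:
            lo = mid
        else:
            hi = mid
    lam = 0.5 * (lo + hi)

    # Match the observed share of positive counts: (1-phi)*(1-exp(-lam)).
    phi = 1.0 - (len(positives) / n) / -math.expm1(-lam)
    if phi < 0.0:
        # Fewer zeros than a plain Poisson predicts: fall back to phi = 0,
        # where the MLE of lam is the overall sample mean.
        return 0.0, sum(counts) / n
    return phi, lam
```

A typical call would be `phi, lam = fit_zip_mle(counts)` on a list of non-negative integer counts. The fitted pair could then be plugged into a goodness-of-fit comparison such as the KS test mentioned above, although that step is not shown here.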


research
12/12/2017

Zero-Modified Poisson-Lindley distribution with applications in zero-inflated and zero-deflated count data

The main object of this article is to present an extension of the zero-i...
research
05/03/2022

An R Package AZIAD for Analyzing Zero-Inflated and Zero-Altered Data

Sparse data with a large portion of zeros arise in many scientific disci...
research
11/03/2021

Inference of Microbial Interactions Using Copula Models with Mixture Margins

Quantification of microbial interactions from 16S rRNA and meta-genomic ...
research
05/09/2022

Asymptotic comparison of identifying constraints for Bradley-Terry models

The Bradley-Terry model is widely used for pairwise comparison data anal...
research
10/30/2019

Software defect prediction with zero-inflated Poisson models

In this work we apply several Poisson and zero-inflated models for softw...
research
03/30/2020

Exponential Dispersion Models for Overdispersed Zero-Inflated Count Data

We consider three new classes of exponential dispersion models of discre...
research
08/27/2022

Generally-Altered, -Inflated, -Truncated and -Deflated Regression, With Application to Heaped and Seeped Data

Models such as the zero-inflated and zero-altered Poisson and zero-trunc...
