An R Package AZIAD for Analyzing Zero-Inflated and Zero-Altered Data

05/03/2022
by Niloufar Dousti Mousavi, et al.

Sparse data with a large proportion of zeros arise in many scientific disciplines. Modeling such data is challenging because the distribution is highly skewed. We adopt a bootstrapped Monte Carlo method to estimate the p-value of the Kolmogorov-Smirnov test, and bootstrapped likelihood ratio tests to select between zero-inflated and zero-altered (or hurdle) models. Our new package AZIAD provides functions to simulate zero-inflated or zero-altered data and to calculate maximum likelihood estimates of unknown parameters for a large class of discrete or continuous distributions. It also calculates the Fisher information matrix and confidence intervals for the unknown parameters. Compared with other R packages available so far, our package covers many more types of zero-inflated and zero-altered distributions, provides more accurate estimates of unknown parameters, and achieves higher power for model selection. To assist potential users, in this paper we provide theoretical justifications and detailed formulae for the functions in AZIAD and illustrate their use with executable R code and a real dataset.
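To make the idea of simulating zero-inflated data and recovering its parameters by maximum likelihood concrete, the following is a minimal sketch for the zero-inflated Poisson case only. It is written in plain Python rather than R and does not use or reproduce the AZIAD API: the function names `simulate_zip` and `fit_zip_em` are hypothetical, and the EM iteration is one standard way to reach the maximum likelihood estimate, not necessarily the algorithm the package uses.

```python
import math
import random


def simulate_zip(n, phi, lam, rng):
    """Draw n values from a zero-inflated Poisson model (hypothetical helper).

    Each value is a structural zero with probability phi,
    otherwise a Poisson(lam) draw.
    """
    def poisson(rate):
        # Knuth's multiplication method; adequate for small rates.
        limit, k, p = math.exp(-rate), 0, 1.0
        while True:
            p *= rng.random()
            if p <= limit:
                return k
            k += 1

    return [0 if rng.random() < phi else poisson(lam) for _ in range(n)]


def fit_zip_em(data, iters=500, tol=1e-10):
    """Maximum likelihood estimates (phi, lam) via EM (assumed approach)."""
    n = len(data)
    n0 = sum(1 for y in data if y == 0)   # observed zeros
    total = sum(data)                     # sum of all counts
    phi, lam = 0.5 * n0 / n, max(total / n, 1e-8)
    for _ in range(iters):
        # E-step: probability that an observed zero is a structural zero.
        denom = phi + (1.0 - phi) * math.exp(-lam)
        z = phi / denom if denom > 0 else 0.0
        # M-step: update the mixing weight and the Poisson mean.
        new_phi = n0 * z / n
        new_lam = total / (n - n0 * z)
        converged = abs(new_phi - phi) < tol and abs(new_lam - lam) < tol
        phi, lam = new_phi, new_lam
        if converged:
            break
    return phi, lam


if __name__ == "__main__":
    rng = random.Random(0)
    sample = simulate_zip(5000, phi=0.3, lam=2.5, rng=rng)
    print(fit_zip_em(sample))
```

With a few thousand observations the estimates should land close to the true values used in the simulation. At convergence the fitted probability of zero, phi + (1 - phi) * exp(-lam), matches the observed proportion of zeros, which is a quick sanity check on any zero-inflated Poisson fit.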


