Exponential Dispersion Models for Overdispersed Zero-Inflated Count Data

03/30/2020
by Shaul K. Bar-Lev, et al.

We consider three new classes of exponential dispersion models of discrete probability distributions, which are defined by specifying their variance functions in their mean value parameterization. In a previous paper (Bar-Lev and Ridder, 2020a), we developed the framework of these classes and proved that they have some desirable properties. Each of these classes was shown to be overdispersed and zero-inflated in ascending order, making them competitive alternatives to the models currently in use in statistical modeling. In this paper we elaborate on the computational aspects of their probability mass functions. Furthermore, we apply these classes to fit real data sets that exhibit overdispersion and zero inflation. Classic models based on the Poisson or negative binomial distributions fit such data poorly, and therefore many alternatives have been proposed in recent years. We carry out an extensive comparison with these other proposals, from which we conclude that our framework is a flexible tool that gives excellent results in all cases. Moreover, in most cases our model gives the best fit.
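
To make the two properties named above concrete, the following is a minimal sketch of how one might check a count sample for overdispersion and zero inflation relative to a Poisson baseline. It is not the authors' method: the function names, the sample data, and the choice of diagnostics (the variance-to-mean ratio and the index 1 + log(p0)/mean, both of which take the reference values 1 and 0 under a Poisson distribution) are assumptions made for illustration only.

```python
import math


def dispersion_index(counts):
    """Sample variance divided by sample mean (equals 1 in expectation for Poisson)."""
    n = len(counts)
    mean = sum(counts) / n
    var = sum((x - mean) ** 2 for x in counts) / (n - 1)
    return var / mean


def zero_inflation_index(counts):
    """1 + log(p0) / mean, where p0 is the observed share of zeros.

    For a Poisson distribution p0 = exp(-mean), so the index is 0;
    positive values suggest more zeros than Poisson would predict.
    """
    n = len(counts)
    mean = sum(counts) / n
    p0 = sum(1 for x in counts if x == 0) / n
    if p0 == 0:
        raise ValueError("no zeros observed; index is undefined")
    return 1 + math.log(p0) / mean


# Hypothetical sample: many zeros plus a long right tail.
sample = [0] * 60 + [1] * 15 + [2] * 10 + [5] * 8 + [10] * 7

di = dispersion_index(sample)
zi = zero_inflation_index(sample)
print(f"dispersion index: {di:.2f}")
print(f"zero-inflation index: {zi:.2f}")
```

A dispersion index well above 1 together with a positive zero-inflation index is the situation in which Poisson-based fits tend to be poor and in which the models discussed here are intended to apply.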

research
03/30/2020

New exponential dispersion models for count data – properties and applications

In their fundamental paper on cubic variance functions (VFs), Letac and ...
research
11/09/2020

Characterizations of non-normalized discrete probability distributions and their application in statistics

From the distributional characterizations that lie at the heart of Stein...
research
10/30/2019

Software defect prediction with zero-inflated Poisson models

In this work we apply several Poisson and zero-inflated models for softw...
research
12/27/2021

Modeling Sparse Data Using MLE with Applications to Microbiome Data

Modeling sparse data such as microbiome and transcriptomics (RNA-seq) da...
research
11/26/2018

Tweedie Gradient Boosting for Extremely Unbalanced Zero-inflated Data

Tweedie's compound Poisson model is a popular method to model insurance ...
research
12/06/2017

Fitting a Hurdle Generalized Lambda Distribution to healthcare expenses

In order to fit a model to healthcare expenses data, it is necessary to ...
research
06/01/2016

Generalizing and Hybridizing Count-based and Neural Language Models

Language models (LMs) are statistical models that calculate probabilitie...
