Flexible Modeling of Hurdle Conway-Maxwell-Poisson Distributions with Application to Mining Injuries

by   Shuang Yin, et al.

While the hurdle Poisson regression is a popular class of models for count data with excessive zeros, the link function in the binary component may be unsuitable for highly imbalanced cases. Ordinary Poisson regression is unable to handle the presence of dispersion. In this paper, we introduce Conway-Maxwell-Poisson (CMP) distribution and integrate use of flexible skewed Weibull link functions as better alternative. We take a fully Bayesian approach to draw inference from the underlying models to better explain skewness and quantify dispersion, with Deviance Information Criteria (DIC) used for model selection. For empirical investigation, we analyze mining injury data for period 2013-2016 from the U.S. Mine Safety and Health Administration (MSHA). The risk factors describing proportions of employee hours spent in each type of mining work are compositional data; the probabilistic principal components analysis (PPCA) is deployed to deal with such covariates. The hurdle CMP regression is additionally adjusted for exposure, measured by the total employee working hours, to make inference on rate of mining injuries; we tested its competitiveness against other models. This can be used as predictive model in the mining workplace to identify features that increase the risk of injuries so that prevention can be implemented.


Skewed link regression models for imbalanced binary response with applications to life insurance

For a portfolio of life insurance policies observed for a stated period ...

Bayesian inference, model selection and likelihood estimation using fast rejection sampling: the Conway-Maxwell-Poisson distribution

Bayesian inference for models with intractable likelihood functions repr...

Bayesian Modeling of Nonlinear Poisson Regression with Artificial Neural Networks

Being in the era of big data, modeling and prediction of count data have...

Transition Models for Count Data: a Flexible Alternative to Fixed Distribution Models

A flexible semiparametric class of models is introduced that offers an a...

Bayesian CART models for insurance claims frequency

Accuracy and interpretability of a (non-life) insurance pricing model ar...

Infinitely imbalanced binomial regression and deformed exponential families

The logistic regression model is known to converge to a Poisson point pr...

Control Charts for Poisson Counts based on the Stein-Chen Identity

If monitoring Poisson count data for a possible mean shift (while the Po...

Please sign up or login with your details

Forgot password? Click here to reset