Flexible Modeling of Hurdle Conway-Maxwell-Poisson Distributions with Application to Mining Injuries

by   Shuang Yin, et al.

While the hurdle Poisson regression is a popular class of models for count data with excessive zeros, the link function in the binary component may be unsuitable for highly imbalanced cases. Ordinary Poisson regression is unable to handle the presence of dispersion. In this paper, we introduce Conway-Maxwell-Poisson (CMP) distribution and integrate use of flexible skewed Weibull link functions as better alternative. We take a fully Bayesian approach to draw inference from the underlying models to better explain skewness and quantify dispersion, with Deviance Information Criteria (DIC) used for model selection. For empirical investigation, we analyze mining injury data for period 2013-2016 from the U.S. Mine Safety and Health Administration (MSHA). The risk factors describing proportions of employee hours spent in each type of mining work are compositional data; the probabilistic principal components analysis (PPCA) is deployed to deal with such covariates. The hurdle CMP regression is additionally adjusted for exposure, measured by the total employee working hours, to make inference on rate of mining injuries; we tested its competitiveness against other models. This can be used as predictive model in the mining workplace to identify features that increase the risk of injuries so that prevention can be implemented.



There are no comments yet.


page 18


Skewed link regression models for imbalanced binary response with applications to life insurance

For a portfolio of life insurance policies observed for a stated period ...

Bayesian inference, model selection and likelihood estimation using fast rejection sampling: the Conway-Maxwell-Poisson distribution

Bayesian inference for models with intractable likelihood functions repr...

Bayesian Modeling of Nonlinear Poisson Regression with Artificial Neural Networks

Being in the era of big data, modeling and prediction of count data have...

Infinitely imbalanced binomial regression and deformed exponential families

The logistic regression model is known to converge to a Poisson point pr...

Variable subset selection via GA and information complexity in mixtures of Poisson and negative binomial regression models

Count data, for example the number of observed cases of a disease in a c...

A hierarchical model of non-homogeneous Poisson processes for Twitter retweets

We present a hierarchical model of non-homogeneous Poisson processes (NH...

Selection of link function in binary regression: A case-study with world happiness report on immigration

Selection of appropriate link function for binary regression remains an ...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.