Review of Probability Distributions for Modeling Count Data

01/10/2020
by F. William Townes, et al.

Count data take on non-negative integer values and are challenging to properly analyze using standard linear-Gaussian methods such as linear regression and principal components analysis. Generalized linear models enable direct modeling of counts in a regression context using distributions such as the Poisson and negative binomial. When counts contain only relative information, multinomial or Dirichlet-multinomial models can be more appropriate. We review some of the fundamental connections between multinomial and count models from probability theory, providing detailed proofs. These relationships are useful for methods development in applications such as topic modeling of text data and genomics.
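
One well-known connection of the kind the abstract refers to is that independent Poisson counts, conditioned on their total, follow a multinomial distribution whose probabilities are the normalized Poisson rates. The sketch below is not taken from the paper; it checks this identity numerically for a single outcome, using only the Python standard library. The rates, the counts, and all function names are hypothetical choices made for illustration.

```python
import math

def poisson_pmf(k, rate):
    """Probability that a Poisson(rate) variable equals k."""
    return math.exp(-rate) * rate**k / math.factorial(k)

def multinomial_pmf(counts, probs):
    """Probability of `counts` under Multinomial(sum(counts), probs)."""
    coef = math.factorial(sum(counts))
    for c in counts:
        coef //= math.factorial(c)  # exact integer division at every step
    p = float(coef)
    for c, q in zip(counts, probs):
        p *= q**c
    return p

def conditional_poisson_pmf(counts, rates):
    """P(X_1 = c_1, ..., X_k = c_k | sum of X_i = n) for independent Poissons.

    The sum of independent Poissons is Poisson with the summed rate,
    so the conditional probability is the joint pmf divided by that pmf.
    """
    joint = math.prod(poisson_pmf(c, r) for c, r in zip(counts, rates))
    total = poisson_pmf(sum(counts), sum(rates))
    return joint / total

# Hypothetical rates and counts, chosen only to illustrate the identity.
rates = [2.0, 0.5, 1.5]
probs = [r / sum(rates) for r in rates]  # normalized rates
counts = [3, 1, 2]

lhs = conditional_poisson_pmf(counts, rates)
rhs = multinomial_pmf(counts, probs)
print(lhs, rhs)  # the two values should agree up to floating-point error
```

Because the conditional distribution depends on the rates only through their proportions, this identity is one way to see why multinomial models suit counts that carry only relative information.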


research
10/14/2019

Bayesian generalized linear model for over and under dispersed counts

Bayesian models that can handle both over and under dispersed counts are...
research
07/23/2018

Modeling event cascades using networks of additive count sequences

We propose a statistical model for networks of event count sequences bui...
research
05/28/2023

Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling

Learning to denoise has emerged as a prominent paradigm to design state-...
research
05/20/2015

Variable subset selection via GA and information complexity in mixtures of Poisson and negative binomial regression models

Count data, for example the number of observed cases of a disease in a c...
research
08/03/2023

Telematics Combined Actuarial Neural Networks for Cross-Sectional and Longitudinal Claim Count Data

We present novel cross-sectional and longitudinal claim count models for...
research
08/31/2016

A Review of Multivariate Distributions for Count Data Derived from the Poisson Distribution

The Poisson distribution has been widely studied and used for modeling u...
research
12/22/2017

Modeling Spatial Overdispersion with the Generalized Waring Process

Modeling spatial overdispersion requires point processes models with fin...
