A Review of Multivariate Distributions for Count Data Derived from the Poisson Distribution

08/31/2016
by David I. Inouye, et al.

The Poisson distribution has been widely studied and used for modeling univariate count-valued data. Multivariate generalizations of the Poisson distribution that permit dependencies, however, have been far less popular. Yet, real-world high-dimensional count-valued data found in word counts, genomics, and crime statistics, for example, exhibit rich dependencies, and motivate the need for multivariate distributions that can appropriately model this data. We review multivariate distributions derived from the univariate Poisson, categorizing these models into three main classes: 1) where the marginal distributions are Poisson, 2) where the joint distribution is a mixture of independent multivariate Poisson distributions, and 3) where the node-conditional distributions are derived from the Poisson. We discuss the development of multiple instances of these classes and compare the models in terms of interpretability and theory. Then, we empirically compare multiple models from each class on three real-world datasets that have varying data characteristics from different domains, namely traffic accident data, biological next generation sequencing data, and text data. These empirical experiments develop intuition about the comparative advantages and disadvantages of each class of multivariate distribution that was derived from the Poisson. Finally, we suggest new research directions as explored in the subsequent discussion section.
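To make the first two classes concrete, here is a minimal sketch in standard-library Python. It is an illustration under stated assumptions, not the paper's implementation: the function names, the rate values, and the choice of a shared gamma multiplier as the mixing distribution are all hypothetical. Class 1 is shown through the well-known common-shock construction, in which each count is the sum of a private Poisson variable and a shared one, so the marginals stay Poisson and the covariance equals the shared rate. Class 2 is shown by drawing a random rate multiplier and then sampling independent Poissons given that multiplier. The third class (node-conditional models) is not sketched here.

```python
import math
import random


def sample_poisson(rate, rng):
    """Draw one Poisson(rate) variate with Knuth's multiplication method.

    Suitable only for small rates, since exp(-rate) underflows for large ones.
    """
    threshold = math.exp(-rate)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def sample_common_shock(rate1, rate2, shared_rate, rng):
    """Class 1 sketch: bivariate Poisson with Poisson marginals.

    x1 ~ Poisson(rate1 + shared_rate), x2 ~ Poisson(rate2 + shared_rate),
    and Cov(x1, x2) = shared_rate, so only non-negative dependence is possible.
    """
    shared = sample_poisson(shared_rate, rng)
    return sample_poisson(rate1, rng) + shared, sample_poisson(rate2, rng) + shared


def sample_gamma_mixture(rates, shape, rng):
    """Class 2 sketch: independent Poissons given a shared random multiplier.

    The gamma multiplier (mean 1) is an assumed mixing distribution chosen
    for illustration; it induces overdispersion and positive dependence.
    """
    multiplier = rng.gammavariate(shape, 1.0 / shape)
    return tuple(sample_poisson(multiplier * r, rng) for r in rates)


def mean(values):
    return sum(values) / len(values)


def covariance(xs, ys):
    mx, my = mean(xs), mean(ys)
    return mean([(x - mx) * (y - my) for x, y in zip(xs, ys)])


if __name__ == "__main__":
    rng = random.Random(0)
    pairs = [sample_common_shock(2.0, 3.0, 1.5, rng) for _ in range(20000)]
    xs, ys = zip(*pairs)
    print("common shock: mean x1 =", mean(xs), "cov =", covariance(xs, ys))

    mixed = [sample_gamma_mixture((3.0, 4.0), 2.0, rng) for _ in range(20000)]
    ms, _ = zip(*mixed)
    print("gamma mixture: mean =", mean(ms), "variance =", covariance(ms, ms))
```

In the common-shock sample, the empirical covariance should sit near the shared rate, while each margin keeps variance close to its mean. In the mixture sample, the marginal variance should exceed the mean, which is the overdispersion that a pure Poisson margin cannot express.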

research
04/15/2020

A parsimonious family of multivariate Poisson-lognormal distributions for clustering multivariate count data

Multivariate count data are commonly encountered through high-throughput...
research
03/11/2016

Square Root Graphical Models: Multivariate Generalizations of Univariate Exponential Families that Permit Positive Dependencies

We develop Square Root Graphical Models (SQR), a novel class of parametr...
research
01/03/2023

Multivariate Generalized Linear Mixed Models for Count Data

Univariate regression models have rich literature for counting data. How...
research
02/06/2018

Splitting models for multivariate count data

Considering discrete models, the univariate framework has been studied i...
research
01/10/2020

Review of Probability Distributions for Modeling Count Data

Count data take on non-negative integer values and are challenging to pr...
research
07/23/2018

Modeling event cascades using networks of additive count sequences

We propose a statistical model for networks of event count sequences bui...
research
05/30/2023

Control Charts for Poisson Counts based on the Stein-Chen Identity

If monitoring Poisson count data for a possible mean shift (while the Po...
