Dealing with overdispersion in multivariate count data

07/01/2021
by   Noemi Corsini, et al.
0

The problem of overdispersion in multivariate count data is a challenging issue. Nowadays, it covers a central role mainly due to the relevance of modern technologies data, such as Next Generation Sequencing and textual data from the web or digital collections. This work presents a comprehensive analysis of the likelihood-based models for extra-variation data proposed in the scientific literature. Particular attention will be paid to the models feasible for high-dimensional data. A new approach together with its parametric-estimation procedure is proposed. It is a deeper version of the Dirichlet-Multinomial distribution and it leads to important results allowing to get a better approximation of the observed variability. A significative comparison of these models is made through two different simulation studies that both confirm that the new model considered in this work allows to achieve the best results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/30/2023

A multivariate heavy-tailed integer-valued GARCH process with EM algorithm-based inference

A new multivariate integer-valued Generalized AutoRegressive Conditional...
research
02/23/2023

A Bayesian Zero-Inflated Dirichlet-Multinomial Regression Model for Multivariate Compositional Count Data

The Dirichlet-multinomial (DM) distribution plays a fundamental role in ...
research
04/15/2020

A parsimonious family of multivariate Poisson-lognormal distributions for clustering multivariate count data

Multivariate count data are commonly encountered through high-throughput...
research
12/23/2020

Score matching for compositional distributions

Compositional data and multivariate count data with known totals are cha...
research
02/18/2019

Going deep in clustering high-dimensional data: deep mixtures of unigrams for uncovering topics in textual data

Mixtures of Unigrams (Nigam et al., 2000) are one of the simplest and mo...
research
07/02/2020

High-dimensional MANOVA via Bootstrapping and its Application to Functional and Sparse Count Data

We propose a new approach to the problem of high-dimensional multivariat...
research
05/29/2020

Multiresolution Decomposition of Areal Count Data

Multiresolution decomposition is commonly understood as a procedure to c...

Please sign up or login with your details

Forgot password? Click here to reset