# Variance partitioning in multilevel models for count data

A first step when fitting multilevel models to continuous responses is to explore the degree of clustering in the data. Researchers fit variance-component models and then report the proportion of variation in the response that is due to systematic differences between clusters or equally the response correlation between units within a cluster. These statistics are popularly referred to as variance partition coefficients (VPCs) and intraclass correlation coefficients (ICCs). When fitting multilevel models to categorical (binary, ordinal, or nominal) and count responses, these statistics prove more challenging to calculate. For categorical response models, researchers frequently appeal to their latent response formulations and report VPCs/ICCs in terms of latent continuous responses envisaged to underly the observed categorical responses. For standard count response models, however, there are no corresponding latent response formulations. More generally, there is a paucity of guidance on how to partition the variance. As a result, applied researchers are likely to avoid or inadequately report and discuss the substantive importance of clustering and cluster effects in their studies. A recent article drew attention to a little-known algebraic expression for the VPC/ICC for the special case of the two-level random-intercept Poisson model. In this article, we make a substantial new contribution. First, we derive VPC/ICC expressions for the more flexible negative binomial model that allows for overdispersion, a phenomenon which often occurs in practice with count data. Then we derive VPC/ICC expressions for three-level and random-coefficient extensions to these models. We illustrate all our work with an application to student absenteeism.

• 13 publications
• 1 publication
• 6 publications
• 1 publication
• 1 publication
07/12/2019

### Multilevel models for continuous outcomes

Multilevel models (mixed-effect models or hierarchical linear models) ar...
05/21/2022

### Multivariate generalized linear mixed models for underdispersed count data

Researchers are often interested in understanding the relationship betwe...
05/20/2020

### Dyadic Reciprocity as a Function of Covariates

Reciprocity in dyadic interactions is common and a topic of interest acr...
08/02/2019

### Generalised Joint Regression for Count Data with a Focus on Modelling Football Matches

We propose a versatile joint regression framework for count responses. T...
09/04/2017

### On synthetic data with predetermined subject partitioning and cluster profiling, and pre-specified categorical variable marginal dependence structure

A standard approach for assessing the performance of partition or mixtur...
07/06/2015

### A model of sensory neural responses in the presence of unknown modulatory inputs

Neural responses are highly variable, and some portion of this variabili...