Grouped Heterogeneous Mixture Modeling for Clustered Data

04/03/2018
by Shonosuke Sugasawa, et al.

Clustered data, which have a grouping structure (e.g. postal area, school, individual, species), appear in a variety of scientific fields. The goal of statistical analysis of clustered data is to model the response as a function of covariates while accounting for heterogeneity among clusters. For this purpose, we consider estimating cluster-wise conditional distributions by mixtures of latent conditional distributions that are common to all clusters, with mixing proportions that differ across clusters. To model the mixing proportions, we propose a structure in which the clusters are divided into a finite number of groups and the mixing proportions are assumed to be the same within each group. The proposed model is interpretable, and the maximum likelihood estimator is easy to compute via the generalized EM algorithm. In the setting where the cluster sizes grow with, but much more slowly than, the number of clusters, some asymptotic properties of the maximum likelihood estimator are presented. Furthermore, we propose an information criterion for selecting two tuning parameters: the number of groups and the number of latent conditional distributions. Numerical studies demonstrate that the proposed model outperforms some existing methods.
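To make the model structure concrete, the following is a minimal sketch and not the authors' implementation. It assumes the latent conditional distributions are Gaussian linear regressions, which the abstract does not specify, and all names (`cluster_loglik`, `best_group`, `pis`, `betas`, `sds`) are hypothetical. It evaluates the log-likelihood of one cluster under the shared components with a given group's mixing proportions, then picks the group that fits the cluster best while holding all parameters fixed. This is only an illustration of how groups share mixing proportions, not the generalized EM algorithm itself.

```python
import math

def normal_pdf(y, mean, sd):
    """Density of N(mean, sd^2) at y."""
    z = (y - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))

def cluster_loglik(ys, xs, pi_g, betas, sds):
    """Log-likelihood of one cluster under mixing proportions pi_g.

    ys: responses in the cluster; xs: covariate vectors (one per response).
    betas, sds: parameters of the K components shared by all clusters
    (assumed here to be Gaussian linear regressions).
    """
    total = 0.0
    for y, x in zip(ys, xs):
        mix = 0.0
        for pi_k, beta, sd in zip(pi_g, betas, sds):
            mean = sum(b * xj for b, xj in zip(beta, x))
            mix += pi_k * normal_pdf(y, mean, sd)
        total += math.log(mix)
    return total

def best_group(ys, xs, pis, betas, sds):
    """Index of the group whose mixing proportions maximize the cluster log-likelihood."""
    scores = [cluster_loglik(ys, xs, pi_g, betas, sds) for pi_g in pis]
    return max(range(len(pis)), key=scores.__getitem__)

# Toy setup (hypothetical values): K = 2 components, G = 2 groups.
betas = [[0.0, 1.0], [5.0, -1.0]]   # intercept and slope per component
sds = [1.0, 1.0]
pis = [[0.9, 0.1], [0.1, 0.9]]      # one row of mixing proportions per group
xs = [[1.0, 0.0], [1.0, 1.0], [1.0, 2.0]]
ys = [0.0, 1.0, 2.0]                # lies exactly on the first component's line
group = best_group(ys, xs, pis, betas, sds)
```

In this toy setup the cluster's responses follow the first component, so the group that puts most of its weight on that component is selected. In a full fit, the component parameters, the mixing proportions and the group memberships would all be estimated jointly.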

research
01/22/2021

Increasing Cluster Size Asymptotics for Nested Error Regression Models

This paper establishes asymptotic results for the maximum likelihood and...
research
08/22/2023

EM for Mixture of Linear Regression with Clustered Data

Modern data-driven and distributed learning frameworks deal with diverse...
research
06/07/2021

Estimating the size of a closed population by modeling latent and observed heterogeneity

The paper describes a new class of capture-recapture models for closed po...
research
06/22/2018

Grouped Mixture of Regressions

Finite Mixture of Regressions (FMR) models are among the most widely use...
research
06/30/2020

Real Elliptically Skewed Distributions and Their Application to Robust Cluster Analysis

This article proposes a new class of Real Elliptically Skewed (RESK) dis...
research
08/10/2020

Exact log-likelihood for clustering parameterised models and normally distributed data

Taking a model with equal means in each cluster, the log-likelihood for ...
research
11/05/2019

A Conway-Maxwell-Multinomial Distribution for Flexible Modeling of Clustered Categorical Data

Categorical data are often observed as counts resulting from a fixed num...
