Mixtures of multivariate generalized linear models with overlapping clusters

11/20/2019
by Saverio Ranciati, et al.

With the advent of ubiquitous monitoring and measurement protocols, studies increasingly focus on complex, multivariate and heterogeneous datasets. In such studies, multivariate response variables are drawn from a heterogeneous population, often in the presence of additional covariate information. To deal with this intrinsic heterogeneity, regression analyses have to be clustered for different groups of units. Until now, mixture model approaches have assigned units to distinct, non-overlapping groups. However, units often exhibit more complex organization and clustering. Our aim is to define a mixture of generalized linear models with overlapping clusters of units. This crucially involves an overlap function that maps the coefficients of the parent clusters into the coefficients of the multiple-allocation units. We present a computationally efficient MCMC scheme that samples from the posterior distribution of the model parameters. An example on a two-mode network study shows details of the implementation in a multivariate probit regression setting. A simulation study shows the overall performance of the method, and an illustration of voting behaviour on the US Supreme Court shows how the 9 justices split into two overlapping sets.
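To make the idea of an overlap function concrete, the following minimal sketch enumerates the possible allocations of a unit over a set of parent clusters and derives the coefficients of a multiple-allocation unit from those of its parents. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the form of the overlap function, so a simple average is assumed here, and the names `allocation_patterns`, `overlap_mean` and `probit_prob` are hypothetical. For brevity it uses a single binary response with a probit link rather than the multivariate probit setting mentioned above.

```python
import math
from itertools import product


def allocation_patterns(n_clusters):
    """Return all non-empty binary allocation vectors over the parent clusters.

    A vector such as (1, 1) marks a unit that belongs to both clusters.
    """
    return [z for z in product((0, 1), repeat=n_clusters) if any(z)]


def overlap_mean(betas, z):
    """Assumed overlap function: average the coefficient vectors of the
    parent clusters flagged in z. The paper may use a different mapping."""
    active = [beta for beta, flag in zip(betas, z) if flag]
    return [sum(column) / len(active) for column in zip(*active)]


def probit_prob(x, beta):
    """P(y = 1 | x) under a probit link: the standard normal CDF of x'beta."""
    eta = sum(xi * bi for xi, bi in zip(x, beta))
    return 0.5 * (1.0 + math.erf(eta / math.sqrt(2.0)))


# Two parent clusters with made-up coefficient vectors (illustrative values).
betas = [[1.0, -0.5], [-1.0, 0.5]]
x = [1.0, 1.0]  # covariates for one unit

for z in allocation_patterns(len(betas)):
    beta_z = overlap_mean(betas, z)
    print(z, beta_z, round(probit_prob(x, beta_z), 4))
```

With K parent clusters there are 2^K - 1 non-empty allocation patterns, but only K coefficient vectors need to be estimated, because the coefficients of every overlapping pattern are derived from those of its parents.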

