Bayesian clustering of high-dimensional data via latent repulsive mixtures

03/04/2023
by   Lorenzo Ghilotti, et al.
0

Model-based clustering of moderate or large dimensional data is notoriously difficult. We propose a model for simultaneous dimensionality reduction and clustering by assuming a mixture model for a set of latent scores, which are then linked to the observations via a Gaussian latent factor model. This approach was recently investigated by Chandra et al. (2020). The authors use a factor-analytic representation and assume a mixture model for the latent factors. However, performance can deteriorate in the presence of model misspecification. Assuming a repulsive point process prior for the component-specific means of the mixture for the latent scores is shown to yield a more robust model that outperforms the standard mixture model for the latent factors in several simulated scenarios. To favor well-separated clusters of data, the repulsive point process must be anisotropic, and its density should be tractable for efficient posterior inference. We address these issues by proposing a general construction for anisotropic determinantal point processes.

READ FULL TEXT
research
07/21/2013

Mixtures of Common Skew-t Factor Analyzers

A mixture of common skew-t factor analyzers model is introduced for mode...
research
06/04/2020

Bayesian clustering of high-dimensional data

In many applications, it is of interest to cluster subjects based on ver...
research
11/17/2020

Defying the Circadian Rhythm: Clustering Participant Telemetry in the UK Biobank Data

The UK Biobank dataset follows over 500,000 volunteers and contains a di...
research
01/06/2021

Logistic Normal Multinomial Factor Analyzers for Clustering Microbiome Data

The human microbiome plays an important role in human health and disease...
research
07/04/2019

An enriched mixture model for functional clustering

There is an increasingly rich literature about Bayesian nonparametric mo...
research
11/26/2019

Robustly Clustering a Mixture of Gaussians

We give an efficient algorithm for robustly clustering of a mixture of a...
research
02/18/2019

Going deep in clustering high-dimensional data: deep mixtures of unigrams for uncovering topics in textual data

Mixtures of Unigrams (Nigam et al., 2000) are one of the simplest and mo...

Please sign up or login with your details

Forgot password? Click here to reset