A review on Bayesian model-based clustering

03/30/2023
by   Clara Grazian, et al.
0

Clustering is an important task in many areas of knowledge: medicine and epidemiology, genomics, environmental science, economics, visual sciences, among others. Methodologies to perform inference on the number of clusters have often been proved to be inconsistent, and introducing a dependence structure among the clusters implies additional difficulties in the estimation process. In a Bayesian setting, clustering is performed by considering the unknown partition as a random object and define a prior distribution on it. This prior distribution may be induced by models on the observations, or directly defined for the partition. Several recent results, however, have shown the difficulties in consistently estimating the number of clusters, and, therefore, the partition. The problem itself of summarising the posterior distribution on the partition remains open, given the large dimension of the partition space. This work aims at reviewing the Bayesian approaches available in the literature to perform clustering, presenting advantages and disadvantages of each of them in order to suggest future lines of research.

READ FULL TEXT
research
07/19/2023

Entropy regularization in probabilistic clustering

Bayesian nonparametric mixture models are widely used to cluster observa...
research
05/23/2019

Posterior Distribution for the Number of Clusters in Dirichlet Process Mixture Models

Dirichlet process mixture models (DPMM) play a central role in Bayesian ...
research
01/30/2022

Why the Rich Get Richer? On the Balancedness of Random Partition Models

Random partition models are widely used in Bayesian methods for various ...
research
07/19/2022

Clustering constrained on linear networks

An unsupervised classification method for point events occurring on a ne...
research
06/14/2023

Graph-Aligned Random Partition Model (GARP)

Bayesian nonparametric mixtures and random partition models are powerful...
research
01/29/2019

Centered Partition Process: Informative Priors for Clustering

There is a very rich literature proposing Bayesian approaches for cluste...
research
08/03/2023

Similarity-based Random Partition Distribution for Clustering Functional Data

Random partitioned distribution is a powerful tool for model-based clust...

Please sign up or login with your details

Forgot password? Click here to reset