Posterior Distribution for the Number of Clusters in Dirichlet Process Mixture Models

05/23/2019
by   Chiao-Yu Yang, et al.

Dirichlet process mixture models (DPMMs) play a central role in Bayesian nonparametrics, with applications throughout statistics and machine learning. DPMMs are generally used in clustering problems where the number of clusters is not known in advance, and the posterior distribution is treated as providing inference for this number. Recently, however, it has been shown that the DPMM is inconsistent in inferring the true number of components in certain cases. This is an asymptotic result, and it would be desirable to understand whether it also holds with finite samples, and to characterize the full posterior more completely. In this work, we provide a rigorous study of the posterior distribution of the number of clusters in DPMMs under different prior distributions on the parameters and constraints on the distributions of the data. We provide novel lower bounds on the ratios of posterior probabilities between s+1 clusters and s clusters when the prior distributions on the parameters are chosen to be Gaussian or uniform distributions.
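To make the quantity in the abstract concrete, here is a minimal sketch that computes the exact posterior over the number of clusters for a tiny one-dimensional data set by enumerating every partition, then prints the ratio between s+1 and s clusters. It is an illustration only, not the paper's method: it assumes a Gaussian likelihood with known variance `sigma2`, a zero-mean Gaussian prior with variance `tau2` on each cluster mean, and concentration `alpha`. All of these names, values, and the toy data are hypothetical choices.

```python
import math
from collections import defaultdict


def set_partitions(n):
    """Yield every partition of range(n) as a list of blocks (lists of indices)."""
    def extend(i, blocks):
        if i == n:
            yield [list(b) for b in blocks]
            return
        for b in blocks:          # place item i in an existing block
            b.append(i)
            yield from extend(i + 1, blocks)
            b.pop()
        blocks.append([i])        # or open a new block
        yield from extend(i + 1, blocks)
        blocks.pop()
    yield from extend(0, [])


def log_marginal(xs, sigma2, tau2):
    """Log marginal likelihood of one cluster: x_i ~ N(mu, sigma2), mu ~ N(0, tau2).

    Assumed model for illustration; mu is integrated out in closed form.
    """
    n = len(xs)
    s = sum(xs)
    ss = sum(x * x for x in xs)
    return (-0.5 * n * math.log(2 * math.pi * sigma2)
            - 0.5 * math.log(1 + n * tau2 / sigma2)
            - 0.5 / sigma2 * (ss - tau2 * s * s / (sigma2 + n * tau2)))


def cluster_count_posterior(data, alpha=1.0, sigma2=1.0, tau2=4.0):
    """Exact posterior P(K = s | data) by brute-force enumeration (small n only)."""
    log_w = defaultdict(list)
    for part in set_partitions(len(data)):
        # Chinese restaurant process weight: alpha^K * prod (n_c - 1)!
        lw = len(part) * math.log(alpha)
        for block in part:
            lw += math.lgamma(len(block))
            lw += log_marginal([data[i] for i in block], sigma2, tau2)
        log_w[len(part)].append(lw)
    m = max(max(v) for v in log_w.values())  # subtract max for numerical stability
    unnorm = {k: sum(math.exp(l - m) for l in v) for k, v in log_w.items()}
    z = sum(unnorm.values())
    return {k: unnorm[k] / z for k in sorted(unnorm)}


if __name__ == "__main__":
    data = [-2.1, -1.9, -2.0, 2.0, 2.2, 1.8]  # hypothetical toy data
    post = cluster_count_posterior(data)
    for s in sorted(post):
        print(f"P(K={s} | data) = {post[s]:.4f}")
    for s in sorted(post)[:-1]:
        print(f"ratio P(K={s + 1}) / P(K={s}) = {post[s + 1] / post[s]:.4f}")
```

Enumeration grows with the Bell number of the sample size, so this only works for a handful of points. Its purpose is to show what the ratio between s+1 and s clusters means at a finite sample size.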


research
10/25/2022

Bayesian mixture models (in)consistency for the number of clusters

Bayesian nonparametric mixture models are common for modeling complex da...
research
07/08/2018

BALSON: Bayesian Least Squares Optimization with Nonnegative L1-Norm Constraint

A Bayesian approach termed BAyesian Least Squares Optimization with Nonn...
research
07/19/2023

Entropy regularization in probabilistic clustering

Bayesian nonparametric mixture models are widely used to cluster observa...
research
03/30/2023

A review on Bayesian model-based clustering

Clustering is an important task in many areas of knowledge: medicine and...
research
09/23/2016

Fast Learning of Clusters and Topics via Sparse Posteriors

Mixture models and topic models generate each observation from a single ...
research
07/29/2022

Bayesian nonparametric mixture inconsistency for the number of components: How worried should we be in practice?

We consider the Bayesian mixture of finite mixtures (MFMs) and Dirichlet...
research
09/23/2019

On uniform continuity of posterior distributions

In the setting of dominated statistical models, we provide conditions yi...
