Clustering consistency with Dirichlet process mixtures

05/25/2022
by   Filippo Ascolani, et al.
0

Dirichlet process mixtures are flexible non-parametric models, particularly suited to density estimation and probabilistic clustering. In this work we study the posterior distribution induced by Dirichlet process mixtures as the sample size increases, and more specifically focus on consistency for the unknown number of clusters when the observed data are generated from a finite mixture. Crucially, we consider the situation where a prior is placed on the concentration parameter of the underlying Dirichlet process. Previous findings in the literature suggest that Dirichlet process mixtures are typically not consistent for the number of clusters if the concentration parameter is held fixed and data come from a finite mixture. Here we show that consistency for the number of clusters can be achieved if the concentration parameter is adapted in a fully Bayesian way, as commonly done in practice. Our results are derived for data coming from a class of finite mixtures, with mild assumptions on the prior for the concentration parameter and for a variety of choices of likelihood kernels for the mixture.

READ FULL TEXT
research
08/30/2013

Inconsistency of Pitman-Yor process mixtures for the number of components

In many applications, a finite mixture is a natural model, but it can be...
research
09/29/2014

Adaptive Low-Complexity Sequential Inference for Dirichlet Process Mixture Models

We develop a sequential low-complexity inference procedure for Dirichlet...
research
08/02/2020

Dirichlet-tree multinomial mixtures for clustering microbiome compositions

A common routine in microbiome research is to identify reproducible patt...
research
09/07/2018

Dirichlet process mixtures under affine transformations of the data

Location-scale Dirichlet process mixtures of Gaussians (DPM-G) have prov...
research
07/29/2022

Bayesian nonparametric mixture inconsistency for the number of components: How worried should we be in practice?

We consider the Bayesian mixture of finite mixtures (MFMs) and Dirichlet...
research
05/20/2020

Dynamic mixtures of finite mixtures and telescoping sampling

Within a Bayesian framework, a comprehensive investigation of the model ...
research
06/04/2023

Bayesian nonparametric modeling of latent partitions via Stirling-gamma priors

Dirichlet process mixtures are particularly sensitive to the value of th...

Please sign up or login with your details

Forgot password? Click here to reset