Mixture Model Averaging for Clustering

12/23/2012
by   Yuhong Wei, et al.
0

In mixture model-based clustering applications, it is common to fit several models from a family and report clustering results from only the `best' one. In such circumstances, selection of this best model is achieved using a model selection criterion, most often the Bayesian information criterion. Rather than throw away all but the best model, we average multiple models that are in some sense close to the best one, thereby producing a weighted average of clustering results. Two (weighted) averaging approaches are considered: averaging the component membership probabilities and averaging models. In both cases, Occam's window is used to determine closeness to the best model and weights are computed within a Bayesian model averaging paradigm. In some cases, we need to merge components before averaging; we introduce a method for merging mixture components based on the adjusted Rand index. The effectiveness of our model-based clustering averaging approaches is illustrated using a family of Gaussian mixture models on real and simulated data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/27/2017

Bayesian Model Averaging By Mixture Modeling

A new and numerically efficient method for Bayes factor computation and ...
research
11/27/2012

A LASSO-Penalized BIC for Mixture Model Selection

The efficacy of family-based approaches to mixture model-based clusterin...
research
09/09/2022

clusterBMA: Bayesian model averaging for clustering

Various methods have been developed to combine inference across multiple...
research
11/15/2019

How bettering the best? Answers via blending models and cluster formulations in density-based clustering

With the recent growth in data availability and complexity, and the asso...
research
08/03/2020

Bayesian model averaging for analysis of lattice field theory results

Statistical modeling is a key component in the extraction of physical re...
research
10/13/2020

Mixed data Deep Gaussian Mixture Model: A clustering model for mixed datasets

Clustering mixed data presents numerous challenges inherent to the very ...
research
12/24/2018

Model Selection for Mixture Models - Perspectives and Strategies

Determining the number G of components in a finite mixture distribution ...

Please sign up or login with your details

Forgot password? Click here to reset