Model Selection for Mixture Models - Perspectives and Strategies

12/24/2018
by   Gilles Celeux, et al.
0

Determining the number G of components in a finite mixture distribution is an important and difficult inference issue. This is a most important question, because statistical inference about the resulting model is highly sensitive to the value of G. Selecting an erroneous value of G may produce a poor density estimate. This is also a most difficult question from a theoretical perspective as it relates to unidentifiability issues of the mixture model. This is further a most relevant question from a practical viewpoint since the meaning of the number of components G is strongly related to the modelling purpose of a mixture distribution. We distinguish in this chapter between selecting G as a density estimation problem in Section 2 and selecting G in a model-based clustering framework in Section 3. Both sections discuss frequentist as well as Bayesian approaches. We present here some of the Bayesian solutions to the different interpretations of picking the "right" number of components in a mixture, before concluding on the ill-posed nature of the question.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/09/2022

Bayesian Repulsive Mixture Modeling with Matérn Point Processes

Mixture models are a standard tool in statistical analysis, widely used ...
research
01/16/2013

Model Selection for Gaussian Mixture Models

This paper is concerned with an important issue in finite mixture modell...
research
11/27/2012

A LASSO-Penalized BIC for Mixture Model Selection

The efficacy of family-based approaches to mixture model-based clusterin...
research
07/20/2018

On a Loss-based prior for the number of components in mixture models

We propose a prior distribution for the number of components of a finite...
research
11/15/2019

How bettering the best? Answers via blending models and cluster formulations in density-based clustering

With the recent growth in data availability and complexity, and the asso...
research
12/23/2012

Mixture Model Averaging for Clustering

In mixture model-based clustering applications, it is common to fit seve...
research
05/10/2019

Statistical inference with anchored Bayesian mixture of regressions models: A case study analysis of allometric data

We present a case study in which we use a mixture of regressions model t...

Please sign up or login with your details

Forgot password? Click here to reset