A LASSO-Penalized BIC for Mixture Model Selection

11/27/2012
by   Sakyajit Bhattacharya, et al.
0

The efficacy of family-based approaches to mixture model-based clustering and classification depends on the selection of parsimonious models. Current wisdom suggests the Bayesian information criterion (BIC) for mixture model selection. However, the BIC has well-known limitations, including a tendency to overestimate the number of components as well as a proclivity for, often drastically, underestimating the number of components in higher dimensions. While the former problem might be soluble through merging components, the latter is impossible to mitigate in clustering and classification applications. In this paper, a LASSO-penalized BIC (LPBIC) is introduced to overcome this problem. This approach is illustrated based on applications of extensions of mixtures of factor analyzers, where the LPBIC is used to select both the number of components and the number of latent factors. The LPBIC is shown to match or outperform the BIC in several situations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/23/2012

Mixture Model Averaging for Clustering

In mixture model-based clustering applications, it is common to fit seve...
research
05/11/2020

Infinite mixtures of multivariate normal-inverse Gaussian distributions for clustering of skewed data

Mixtures of multivariate normal inverse Gaussian (MNIG) distributions ca...
research
07/08/2013

Bridging Information Criteria and Parameter Shrinkage for Model Selection

Model selection based on classical information criteria, such as BIC, is...
research
12/24/2018

Model Selection for Mixture Models - Perspectives and Strategies

Determining the number G of components in a finite mixture distribution ...
research
11/24/2020

Persistent Mixture Model Networks for Few-Shot Image Classification

We introduce Persistent Mixture Model (PMM) networks for representation ...
research
11/15/2019

How bettering the best? Answers via blending models and cluster formulations in density-based clustering

With the recent growth in data availability and complexity, and the asso...
research
09/24/2017

On Fractionally-Supervised Classification: Weight Selection and Extension to the Multivariate t-Distribution

Recent work on fractionally-supervised classification (FSC), an approach...

Please sign up or login with your details

Forgot password? Click here to reset