Unsupervised learning of regression mixture models with unknown number of components

09/24/2014
by   Faicel Chamroukhi, et al.
0

Regression mixture models are widely studied in statistics, machine learning and data analysis. Fitting regression mixtures is challenging and is usually performed by maximum likelihood by using the expectation-maximization (EM) algorithm. However, it is well-known that the initialization is crucial for EM. If the initialization is inappropriately performed, the EM algorithm may lead to unsatisfactory results. The EM algorithm also requires the number of clusters to be given a priori; the problem of selecting the number of mixture components requires using model selection criteria to choose one from a set of pre-estimated candidate models. We propose a new fully unsupervised algorithm to learn regression mixture models with unknown number of components. The developed unsupervised learning approach consists in a penalized maximum likelihood estimation carried out by a robust expectation-maximization (EM) algorithm for fitting polynomial, spline and B-spline regressions mixtures. The proposed learning approach is fully unsupervised: 1) it simultaneously infers the model parameters and the optimal number of the regression mixture components from the data as the learning proceeds, rather than in a two-fold scheme as in standard model-based clustering using afterward model selection criteria, and 2) it does not require accurate initialization unlike the standard EM for regression mixtures. The developed approach is applied to curve clustering problems. Numerical experiments on simulated data show that the proposed robust EM algorithm performs well and provides accurate results in terms of robustness with regard initialization and retrieving the optimal partition with the actual number of clusters. An application to real data in the framework of functional data clustering, confirms the benefit of the proposed approach for practical applications.

READ FULL TEXT

page 17

page 19

page 24

page 25

page 26

page 27

page 31

page 35

research
12/25/2013

Robust EM algorithm for model-based curve clustering

Model-based clustering approaches concern the paradigm of exploratory da...
research
10/10/2021

Fitting large mixture models using stochastic component selection

Traditional methods for unsupervised learning of finite mixture models r...
research
01/31/2022

Spectral image clustering on dual-energy CT scans using functional regression mixtures

Dual-energy computed tomography (DECT) is an advanced CT scanning techni...
research
07/28/2022

Model based clustering of multinomial count data

We consider the problem of inferring an unknown number of clusters in re...
research
09/26/2020

An Adaptive EM Accelerator for Unsupervised Learning of Gaussian Mixture Models

We propose an Anderson Acceleration (AA) scheme for the adaptive Expecta...
research
08/04/2015

Bayesian mixtures of spatial spline regressions

This work relates the framework of model-based clustering for spatial fu...
research
07/29/2015

Context-aware learning for finite mixture models

This work introduces algorithms able to exploit contextual information i...

Please sign up or login with your details

Forgot password? Click here to reset