Mixed data Deep Gaussian Mixture Model: A clustering model for mixed datasets

10/13/2020
by   Robin Fuchs, et al.
0

Clustering mixed data presents numerous challenges inherent to the very heterogeneous nature of the variables. Two major difficulties lie in the initialisation of the algorithms and in making variables comparable between types. This work is concerned with these two problems. We introduce a two-heads architecture model-based clustering method called Mixed data Deep Gaussian Mixture Model (MDGMM) that can be viewed as an automatic way to merge the clusterings performed separately on continuous and non continuous data. We also design a new initialisation strategy and a data driven method that selects "on the fly" the best specification of the model and the optimal number of clusters for a given dataset. Besides, our model provides continuous low-dimensional representations of the data which can be a useful tool to visualize mixed datasets. Finally, we validate the performance of our approach comparing its results with state-of-the-art mixed data clustering models over several commonly used datasets

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/13/2018

A Latent Gaussian Mixture Model for Clustering Longitudinal Data

Finite mixture models have become a popular tool for clustering. Amongst...
research
10/03/2020

EGMM: an Evidential Version of the Gaussian Mixture Model for Clustering

The Gaussian mixture model (GMM) provides a convenient yet principled fr...
research
09/06/2019

Unsupervised Clustering of Quantitative Imaging Phenotypes using Autoencoder and Gaussian Mixture Model

Quantitative medical image computing (radiomics) has been widely applied...
research
12/07/2020

Joint Optimization of an Autoencoder for Clustering and Embedding

Incorporating k-means-like clustering techniques into (deep) autoencoder...
research
06/09/2023

An introduction and tutorial to model-based clustering in education via Gaussian mixture modelling

Heterogeneity has been a hot topic in recent educational literature. Sev...
research
12/23/2012

Mixture Model Averaging for Clustering

In mixture model-based clustering applications, it is common to fit seve...
research
05/09/2019

A Bayesian Finite Mixture Model with Variable Selection for Data with Mixed-type Variables

Finite mixture model is an important branch of clustering methods and ca...

Please sign up or login with your details

Forgot password? Click here to reset