Modal clustering on PPGMMGA projection subspace

06/22/2021
by   Luca Scrucca, et al.
0

PPGMMGA is a Projection Pursuit (PP) algorithm aimed at detecting and visualizing clustering structures in multivariate data. The algorithm uses the negentropy as PP index obtained by fitting Gaussian Mixture Models (GMMs) for density estimation, and then optimized using Genetic Algorithms (GAs). Since the PPGMMGA algorithm is a dimension reduction technique specifically introduced for visualization purposes, cluster memberships are not explicitly provided. In this paper a modal clustering approach is proposed for estimating clusters of projected data points. In particular, a modal EM algorithm is employed to estimate the modes corresponding to the local maxima in the projection subspace of the underlying density estimated using parsimonious GMMs. Data points are then clustered according to the domain of attraction of the identified modes. Simulated and real data are discussed to illustrate the proposed method and evaluate the clustering performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/27/2019

Projection pursuit based on Gaussian mixtures and evolutionary algorithms

We propose a projection pursuit (PP) algorithm based on Gaussian mixture...
research
02/10/2020

A fast and efficient Modal EM algorithm for Gaussian mixtures

In the modal approach to clustering, clusters are defined as the local m...
research
07/15/2015

Unsupervised Decision Forest for Data Clustering and Density Estimation

An algorithm to improve performance parameter for unsupervised decision ...
research
01/23/2020

Expected Information Maximization: Using the I-Projection for Mixture Density Estimation

Modelling highly multi-modal data is a challenging problem in machine le...
research
01/12/2011

Simultaneous model-based clustering and visualization in the Fisher discriminative subspace

Clustering in high-dimensional spaces is nowadays a recurrent problem in...
research
08/26/2015

Gaussian Mixture Models with Component Means Constrained in Pre-selected Subspaces

We investigate a Gaussian mixture model (GMM) with component means const...
research
06/13/2016

Modal-set estimation with an application to clustering

We present a first procedure that can estimate -- with statistical consi...

Please sign up or login with your details

Forgot password? Click here to reset