Dimension reduction for model-based clustering

08/07/2015
by   Luca Scrucca, et al.

We introduce a dimension reduction method for visualizing the clustering structure obtained from a finite mixture of Gaussian densities. Information on the dimension reduction subspace is obtained from the variation in group means and, depending on the estimated mixture model, from the variation in group covariances. The proposed method reduces dimensionality by identifying a set of linear combinations of the original features, ordered by importance as quantified by the associated eigenvalues, which capture most of the cluster structure contained in the data. Observations may then be projected onto this reduced subspace, providing summary plots that help to visualize the clustering structure. These plots can be particularly appealing for high-dimensional data and noisy structure. The newly constructed variables capture most of the clustering information available in the data, and they can be further reduced to improve clustering performance. We illustrate the approach on both simulated and real data sets.
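
To make the idea of eigenvalue-ordered linear combinations concrete, the following is a minimal sketch and not the authors' implementation. It assumes hard cluster labels are already available (for example, from a fitted Gaussian mixture), uses only the variation in group means, and omits the group-covariance term that the abstract mentions. The kernel matrix, the function name `mean_variation_directions`, and the simulated data are all illustrative assumptions.

```python
import numpy as np


def mean_variation_directions(X, labels):
    """Directions ordered by how much between-group mean variation they carry.

    Assumption: `labels` are hard cluster assignments; only group means are
    used, so variation in group covariances is ignored in this sketch.
    """
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    n, p = X.shape

    overall_mean = X.mean(axis=0)
    centered = X - overall_mean
    sigma = centered.T @ centered / n  # overall (marginal) covariance

    # Between-group covariance, weighted by cluster proportions.
    sigma_b = np.zeros((p, p))
    for k in np.unique(labels):
        members = X[labels == k]
        weight = members.shape[0] / n
        diff = members.mean(axis=0) - overall_mean
        sigma_b += weight * np.outer(diff, diff)

    # Assumed kernel built from mean variation only.
    kernel = sigma_b @ np.linalg.solve(sigma, sigma_b)

    # Solve kernel v = lambda * sigma v by whitening with a Cholesky factor.
    chol = np.linalg.cholesky(sigma)
    chol_inv = np.linalg.inv(chol)
    whitened = chol_inv @ kernel @ chol_inv.T
    whitened = (whitened + whitened.T) / 2  # enforce exact symmetry
    eigvals, eigvecs = np.linalg.eigh(whitened)

    order = np.argsort(eigvals)[::-1]  # largest eigenvalue first
    eigvals = eigvals[order]
    directions = chol_inv.T @ eigvecs[:, order]
    directions /= np.linalg.norm(directions, axis=0)  # unit-length columns
    return eigvals, directions


# Simulated example: two groups separated along the first feature only.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
labels = np.repeat([0, 1], 100)
X[labels == 1, 0] += 4.0

eigvals, V = mean_variation_directions(X, labels)
# Project onto the leading direction to get a one-dimensional summary.
projected = (X - X.mean(axis=0)) @ V[:, :1]
```

In this simulated setting the leading direction loads almost entirely on the first feature, and plotting `projected` coloured by `labels` gives the kind of summary plot the abstract describes. Retaining only the first few columns of `V` corresponds to reducing the constructed variables further.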

research
08/28/2013

Clustering, Classification, Discriminant Analysis, and Dimension Reduction via Generalized Hyperbolic Mixtures

A method for dimension reduction with clustering, classification, or dis...
research
09/26/2019

Dynamic Partial Sufficient Dimension Reduction

Sufficient dimension reduction aims for reduction of dimensionality of a...
research
08/26/2015

Gaussian Mixture Models with Component Means Constrained in Pre-selected Subspaces

We investigate a Gaussian mixture model (GMM) with component means const...
research
09/15/2016

Recursive nearest agglomeration (ReNA): fast clustering for approximation of structured signals

In this work, we revisit fast dimension reduction approaches, as with r...
research
01/12/2011

Simultaneous model-based clustering and visualization in the Fisher discriminative subspace

Clustering in high-dimensional spaces is nowadays a recurrent problem in...
research
08/20/2020

An Examination of Grouping and Spatial Organization Tasks for High-Dimensional Data Exploration

How do analysts think about grouping and spatial operations? This overar...
research
12/12/2022

Tandem clustering with invariant coordinate selection

For high-dimensional data or data with noise variables, tandem clusterin...
