Warped Mixtures for Nonparametric Cluster Shapes

08/09/2014
by   Tomoharu Iwata, et al.
0

A mixture of Gaussians fit to a single curved or heavy-tailed cluster will report that the data contains many clusters. To produce more appropriate clusterings, we introduce a model which warps a latent mixture of Gaussians to produce nonparametric cluster shapes. The possibly low-dimensional latent mixture model allows us to summarize the properties of the high-dimensional clusters (or density manifolds) describing the data. The number of manifolds, as well as the shape and dimension of each manifold is automatically inferred. We derive a simple inference scheme for this model which analytically integrates out both the mixture parameters and the warping function. We show that our model is effective for density estimation, performs better than infinite Gaussian mixture models at recovering the true number of clusters, and produces interpretable summaries of high-dimensional datasets.

READ FULL TEXT

page 5

page 7

research
06/04/2020

Bayesian clustering of high-dimensional data

In many applications, it is of interest to cluster subjects based on ver...
research
06/29/2021

Estimating Gaussian mixtures using sparse polynomial moment systems

The method of moments is a statistical technique for density estimation ...
research
07/13/2023

Dynamic Mixture of Finite Mixtures of Factor Analysers with Automatic Inference on the Number of Clusters and Factors

Mixtures of factor analysers (MFA) models represent a popular tool for f...
research
11/17/2017

Principal Manifolds of Middles: A Framework and Estimation Procedure Using Mixture Densities

Principal manifolds are used to represent high-dimensional data in a low...
research
08/21/2020

Visual Analysis of Large Multivariate Scattered Data using Clustering and Probabilistic Summaries

Rapidly growing data sizes of scientific simulations pose significant ch...
research
02/22/2019

Model-based clustering in very high dimensions via adaptive projections

Mixture models are a standard approach to dealing with heterogeneous dat...
research
10/26/2010

A GMBCG Galaxy Cluster Catalog of 55,424 Rich Clusters from SDSS DR7

We present a large catalog of optically selected galaxy clusters from th...

Please sign up or login with your details

Forgot password? Click here to reset