Common Failure Modes of Subcluster-based Sampling in Dirichlet Process Gaussian Mixture Models – and a Deep-learning Solution

03/25/2022
by   Vlad Winter, et al.
0

The Dirichlet Process Gaussian Mixture Model (DPGMM) is often used to cluster data when the number of clusters is unknown. One main DPGMM inference paradigm relies on sampling. Here we consider a known state-of-art sampler (proposed by Chang and Fisher III (2013) and improved by Dinari et al. (2019)), analyze its failure modes, and show how to improve it, often drastically. Concretely, in that sampler, whenever a new cluster is formed it is augmented with two subclusters whose labels are initialized at random. Upon their evolution, the subclusters serve to propose a split of the parent cluster. We show that the random initialization is often problematic and hurts the otherwise-effective sampler. Specifically, we demonstrate that this initialization tends to lead to poor split proposals and/or too many iterations before a desired split is accepted. This slows convergence and can damage the clustering. As a remedy, we propose two drop-in-replacement options for the subcluster-initialization subroutine. The first is an intuitive heuristic while the second is based on deep learning. We show that the proposed approach yields better splits, which in turn translate to substantial improvements in performance, results, and stability.

READ FULL TEXT

page 14

page 22

page 23

research
01/14/2015

Dirichlet Process Parsimonious Mixtures for clustering

The parsimonious Gaussian mixture models, which exploit an eigenvalue de...
research
06/16/2020

On the Variational Posterior of Dirichlet Process Deep Latent Gaussian Mixture Models

Thanks to the reparameterization trick, deep latent Gaussian models have...
research
03/15/2012

Dirichlet Process Mixtures of Generalized Mallows Models

We present a Dirichlet process mixture model over discrete incomplete ra...
research
10/07/2019

Gaussian Mixture Clustering Using Relative Tests of Fit

We consider clustering based on significance tests for Gaussian Mixture ...
research
08/21/2017

Neural Block Sampling

Efficient Monte Carlo inference often requires manual construction of mo...
research
02/23/2017

Analyzing Learned Convnet Features with Dirichlet Process Gaussian Mixture Models

Convolutional Neural Networks (Convnets) have achieved good results in a...
research
08/22/2023

A Study of Particle Motion in the Presence of Clusters

The motivation for this study came from the task of analysing the kineti...

Please sign up or login with your details

Forgot password? Click here to reset