Quantifying Distances Between Clusters with Elliptical or Non-Elliptical Shapes

06/23/2022
by   Meredith L. Wallace, et al.
0

Finite mixture models that allow for a broad range of potentially non-elliptical cluster distributions is an emerging methodological field. Such methods allow for the shape of the clusters to match the natural heterogeneity of the data, rather than forcing a series of elliptical clusters. These methods are highly relevant for clustering continuous non-normal data - a common occurrence with objective data that are now routinely captured in health research. However, interpreting and comparing such models - especially with regards to whether they produce meaningful clusters that are reasonably well separated - is non-trivial. We summarize several measures that can succinctly quantify the multivariate distance between two clusters, regardless of the cluster distribution, and suggest practical computational tools. Through a simulation study, we evaluate these measures across three scenarios that allow for clusters to differ in mean, scale, and rotation. We then demonstrate our approaches using physiological responses to emotional imagery captured as part of the Transdiagnostic Anxiety Study, a large-scale study of anxiety disorder spectrum patients and control participants. Finally, we synthesize findings to provide guidance on how to use distance measures in clustering applications.

READ FULL TEXT

page 7

page 9

page 10

page 12

page 15

page 16

page 17

research
09/24/2017

Interdependence of clusters measures and distance distribution in compact metric spaces

A compact metric space (X, ρ) is given. Let μ be a Borel measure on X. B...
research
08/22/2021

The Exploitation of Distance Distributions for Clustering

Although distance measures are used in many machine learning algorithms,...
research
05/25/2023

Metrics for quantifying isotropy in high dimensional unsupervised clustering tasks in a materials context

Clustering is a common task in machine learning, but clusters of unlabel...
research
02/04/2019

A note on the geometry of the MAP partition in some Normal Bayesian Mixture Models

We investigate the geometry of the maximal a posteriori (MAP) partition ...
research
08/25/2021

Clustering acoustic emission data streams with sequentially appearing clusters using mixture models

The interpretation of unlabeled acoustic emission (AE) data classically ...
research
09/23/2016

Fast Learning of Clusters and Topics via Sparse Posteriors

Mixture models and topic models generate each observation from a single ...
research
10/01/2018

Accelerated Training of Large-Scale Gaussian Mixtures by a Merger of Sublinear Approaches

We combine two recent lines of research on sublinear clustering to signi...

Please sign up or login with your details

Forgot password? Click here to reset