Estimating the Number of Clusters via Normalized Cluster Instability

08/26/2016
by   Jonas M. B. Haslbeck, et al.
0

We improve existing instability-based methods for the selection of the number of clusters k in cluster analysis by normalizing instability. In contrast to existing instability methods which only perform well for bounded sequences of small k, our method performs well across the whole sequence of possible k. In addition, we compare for the first time model-based and model-free variants of k selection via cluster instability and find that their performance is similar. We make our method available in the R-package +cstab+.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/25/2023

Flexible Variable Selection for Clustering and Classification

The importance of variable selection for clustering has been recognized ...
research
07/24/2023

Finite Size Effects in Addition and Chipping Processes

We investigate analytically and numerically a system of clusters evolvin...
research
05/05/2020

Cluster-based dual evolution for multivariate systems

This paper proposes a cluster-based method to analyse multivariate syste...
research
03/14/2023

Optimal Study Designs for Cluster Randomised Trials: An Overview of Methods and Results

There are multiple cluster randomised trial designs that vary in when th...
research
07/27/2020

Modeling the Influence of Visual Density on Cluster Perception in Scatterplots Using Topology

Scatterplots are used for a variety of visual analytics tasks, including...
research
12/07/2020

Cluster analysis of presolar silicon carbide grains: evaluation of their classification and astrophysical implications

Cluster analysis of presolar silicon carbide grains based on literature ...
research
07/26/2022

An Effective Method for Identifying Clusters of Robot Strengths

In the analysis of qualification data from the FIRST Robotics Competitio...

Please sign up or login with your details

Forgot password? Click here to reset