A General Hybrid Clustering Technique

03/04/2015
by   Saeid Amiri, et al.
0

Here, we propose a clustering technique for general clustering problems including those that have non-convex clusters. For a given desired number of clusters K, we use three stages to find a clustering. The first stage uses a hybrid clustering technique to produce a series of clusterings of various sizes (randomly selected). They key steps are to find a K-means clustering using K_ℓ clusters where K_ℓ≫ K and then joins these small clusters by using single linkage clustering. The second stage stabilizes the result of stage one by reclustering via the `membership matrix' under Hamming distance to generate a dendrogram. The third stage is to cut the dendrogram to get K^* clusters where K^* ≥ K and then prune back to K to give a final clustering. A variant on our technique also gives a reasonable estimate for K_T, the true number of clusters. We provide a series of arguments to justify the steps in the stages of our methods and we provide numerous examples involving real and simulated data to compare our technique with other related techniques.

READ FULL TEXT
research
05/13/2015

Hybrid data clustering approach using K-Means and Flower Pollination Algorithm

Data clustering is a technique for clustering set of objects into known ...
research
12/06/2021

Piano Timbre Development Analysis using Machine Learning

A data set of recorded single played tones of a concert grand piano is i...
research
03/26/2020

A Two-Stage Reconstruction of Microstructures with Arbitrarily Shaped Inclusions

The main goal of our research is to develop an effective method with a w...
research
05/13/2022

DRBM-ClustNet: A Deep Restricted Boltzmann-Kohonen Architecture for Data Clustering

A Bayesian Deep Restricted Boltzmann-Kohonen architecture for data clust...
research
03/01/2017

Phylogenetic Tools in Astrophysics

Multivariate clustering in astrophysics is a recent development justifie...
research
08/01/2017

Deriving Verb Predicates By Clustering Verbs with Arguments

Hand-built verb clusters such as the widely used Levin classes (Levin, 1...
research
10/14/2020

Africa 3: A Continental Network Model to Enable the African Fourth Industrial Revolution

It is widely recognised that collaboration can help fast-track the devel...

Please sign up or login with your details

Forgot password? Click here to reset