Hierarchical clustering by aggregating representatives in sub-minimum-spanning-trees

11/11/2021
by Wen-Bo Xie, et al.

One of the main challenges in hierarchical clustering is how to appropriately identify the representative points in the lower level of the cluster tree, which are then used as the roots in the higher level of the cluster tree for further aggregation. However, conventional hierarchical clustering approaches adopt simple tricks to select "representative" points that might not be sufficiently representative. As a result, the constructed cluster tree suffers from poor robustness and weak reliability. To address this issue, we propose a novel hierarchical clustering algorithm that, while building the clustering dendrogram, effectively detects the representative point by scoring the reciprocal nearest data points in each sub-minimum-spanning-tree. Extensive experiments on UCI datasets show that the proposed algorithm is more accurate than other benchmarks. Moreover, by our analysis, the proposed algorithm has O(n log n) time complexity and O(log n) space complexity, indicating that it can scale to massive data with low time and storage consumption.
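
The general idea of aggregating representatives level by level can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the scoring rule, so the in-degree score used here is an assumption, as are the function names `nearest_neighbors`, `aggregate_once`, and `build_levels`. The sketch links every point to its nearest neighbor (such edges belong to a minimum spanning tree when distances are distinct, so each connected group is a small sub-tree), picks one representative from the reciprocal nearest pair in each group, and repeats on the representatives. It uses a brute-force O(n^2) neighbor search per level and therefore does not reproduce the complexity stated in the abstract.

```python
import math


def nearest_neighbors(points, idx):
    """Map each index in idx to the index of its nearest other point in idx.

    Ties go to the lowest index because idx is kept sorted and min() returns
    the first minimum it encounters.
    """
    return {
        i: min((j for j in idx if j != i),
               key=lambda j: math.dist(points[i], points[j]))
        for i in idx
    }


def aggregate_once(points, idx):
    """Run one aggregation level and return the sorted representative indices."""
    nn = nearest_neighbors(points, idx)

    # Union-find over nearest-neighbor edges: each component is one sub-tree.
    parent = {i: i for i in idx}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for i, j in nn.items():
        parent[find(i)] = find(j)

    groups = {}
    for i in idx:
        groups.setdefault(find(i), []).append(i)

    # ASSUMPTION: score a candidate by how many points name it as their
    # nearest neighbor. The paper's actual scoring rule is not given here.
    in_degree = {i: 0 for i in idx}
    for j in nn.values():
        in_degree[j] += 1

    reps = []
    for members in groups.values():
        # Candidates are the reciprocal nearest neighbors inside the group.
        reciprocal = [i for i in members if nn[nn[i]] == i] or members
        # Highest score wins; ties go to the lowest index.
        reps.append(max(reciprocal, key=lambda i: (in_degree[i], -i)))
    return sorted(reps)


def build_levels(points):
    """Return the representatives chosen at each level, bottom to top."""
    idx = list(range(len(points)))
    levels = []
    while len(idx) > 1:
        idx = aggregate_once(points, idx)
        levels.append(idx)
    return levels
```

Calling `build_levels` on a list of coordinate tuples returns one list of representative indices per level, ending with a single root. Every group contains at least two points, so the number of representatives at least halves at each level and the loop terminates.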

research
12/24/2014

An Effective Semi-supervised Divisive Clustering Algorithm

Nowadays, data are generated massively and rapidly from scientific field...
research
10/03/2018

Real-time Clustering Algorithm Based on Predefined Level-of-Similarity

This paper proposes a centroid-based clustering algorithm which is capab...
research
05/24/2023

Hierarchical clustering with dot products recovers hidden tree structure

In this paper we offer a new perspective on the well established agglome...
research
03/02/2022

A density peaks clustering algorithm with sparse search and K-d tree

Density peaks clustering has become a nova of clustering algorithm becau...
research
12/01/2022

Locally Adaptive Hierarchical Cluster Termination With Application To Individual Tree Delineation

A clustering termination procedure which is locally adaptive (with respe...
research
07/11/2014

Density Adaptive Parallel Clustering

In this paper we are going to introduce a new nearest neighbours based a...
research
07/09/2019

Hierarchical Clustering Supported by Reciprocal Nearest Neighbors

Clustering is a fundamental analysis tool aiming at classifying data poi...