Hierarchical Clustering Supported by Reciprocal Nearest Neighbors

07/09/2019
by Wen-Bo Xie, et al.

Clustering is a fundamental analysis tool that classifies data points into groups based on their similarity or distance. It has been applied successfully across the natural and social sciences, including biology, physics, economics, chemistry, astronomy, and psychology. Among the many existing algorithms, hierarchical clustering algorithms have a particular advantage: they provide results at different resolutions without requiring a predetermined number of clusters, and they reveal how the resulting clusters are organized. At the same time, they suffer from a variety of drawbacks and thus are either time-consuming or inaccurate. We propose a novel hierarchical clustering approach based on a simple hypothesis: two data points that are reciprocal nearest neighbors, each being the other's nearest neighbor, should be grouped in one cluster. Extensive tests on data sets across multiple domains show that our method is much faster and more accurate than the state-of-the-art benchmarks. We further extend our method to the community detection problem in real networks, achieving remarkably better results than the well-known Girvan-Newman algorithm.
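The hypothesis in the abstract can be illustrated with a small sketch. The code below is not the authors' algorithm; it only shows, under assumptions, what "reciprocal nearest" means for a handful of points. The function names (`nearest_neighbor`, `reciprocal_nearest_pairs`), the use of Euclidean distance, the brute-force search, and the sample points are all hypothetical choices made for illustration.

```python
import math


def nearest_neighbor(points, i):
    """Return the index of the point closest to points[i], excluding i itself.

    Assumption: Euclidean distance; ties go to the lowest index.
    """
    best_j, best_d = -1, math.inf
    for j, q in enumerate(points):
        if j == i:
            continue
        d = math.dist(points[i], q)
        if d < best_d:
            best_j, best_d = j, d
    return best_j


def reciprocal_nearest_pairs(points):
    """Return pairs (i, j), i < j, where i and j are each other's nearest neighbor."""
    nn = [nearest_neighbor(points, i) for i in range(len(points))]
    return [(i, j) for i, j in enumerate(nn) if i < j and nn[j] == i]


# Illustrative data: two tight pairs and one point lying between them.
points = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.2, 5.0), (2.0, 2.0)]
pairs = reciprocal_nearest_pairs(points)
print(pairs)  # [(0, 1), (2, 3)]
```

Point 4 has point 1 as its nearest neighbor, but point 1's nearest neighbor is point 0, so point 4 is not part of any reciprocal pair in this round. How such leftover points and the merged pairs are handled at later levels of the hierarchy is specified in the full paper, not in this sketch.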
