An efficient density-based clustering algorithm using reverse nearest neighbour

11/19/2018
by   Stiphen Chowdhury, et al.
0

Density-based clustering is the task of discovering high-density regions of entities (clusters) that are separated from each other by contiguous regions of low-density. DBSCAN is, arguably, the most popular density-based clustering algorithm. However, its cluster recovery capabilities depend on the combination of the two parameters. In this paper we present a new density-based clustering algorithm which uses reverse nearest neighbour (RNN) and has a single parameter. We also show that it is possible to estimate a good value for this parameter using a clustering validity index. The RNN queries enable our algorithm to estimate densities taking more than a single entity into account, and to recover clusters that are not well-separated or have different densities. Our experiments on synthetic and real-world data sets show our proposed algorithm outperforms DBSCAN and its recent variant ISDBSCAN.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/02/2023

SDC-HSDD-NDSA: Structure Detecting Cluster by Hierarchical Secondary Directed Differential with Normalized Density and Self-Adaption

Density-based clustering could be the most popular clustering algorithm ...
research
11/21/2016

Effective Deterministic Initialization for k-Means-Like Methods via Local Density Peaks Searching

The k-means clustering algorithm is popular but has the following main d...
research
11/06/2019

A Hybrid Approach To Hierarchical Density-based Cluster Selection

HDBSCAN is a density-based clustering algorithm that constructs a cluste...
research
02/16/2013

Clustering validity based on the most similarity

One basic requirement of many studies is the necessity of classifying da...
research
11/06/2019

HDBSCAN(): An Alternative Cluster Extraction Method for HDBSCAN

HDBSCAN is a density-based clustering algorithm that constructs a cluste...
research
05/25/2018

COREclust: a new package for a robust and scalable analysis of complex data

In this paper, we present a new R package COREclust dedicated to the det...

Please sign up or login with your details

Forgot password? Click here to reset