VDPC: Variational Density Peak Clustering Algorithm

12/29/2021
by   Yizhang Wang, et al.
0

The widely applied density peak clustering (DPC) algorithm makes an intuitive cluster formation assumption that cluster centers are often surrounded by data points with lower local density and far away from other data points with higher local density. However, this assumption suffers from one limitation that it is often problematic when identifying clusters with lower density because they might be easily merged into other clusters with higher density. As a result, DPC may not be able to identify clusters with variational density. To address this issue, we propose a variational density peak clustering (VDPC) algorithm, which is designed to systematically and autonomously perform the clustering task on datasets with various types of density distributions. Specifically, we first propose a novel method to identify the representatives among all data points and construct initial clusters based on the identified representatives for further analysis of the clusters' property. Furthermore, we divide all data points into different levels according to their local density and propose a unified clustering framework by combining the advantages of both DPC and DBSCAN. Thus, all the identified initial clusters spreading across different density levels are systematically processed to form the final clusters. To evaluate the effectiveness of the proposed VDPC algorithm, we conduct extensive experiments using 20 datasets including eight synthetic, six real-world and six image datasets. The experimental results show that VDPC outperforms two classical algorithms (i.e., DPC and DBSCAN) and four state-of-the-art extended DPC algorithms.

READ FULL TEXT

page 2

page 11

page 18

research
11/23/2019

A Domain Adaptive Density Clustering Algorithm for Data with Varying Density Distribution

As one type of efficient unsupervised learning methods, clustering algor...
research
07/11/2022

Fast Density-Peaks Clustering: Multicore-based Parallelization Approach

Clustering multi-dimensional points is a fundamental task in many fields...
research
07/05/2014

Homophilic Clustering by Locally Asymmetric Geometry

Clustering is indispensable for data analysis in many scientific discipl...
research
03/14/2022

Geometric reconstructions of density based clusterings

DBSCAN* and HDBSCAN* are well established density based clustering algor...
research
07/20/2015

A Parameter-free Affinity Based Clustering

Several methods have been proposed to estimate the number of clusters in...
research
09/16/2020

Robust Unsupervised Mining of Dense Sub-Graphs at Multiple Resolutions

Whereas in traditional partitional clustering, each data point belongs t...

Please sign up or login with your details

Forgot password? Click here to reset