An Improved Probability Propagation Algorithm for Density Peak Clustering Based on Natural Nearest Neighborhood

07/04/2022
by   Wendi Zuo, et al.
0

Clustering by fast search and find of density peaks (DPC) (Since, 2014) has been proven to be a promising clustering approach that efficiently discovers the centers of clusters by finding the density peaks. The accuracy of DPC depends on the cutoff distance (d_c), the cluster number (k) and the selection of the centers of clusters. Moreover, the final allocation strategy is sensitive and has poor fault tolerance. The shortcomings above make the algorithm sensitive to parameters and only applicable for some specific datasets. To overcome the limitations of DPC, this paper presents an improved probability propagation algorithm for density peak clustering based on the natural nearest neighborhood (DPC-PPNNN). By introducing the idea of natural nearest neighborhood and probability propagation, DPC-PPNNN realizes the nonparametric clustering process and makes the algorithm applicable for more complex datasets. In experiments on several datasets, DPC-PPNNN is shown to outperform DPC, K-means and DBSCAN.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/11/2020

Clustering of Big Data with Mixed Features

Clustering large, mixed data is a central problem in data mining. Many a...
research
04/10/2023

FINEX: A Fast Index for Exact Flexible Density-Based Clustering (Extended Version with Proofs)*

Density-based clustering aims to find groups of similar objects (i.e., c...
research
11/23/2019

A Domain Adaptive Density Clustering Algorithm for Data with Varying Density Distribution

As one type of efficient unsupervised learning methods, clustering algor...
research
07/25/2022

On Mitigating Hard Clusters for Face Clustering

Face clustering is a promising way to scale up face recognition systems ...
research
03/02/2022

A density peaks clustering algorithm with sparse search and K-d tree

Density peaks clustering has become a nova of clustering algorithm becau...
research
02/11/2019

A Distributed and Approximated Nearest Neighbors Algorithm for an Efficient Large Scale Mean Shift Clustering

In this paper we target the class of modal clustering methods where clus...
research
05/24/2023

Graphy Analysis Using a GPU-based Parallel Algorithm: Quantum Clustering

The article introduces a new method for applying Quantum Clustering to g...

Please sign up or login with your details

Forgot password? Click here to reset