α-Approximation Density-based Clustering of Multi-valued Objects

08/09/2018
by Zhilin Zhang, et al.

Multi-valued data are common in real applications. When clustering such data, most existing methods rely on sampling or aggregation, which cannot reflect the real distribution of objects and their instances and therefore fail to produce high-quality clusters. This paper introduces the α-approximation distance, which measures the connectivity between multi-valued objects by taking the distribution of their instances into account. An α-approximation density-based clustering algorithm (DBCMO) is proposed to cluster multi-valued objects efficiently using global and local R*-tree structures. To speed up the algorithm, four pruning rules are applied to the tree structures. Empirical studies on synthetic and real datasets show that DBCMO discovers multi-valued object clusters efficiently and effectively. A comparison with two existing methods further shows that DBCMO better handles a continuous decrease in cluster density and detects clusters of varying density.
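
The abstract does not give the formal definition of the α-approximation distance, so the following is only an illustrative sketch under a stated assumption: the distance between two multi-valued objects is taken to be the smallest value d such that at least a fraction α of all instance pairs lie within d of each other. The clustering routine is a plain DBSCAN-style expansion over whole objects with brute-force neighbor search; it is not DBCMO itself and uses neither the R*-trees nor the four pruning rules. The names `alpha_distance`, `cluster_objects`, `eps`, and `min_objs` are hypothetical.

```python
import math
from itertools import product


def alpha_distance(obj_a, obj_b, alpha):
    """Assumed definition (not taken from the paper): the smallest d such that
    at least a fraction alpha of the instance pairs are within distance d."""
    if not 0 < alpha <= 1:
        raise ValueError("alpha must be in (0, 1]")
    # Euclidean distance between every instance of obj_a and every instance of obj_b.
    dists = sorted(math.dist(p, q) for p, q in product(obj_a, obj_b))
    k = math.ceil(alpha * len(dists))  # number of pairs that must be covered
    return dists[k - 1]


def cluster_objects(objects, eps, min_objs, alpha):
    """DBSCAN-style clustering of multi-valued objects (brute force, no index).
    Returns one label per object; -1 marks noise."""
    n = len(objects)
    # An object always counts as its own neighbor, even when its instances are spread out.
    neighbors = [
        [j for j in range(n)
         if j == i or alpha_distance(objects[i], objects[j], alpha) <= eps]
        for i in range(n)
    ]
    labels = [None] * n  # None = unvisited
    cluster = 0
    for i in range(n):
        if labels[i] is not None:
            continue
        if len(neighbors[i]) < min_objs:
            labels[i] = -1  # provisionally noise; may later become a border object
            continue
        labels[i] = cluster
        frontier = list(neighbors[i])
        while frontier:
            j = frontier.pop()
            if labels[j] == -1:
                labels[j] = cluster  # noise reached from a core object becomes a border object
            if labels[j] is not None:
                continue
            labels[j] = cluster
            if len(neighbors[j]) >= min_objs:
                frontier.extend(neighbors[j])  # j is a core object: keep expanding
        cluster += 1
    return labels
```

Each object here is a list of coordinate tuples (its instances). Under this assumed definition, a smaller α lets two objects connect when only their closest instances are near each other, while α = 1 requires every instance pair to be within `eps`.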

