HCA-DBSCAN: HyperCube Accelerated Density Based Spatial Clustering for Applications with Noise

12/01/2019
by   Vinayak Mathur, et al.
0

Density-based clustering has found numerous applications across various domains. The Density-Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm is capable of finding clusters of varied shapes that are not linearly separable, at the same time it is not sensitive to outliers in the data. Combined with the fact that the number of clusters in the data are not required apriori makes DBSCAN really powerfully. Slower performance (O(n2)) limits its applications. In this work, we present a new clustering algorithm, the HyperCube Accelerated DBSCAN(HCA-DBSCAN) which uses a combination of distance-based aggregation by overlaying the data with customized grids. We use representative points to reduce the number of comparisons that need to be computed. Experimental results show that the proposed algorithm achieves a significant run time speedup of up to 58.27 improvements that try to reduce the time complexity of theDBSCAN algorithm

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset