An enhanced method of initial cluster center selection for K-means algorithm

10/18/2022
by   Zillur Rahman, et al.
0

Clustering is one of the widely used techniques to find out patterns from a dataset that can be applied in different applications or analyses. K-means, the most popular and simple clustering algorithm, might get trapped into local minima if not properly initialized and the initialization of this algorithm is done randomly. In this paper, we propose a novel approach to improve initial cluster selection for K-means algorithm. This algorithm is based on the fact that the initial centroids must be well separated from each other since the final clusters are separated groups in feature space. The Convex Hull algorithm facilitates the computing of the first two centroids and the remaining ones are selected according to the distance from previously selected centers. To ensure the selection of one center per cluster, we use the nearest neighbor technique. To check the robustness of our proposed algorithm, we consider several real-world datasets. We obtained only 7.33 Iris, Letter, and Ruspini data respectively which proves better performance than other existing systems. The results indicate that our proposed method outperforms the conventional K means approach by accelerating the computation when the number of clusters is greater than 2.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/09/2018

Improvement of K Mean Clustering Algorithm Based on Density

The purpose of this paper is to improve the traditional K-means algorith...
research
02/16/2020

Structures of Spurious Local Minima in k-means

k-means clustering is a fundamental problem in unsupervised learning. Th...
research
03/15/2019

Tackling Initial Centroid of K-Means with Distance Part (DP-KMeans)

The initial centroid is a fairly challenging problem in the k-means meth...
research
01/09/2018

An efficient K -means clustering algorithm for massive data

The analysis of continously larger datasets is a task of major importanc...
research
11/21/2016

Effective Deterministic Initialization for k-Means-Like Methods via Local Density Peaks Searching

The k-means clustering algorithm is popular but has the following main d...
research
05/10/2013

Performance Enhancement of Distributed Quasi Steady-State Genetic Algorithm

This paper proposes a new scheme for performance enhancement of distribu...
research
05/10/2016

An efficient K-means algorithm for Massive Data

Due to the progressive growth of the amount of data available in a wide ...

Please sign up or login with your details

Forgot password? Click here to reset