Parametric entropy based Cluster Centriod Initialization for k-means clustering of various Image datasets

08/15/2023
by   Faheem Hussayn, et al.
0

One of the most employed yet simple algorithm for cluster analysis is the k-means algorithm. k-means has successfully witnessed its use in artificial intelligence, market segmentation, fraud detection, data mining, psychology, etc., only to name a few. The k-means algorithm, however, does not always yield the best quality results. Its performance heavily depends upon the number of clusters supplied and the proper initialization of the cluster centroids or seeds. In this paper, we conduct an analysis of the performance of k-means on image data by employing parametric entropies in an entropy based centroid initialization method and propose the best fitting entropy measures for general image datasets. We use several entropies like Taneja entropy, Kapur entropy, Aczel Daroczy entropy, Sharma Mittal entropy. We observe that for different datasets, different entropies provide better results than the conventional methods. We have applied our proposed algorithm on these datasets: Satellite, Toys, Fruits, Cars, Brain MRI, Covid X-Ray.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/04/2021

Entropy K-Means Clustering With Feature Reduction Under Unknown Number of Clusters

The k-means algorithm with its extensions is the most used clustering me...
research
04/11/2010

SAR Image Segmentation using Vector Quantization Technique on Entropy Images

The development and application of various remote sensing platforms resu...
research
09/24/2009

Initialization Free Graph Based Clustering

This paper proposes an original approach to cluster multi-component data...
research
11/27/2019

Adaptive Initialization Method for K-means Algorithm

The K-means algorithm is a widely used clustering algorithm that offers ...
research
06/06/2020

An Efficient k-modes Algorithm for Clustering Categorical Datasets

Mining clusters from datasets is an important endeavor in many applicati...
research
09/10/2012

A Comparative Study of Efficient Initialization Methods for the K-Means Clustering Algorithm

K-means is undoubtedly the most widely used partitional clustering algor...

Please sign up or login with your details

Forgot password? Click here to reset