Global k-means++: an effective relaxation of the global k-means clustering algorithm

11/22/2022
by   Georgios Vardakas, et al.

The k-means algorithm is a prevalent clustering method because of its simplicity, effectiveness, and speed, but its main disadvantage is its high sensitivity to the initial positions of the cluster centers. Global k-means is a deterministic algorithm proposed to tackle the random initialization problem of k-means, but it incurs a high computational cost. It partitions the data into K clusters by solving all k-means sub-problems incrementally for k = 1, …, K. For each k-cluster problem, the method executes the k-means algorithm N times, where N is the number of data points. In this paper, we propose the global k-means++ clustering algorithm, an effective way of acquiring clustering solutions of quality akin to those of global k-means at a reduced computational load. This is achieved by exploiting the center selection probability used in the k-means++ algorithm. The proposed method has been tested and compared on various well-known real and synthetic datasets, yielding very satisfactory results in terms of clustering quality and execution speed.
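
The incremental scheme described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not spell out the procedure, so the sketch assumes that, at each step k, a small number of candidate centers is drawn with the k-means++ probability (proportional to the squared distance from each point to its nearest current center), k-means is run once per candidate, and the lowest-error result is kept. The names `lloyd`, `global_kmeans_pp`, and `n_candidates` are hypothetical.

```python
import numpy as np


def lloyd(X, centers, n_iter=100, tol=1e-8):
    """Plain k-means (Lloyd) iterations starting from the given centers."""
    centers = centers.copy()
    for _ in range(n_iter):
        # Squared distances from every point to every center: shape (N, k).
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        new_centers = centers.copy()
        for j in range(len(centers)):
            members = X[labels == j]
            if len(members) > 0:  # an empty cluster keeps its old center
                new_centers[j] = members.mean(axis=0)
        shift = np.abs(new_centers - centers).max()
        centers = new_centers
        if shift < tol:
            break
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return centers, d2.min(axis=1).sum()


def global_kmeans_pp(X, K, n_candidates=10, seed=0):
    """Illustrative sketch: solve k = 1..K incrementally, sampling candidates.

    n_candidates is an assumed parameter; global k-means would instead
    try all N data points as the new center at every step.
    """
    rng = np.random.default_rng(seed)
    centers = X.mean(axis=0, keepdims=True)  # the k = 1 solution is the mean
    errors = [((X - centers) ** 2).sum()]
    for k in range(2, K + 1):
        # k-means++ selection probability: squared distance to nearest center.
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2).min(axis=1)
        if d2.sum() == 0:  # every point already coincides with a center
            break
        probs = d2 / d2.sum()
        size = min(n_candidates, np.count_nonzero(probs))
        candidates = rng.choice(len(X), size=size, replace=False, p=probs)
        best = None
        for i in candidates:
            # Keep the k-1 solved centers and add the candidate as the k-th.
            init = np.vstack([centers, X[i]])
            result = lloyd(X, init)
            if best is None or result[1] < best[1]:
                best = result
        centers = best[0]
        errors.append(best[1])
    return centers, errors
```

With `n_candidates` much smaller than N, each step runs k-means only `n_candidates` times instead of N times, which is where the reduced computational load would come from under these assumptions. A call such as `centers, errors = global_kmeans_pp(X, K=3)` returns the final centers and the clustering error for each k solved along the way.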
