Improving The Performance Of The K-means Algorithm

05/10/2020
by   Tien-Dung Nguyen, et al.
0

The Incremental K-means (IKM), an improved version of K-means (KM), was introduced to improve the clustering quality of KM significantly. However, the speed of IKM is slower than KM. My thesis proposes two algorithms to speed up IKM while remaining the quality of its clustering result approximately. The first algorithm, called Divisive K-means, improves the speed of IKM by speeding up its splitting process of clusters. Testing with UCI Machine Learning data sets, the new algorithm achieves the empirically global optimum as IKM and has lower complexity, O(k*log_2k*n), than IKM, O(k^2n). The second algorithm, called Parallel Two-Phase K-means (Par2PK-means), parallelizes IKM by employing the model of Two-Phase K-means. Testing with large data sets, this algorithm attains a good speedup ratio, closing to the linearly speed-up ratio.

READ FULL TEXT
research
07/09/2020

Modified Possibilistic Fuzzy C-Means Algorithm for Clustering Incomplete Data Sets

Possibilistic fuzzy c-means (PFCM) algorithm is a reliable algorithm has...
research
02/08/2012

Robust seed selection algorithm for k-means type algorithms

Selection of initial seeds greatly affects the quality of the clusters a...
research
03/30/2021

Structured Inverted-File k-Means Clustering for High-Dimensional Sparse Data

This paper presents an architecture-friendly k-means clustering algorith...
research
02/12/2022

Towards Continuous Consistency Axiom

Development of new algorithms in the area of machine learning, especiall...
research
03/22/2017

Clustering for Different Scales of Measurement - the Gap-Ratio Weighted K-means Algorithm

This paper describes a method for clustering data that are spread out ov...
research
12/22/2020

Fast and Accurate k-means++ via Rejection Sampling

k-means++ <cit.> is a widely used clustering algorithm that is easy to i...
research
10/15/2015

Sparsity-aware Possibilistic Clustering Algorithms

In this paper two novel possibilistic clustering algorithms are presente...

Please sign up or login with your details

Forgot password? Click here to reset