Are Easy Data Easy (for K-Means)

08/02/2023
by   Mieczysław A. Kłopotek, et al.
0

This paper investigates the capability of correctly recovering well-separated clusters by various brands of the k-means algorithm. The concept of well-separatedness used here is derived directly from the common definition of clusters, which imposes an interplay between the requirements of within-cluster-homogenicity and between-clusters-diversity. Conditions are derived for a special case of well-separated clusters such that the global minimum of k-means cost function coincides with the well-separatedness. An experimental investigation is performed to find out whether or no various brands of k-means are actually capable of discovering well separated clusters. It turns out that they are not. A new algorithm is proposed that is a variation of k-means++ via repeated subsampling when choosing a seed. The new algorithm outperforms four other algorithms from k-means family on the task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/08/2012

Robust seed selection algorithm for k-means type algorithms

Selection of initial seeds greatly affects the quality of the clusters a...
research
11/12/2017

K-groups: A Generalization of K-means Clustering

We propose a new class of distribution-based clustering algorithms, call...
research
02/16/2020

Structures of Spurious Local Minima in k-means

k-means clustering is a fundamental problem in unsupervised learning. Th...
research
06/09/2021

On Clusters that are Separated but Large

Given a set P of n points in ^d, consider the problem of computing k sub...
research
05/18/2017

Discovering the Graph Structure in the Clustering Results

In a standard cluster analysis, such as k-means, in addition to clusters...
research
05/10/2013

Performance Enhancement of Distributed Quasi Steady-State Genetic Algorithm

This paper proposes a new scheme for performance enhancement of distribu...
research
06/28/2020

Breathing k-Means

We propose a new algorithm for the k-means problem which repeatedly incr...

Please sign up or login with your details

Forgot password? Click here to reset