On the Efficiency of K-Means Clustering: Evaluation, Optimization, and Algorithm Selection

10/13/2020
by   Sheng Wang, et al.
0

This paper presents a thorough evaluation of the existing methods that accelerate Lloyd's algorithm for fast k-means clustering. To do so, we analyze the pruning mechanisms of existing methods, and summarize their common pipeline into a unified evaluation framework UniK. UniK embraces a class of well-known methods and enables a fine-grained performance breakdown. Within UniK, we thoroughly evaluate the pros and cons of existing methods using multiple performance metrics on a number of datasets. Furthermore, we derive an optimized algorithm over UniK, which effectively hybridizes multiple existing methods for more aggressive pruning. To take this further, we investigate whether the most efficient method for a given clustering task can be automatically selected by machine learning, to benefit practitioners and researchers.

READ FULL TEXT

page 7

page 9

page 15

page 16

page 17

page 18

research
05/24/2023

Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis

Large Language Models (LLMs) have demonstrated great capabilities in sol...
research
10/13/2020

Coarse and fine-grained automatic cropping deep convolutional neural network

The existing convolutional neural network pruning algorithms can be divi...
research
04/14/2023

CiPR: An Efficient Framework with Cross-instance Positive Relations for Generalized Category Discovery

We tackle the issue of generalized category discovery (GCD). GCD conside...
research
10/04/2018

SNIP: Single-shot Network Pruning based on Connection Sensitivity

Pruning large neural networks while maintaining the performance is often...
research
07/08/2022

Evaluating Systemic Error Detection Methods using Synthetic Images

We introduce SpotCheck, a framework for generating synthetic datasets to...
research
08/16/2019

Parallel Computation of Alpha Complex for Biomolecules

Alpha complex, a subset of the Delaunay triangulation, has been extensiv...
research
09/12/2022

Open-Domain Dialog Evaluation using Follow-Ups Likelihood

Automatic evaluation of open-domain dialogs remains an unsolved problem....

Please sign up or login with your details

Forgot password? Click here to reset