Accelerating Spherical k-Means

07/08/2021
by Erich Schubert, et al.

Spherical k-means is a widely used clustering algorithm for sparse and high-dimensional data such as document vectors. While several improvements and accelerations have been introduced for the original k-means algorithm, not all of them translate easily to the spherical variant: many acceleration techniques, such as the algorithms of Elkan and Hamerly, rely on the triangle inequality of Euclidean distances. Spherical k-means, however, uses cosine similarities instead of distances for computational efficiency. In this paper, we incorporate the Elkan and Hamerly accelerations into the spherical k-means algorithm, working directly with cosine similarities instead of Euclidean distances to obtain a substantial speedup, and we evaluate these spherical accelerations on real data.
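To make the idea of bounding with cosines concrete, here is a minimal sketch in Python with NumPy. It is not the paper's implementation, and the abstract does not state which bound the authors use. The sketch assumes the standard bound that follows from the triangle inequality on angles between unit vectors: sim(x,y)·sim(y,z) − s ≤ sim(x,z) ≤ sim(x,y)·sim(y,z) + s, with s = sqrt((1 − sim(x,y)²)(1 − sim(y,z)²)). The function names `cosine_bounds` and `normalize`, the random data, and the pruning test are all illustrative assumptions.

```python
import numpy as np

def cosine_bounds(sim_xy, sim_yz):
    """Lower and upper bounds on sim(x, z) for unit vectors x, y, z.

    Assumption: this is the angle-based bound cos(a + b) <= sim(x, z) <= cos(a - b),
    rewritten in terms of cosines only. It is not quoted from the abstract.
    """
    # Clip to guard against rounding slightly outside [-1, 1].
    a = np.clip(sim_xy, -1.0, 1.0)
    b = np.clip(sim_yz, -1.0, 1.0)
    slack = np.sqrt((1.0 - a * a) * (1.0 - b * b))
    return a * b - slack, a * b + slack

def normalize(rows):
    """Scale each row to unit length so dot products are cosines."""
    return rows / np.linalg.norm(rows, axis=1, keepdims=True)

# Hypothetical toy data: 200 points and 5 centers in 16 dimensions.
rng = np.random.default_rng(0)
points = normalize(rng.normal(size=(200, 16)))
centers = normalize(rng.normal(size=(5, 16)))

sim_pc = points @ centers.T    # point-to-center cosines (computed fully here, only to check the bounds)
sim_cc = centers @ centers.T   # center-to-center cosines
assigned = sim_pc.argmax(axis=1)
best = sim_pc[np.arange(len(points)), assigned]

# Bound sim(point, other center) using only sim(point, assigned center)
# and sim(assigned center, other center).
lower, upper = cosine_bounds(best[:, None], sim_cc[assigned])

# If the upper bound is below the current best similarity, that center
# cannot win, so an accelerated algorithm could skip the exact computation.
prunable = upper < best[:, None]
print("fraction of point-center pairs that could be skipped:", prunable.mean())
```

On random data like this the fraction of skippable pairs is typically small, because random centers are nearly orthogonal to each other; the pruning test only pays off when the bounds are tight relative to the gap between the best and the other similarities.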

research
07/30/2021

Efficient Sparse Spherical k-Means for Document Clustering

Spherical k-Means is frequently used to cluster document collections bec...
research
09/01/2022

Spherical Graph Drawing by Multi-dimensional Scaling

We describe an efficient and scalable spherical graph embedding method. ...
research
11/05/2019

Closing the Training/Inference Gap for Deep Attractor Networks

This paper improves the deep attractor network (DANet) approach by closi...
research
04/05/2019

k-means clustering of extremes

The k-means clustering algorithm and its variant, the spherical k-means ...
research
10/23/2020

Detection of groups of concomitant extremes using clustering

There is a growing empirical evidence that the spherical k-means cluster...
research
10/12/2018

Faster k-Medoids Clustering: Improving the PAM, CLARA, and CLARANS Algorithms

Clustering non-Euclidean data is difficult, and one of the most used alg...
research
01/16/2013

The Anchors Hierachy: Using the triangle inequality to survive high dimensional data

This paper is about metric data structures in high-dimensional or non-Eu...
