A Geometric Approach to k-means

01/13/2022
by   Jiazhen Hong, et al.
0

k-means clustering is a fundamental problem in various disciplines. This problem is nonconvex, and standard algorithms are only guaranteed to find a local optimum. Leveraging the structure of local solutions characterized in [1], we propose a general algorithmic framework for escaping undesirable local solutions and recovering the global solution (or the ground truth). This framework consists of alternating between the following two steps iteratively: (i) detect mis-specified clusters in a local solution and (ii) improve the current local solution by non-local operations. We discuss implementation of these steps, and elucidate how the proposed framework unifies variants of k-means algorithm in literature from a geometric perspective. In addition, we introduce two natural extensions of the proposed framework, where the initial number of clusters is misspecified. We provide theoretical justification for our approach, which is corroborated with extensive experiments.

READ FULL TEXT

page 13

page 25

research
02/16/2020

Structures of Spurious Local Minima in k-means

k-means clustering is a fundamental problem in unsupervised learning. Th...
research
04/25/2018

HG-means: A scalable hybrid genetic algorithm for minimum sum-of-squares clustering

Minimum sum-of-squares clustering (MSSC) is a widely used clustering mod...
research
11/21/2016

Effective Deterministic Initialization for k-Means-Like Methods via Local Density Peaks Searching

The k-means clustering algorithm is popular but has the following main d...
research
12/15/2021

Sample-Efficient Sparse Phase Retrieval via Stochastic Alternating Minimization

In this work we propose a nonconvex two-stage stochastic alternating min...
research
05/05/2018

Cluster-based trajectory segmentation with local noise

We present a framework for the partitioning of a spatial trajectory in a...
research
10/12/2020

Exploiting Local Optimality in Metaheuristic Search

A variety of strategies have been proposed for overcoming local optimali...

Please sign up or login with your details

Forgot password? Click here to reset