A Fast Approximation Scheme for Low-Dimensional k-Means

08/24/2017
by   Vincent Cohen-Addad, et al.
0

We consider the popular k-means problem in d-dimensional Euclidean space. Recently Friggstad, Rezapour, Salavatipour [FOCS'16] and Cohen-Addad, Klein, Mathieu [FOCS'16] showed that the standard local search algorithm yields a (1+ϵ)-approximation in time (n · k)^1/ϵ^O(d), giving the first polynomial-time approximation scheme for the problem in low-dimensional Euclidean space. While local search achieves optimal approximation guarantees, it is not competitive with the state-of-the-art heuristics such as the famous k-means++ and D^2-sampling algorithms. In this paper, we aim at bridging the gap between theory and practice by giving a (1+ϵ)-approximation algorithm for low-dimensional k-means running in time n · k · ( n)^(dϵ^-1)^O(d), and so matching the running time of the k-means++ and D^2-sampling heuristics up to polylogarithmic factors. We speed-up the local search approach by making a non-standard use of randomized dissections that allows to find the best local move efficiently using a quite simple dynamic program. We hope that our techniques could help design better local search heuristics for geometric problems. We note that the doubly exponential dependency on d is necessary as k-means is APX-hard in dimension d = ω( n).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/29/2016

Local Search Yields a PTAS for k-Means in Doubling Metrics

The most well known and ubiquitous clustering problem encountered in nea...
research
03/12/2009

Adaptive Mesh Approach for Predicting Algorithm Behavior with Application to Visibility Culling in Computer Graphics

We propose a concise approximate description, and a method for efficient...
research
04/06/2022

Efficient Bayesian Network Structure Learning via Parameterized Local Search on Topological Orderings

In Bayesian Network Structure Learning (BNSL), one is given a variable s...
research
02/18/2020

k-means++: few more steps yield constant approximation

The k-means++ algorithm of Arthur and Vassilvitskii (SODA 2007) is a sta...
research
03/21/2011

Informed Heuristics for Guiding Stem-and-Cycle Ejection Chains

The state of the art in local search for the Traveling Salesman Problem ...
research
04/09/2021

Stagnation Detection in Highly Multimodal Fitness Landscapes

Stagnation detection has been proposed as a mechanism for randomized sea...
research
02/25/2020

The Power of Recourse: Better Algorithms for Facility Location in Online and Dynamic Models

In this paper we study the facility location problem in the online with ...

Please sign up or login with your details

Forgot password? Click here to reset