Coresets for k-Means and k-Median Clustering and their Applications

10/30/2018
by   Sariel Har-Peled, et al.
0

In this paper, we show the existence of small coresets for the problems of computing k-median and k-means clustering for points in low dimension. In other words, we show that given a point set P in ^d, one can compute a weighted set ⊆ P, of size O(k ^-dn), such that one can compute the k-median/means clustering on instead of on P, and get an (1+)-approximation. As a result, we improve the fastest known algorithms for (1+)-approximate k-means and k-median clustering. Our algorithms have linear running time for a fixed k and . In addition, we can maintain the (1+)-approximate k-median or k-means clustering of a stream when points are being only inserted, using polylogarithmic space and update time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/28/2019

Tight FPT Approximations for k-Median and k-Means

We investigate the fine-grained complexity of approximating the classica...
research
08/14/2022

Exact Exponential Algorithms for Clustering Problems

In this paper we initiate a systematic study of exact algorithms for wel...
research
02/01/2018

Sensitivity Sampling Over Dynamic Geometric Data Streams with Applications to k-Clustering

Sensitivity based sampling is crucial for constructing nearly-optimal co...
research
09/04/2018

Faster Balanced Clusterings in High Dimension

The problem of constrained clustering has attracted significant attentio...
research
10/02/2018

A Unified Framework for Clustering Constrained Data without Locality Property

In this paper, we consider a class of constrained clustering problems of...
research
12/19/2017

A remark on "Robust machine learning by median-of-means"

We explore the recent results announced in "Robust machine learning by m...
research
12/03/2019

Fast Algorithms for Geometric Consensuses

Let P be a set of n points in ^d in general position. A median hyperplan...

Please sign up or login with your details

Forgot password? Click here to reset