The Power of Uniform Sampling for k-Median

02/22/2023
by   Lingxiao Huang, et al.
0

We study the power of uniform sampling for k-Median in various metric spaces. We relate the query complexity for approximating k-Median, to a key parameter of the dataset, called the balancedness β∈ (0, 1] (with 1 being perfectly balanced). We show that any algorithm must make Ω(1 / β) queries to the point set in order to achieve O(1)-approximation for k-Median. This particularly implies existing constructions of coresets, a popular data reduction technique, cannot be query-efficient. On the other hand, we show a simple uniform sample of poly(k ϵ^-1β^-1) points suffices for (1 + ϵ)-approximation for k-Median for various metric spaces, which nearly matches the lower bound. We conduct experiments to verify that in many real datasets, the balancedness parameter is usually well bounded, and that the uniform sampling performs consistently well even for the case with moderately large balancedness, which justifies that uniform sampling is indeed a viable approach for solving k-Median.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/19/2018

Approximation Schemes for Capacitated Clustering in Doubling Metrics

Motivated by applications in redistricting, we consider the uniform capa...
research
01/05/2022

Deterministic metric 1-median selection with very few queries

Given an n-point metric space (M,d), metric 1-median asks for a point p∈...
research
04/28/2019

Tight FPT Approximations for k-Median and k-Means

We investigate the fine-grained complexity of approximating the classica...
research
04/06/2023

Parameterized Approximation Schemes for Clustering with General Norm Objectives

This paper considers the well-studied algorithmic regime of designing a ...
research
06/05/2023

Near-Optimal Quantum Coreset Construction Algorithms for Clustering

k-Clustering in ℝ^d (e.g., k-median and k-means) is a fundamental machin...
research
06/23/2022

The quarter median

We introduce and discuss a multivariate version of the classical median ...
research
03/16/2021

A refined continuity correction for the negative binomial distribution and asymptotics of the median

In this paper, we prove a local limit theorem and a refined continuity c...

Please sign up or login with your details

Forgot password? Click here to reset