Coresets for Clustering in Geometric Intersection Graphs

03/02/2023
by   Sayan Bandyapadhyay, et al.
0

Designing coresets–small-space sketches of the data preserving cost of the solutions within (1±ϵ)-approximate factor–is an important research direction in the study of center-based k-clustering problems, such as k-means or k-median. Feldman and Langberg [STOC'11] have shown that for k-clustering of n points in general metrics, it is possible to obtain coresets whose size depends logarithmically in n. Moreover, such a dependency in n is inevitable in general metrics. A significant amount of recent work in the area is devoted to obtaining coresests whose sizes are independent of n (i.e., “small” coresets) for special metrics, like d-dimensional Euclidean spaces, doubling metrics, metrics of graphs of bounded treewidth, or those excluding a fixed minor. In this paper, we provide the first constructions of small coresets for k-clustering in the metrics induced by geometric intersection graphs, such as Euclidean-weighted Unit Disk/Square Graphs. These constructions follow from a general theorem that identifies two canonical properties of a graph metric sufficient for obtaining small coresets. The proof of our theorem builds on the recent work of Cohen-Addad, Saulpic, and Schwiegelshohn [STOC '21], which ensures small-sized coresets conditioned on the existence of an interesting set of centers, called “centroid set”. The main technical contribution of our work is the proof of the existence of such a small-sized centroid set for graphs that satisfy the two canonical geometric properties. The new coreset construction helps to design the first (1+ϵ)-approximation for center-based clustering problems in UDGs and USGs, that is fixed-parameter tractable in k and ϵ (FPT-AS).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/10/2019

Coresets for Clustering in Graphs of Bounded Treewidth

We initiate the study of coresets for clustering in graph metrics, i.e.,...
research
04/13/2021

A New Coreset Framework for Clustering

Given a metric space, the (k,z)-clustering problem consists of finding k...
research
05/12/2023

Parameterized Approximation for Robust Clustering in Discrete Geometric Spaces

We consider the well-studied Robust (k, z)-Clustering problem, which gen...
research
04/06/2023

Parameterized Approximation Schemes for Clustering with General Norm Objectives

This paper considers the well-studied algorithmic regime of designing a ...
research
08/24/2021

Linear-Size Universal Discretization of Geometric Center-Based Problems in Fixed Dimensions

Many geometric optimization problems can be reduced to finding points in...
research
04/16/2020

Coresets for Clustering in Excluded-minor Graphs and Beyond

Coresets are modern data-reduction tools that are widely used in data an...
research
08/24/2020

A Unified and Fine-Grained Approach for Light Spanners

Seminal works on light spanners from recent years provide near-optimal t...

Please sign up or login with your details

Forgot password? Click here to reset