Massively Parallel and Dynamic Algorithms for Minimum Size Clustering

06/04/2021
by   Alessandro Epasto, et al.
0

In this paper, we study the r-gather problem, a natural formulation of minimum-size clustering in metric spaces. The goal of r-gather is to partition n points into clusters such that each cluster has size at least r, and the maximum radius of the clusters is minimized. This additional constraint completely changes the algorithmic nature of the problem, and many clustering techniques fail. Also previous dynamic and parallel algorithms do not achieve desirable complexity. We propose algorithms both in the Massively Parallel Computation (MPC) model and in the dynamic setting. Our MPC algorithm handles input points from the Euclidean space ℝ^d. It computes an O(1)-approximate solution of r-gather in O(log^ε n) rounds using total space O(n^1+γ· d) for arbitrarily small constants ε,γ > 0. In addition our algorithm is fully scalable, i.e., there is no lower bound on the memory per machine. Our dynamic algorithm maintains an O(1)-approximate r-gather solution under insertions/deletions of points in a metric space with doubling dimension d. The update time is r · 2^O(d)·log^O(1)Δ and the query time is 2^O(d)·log^O(1)Δ, where Δ is the ratio between the largest and the smallest distance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/12/2023

On Parallel k-Center Clustering

We consider the classic k-center problem in a parallel setting, on the l...
research
07/15/2023

Fully Scalable MPC Algorithms for Clustering in High Dimension

We design new algorithms for k-clustering in high-dimensional Euclidean ...
research
07/02/2023

Massively Parallel Algorithms for the Stochastic Block Model

Learning the community structure of a large-scale graph is a fundamental...
research
04/26/2022

Polylogarithmic Sketches for Clustering

Given n points in ℓ_p^d, we consider the problem of partitioning points ...
research
12/13/2021

Optimal Fully Dynamic k-Centers Clustering

We present the first algorithm for fully dynamic k-centers clustering in...
research
08/01/2023

Massively Parallel Algorithms for High-Dimensional Euclidean Minimum Spanning Tree

We study the classic Euclidean Minimum Spanning Tree (MST) problem in th...
research
07/13/2023

Breaking 3-Factor Approximation for Correlation Clustering in Polylogarithmic Rounds

In this paper, we study parallel algorithms for the correlation clusteri...

Please sign up or login with your details

Forgot password? Click here to reset