Breaking 3-Factor Approximation for Correlation Clustering in Polylogarithmic Rounds

07/13/2023
by   Nairen Cao, et al.
0

In this paper, we study parallel algorithms for the correlation clustering problem, where every pair of two different entities is labeled with similar or dissimilar. The goal is to partition the entities into clusters to minimize the number of disagreements with the labels. Currently, all efficient parallel algorithms have an approximation ratio of at least 3. In comparison with the 1.994+ϵ ratio achieved by polynomial-time sequential algorithms [CLN22], a significant gap exists. We propose the first poly-logarithmic depth parallel algorithm that achieves a better approximation ratio than 3. Specifically, our algorithm computes a (2.4+ϵ)-approximate solution and uses Õ(m^1.5) work. Additionally, it can be translated into a Õ(m^1.5)-time sequential algorithm and a poly-logarithmic rounds sublinear-memory MPC algorithm with Õ(m^1.5) total memory. Our approach is inspired by Awerbuch, Khandekar, and Rao's [AKR12] length-constrained multi-commodity flow algorithm, where we develop an efficient parallel algorithm to solve a truncated correlation clustering linear program of Charikar, Guruswami, and Wirth [CGW05]. Then we show the solution of the truncated linear program can be rounded with a factor of at most 2.4 loss by using the framework of [CMSY15]. Such a rounding framework can then be implemented using parallel pivot-based approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/07/2022

Almost 3-Approximate Correlation Clustering in Constant Rounds

We study parallel algorithms for correlation clustering. Each pair among...
research
06/15/2021

Correlation Clustering in Constant Many Parallel Rounds

Correlation clustering is a central topic in unsupervised learning, with...
research
05/16/2022

A Parallel Algorithm for (3 + ε)-Approximate Correlation Clustering

Grouping together similar elements in datasets is a common task in data ...
research
06/23/2022

Hierarchical Agglomerative Graph Clustering in Poly-Logarithmic Depth

Obtaining scalable algorithms for hierarchical agglomerative clustering ...
research
06/04/2021

Massively Parallel and Dynamic Algorithms for Minimum Size Clustering

In this paper, we study the r-gather problem, a natural formulation of m...
research
07/15/2023

Fully Scalable MPC Algorithms for Clustering in High Dimension

We design new algorithms for k-clustering in high-dimensional Euclidean ...
research
01/09/2019

Network Interdiction Using Adversarial Traffic Flows

Traditional network interdiction refers to the problem of an interdictor...

Please sign up or login with your details

Forgot password? Click here to reset