Correlation Clustering in Constant Many Parallel Rounds

06/15/2021
by   Vincent Cohen-Addad, et al.
0

Correlation clustering is a central topic in unsupervised learning, with many applications in ML and data mining. In correlation clustering, one receives as input a signed graph and the goal is to partition it to minimize the number of disagreements. In this work we propose a massively parallel computation (MPC) algorithm for this problem that is considerably faster than prior work. In particular, our algorithm uses machines with memory sublinear in the number of nodes in the graph and returns a constant approximation while running only for a constant number of rounds. To the best of our knowledge, our algorithm is the first that can provably approximate a clustering problem on graphs using only a constant number of MPC rounds in the sublinear memory regime. We complement our analysis with an experimental analysis of our techniques.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/16/2022

A Parallel Algorithm for (3 + ε)-Approximate Correlation Clustering

Grouping together similar elements in datasets is a common task in data ...
research
03/02/2022

Near-Optimal Correlation Clustering with Privacy

Correlation clustering is a central problem in unsupervised learning, wi...
research
05/07/2022

Almost 3-Approximate Correlation Clustering in Constant Rounds

We study parallel algorithms for correlation clustering. Each pair among...
research
07/13/2023

Breaking 3-Factor Approximation for Correlation Clustering in Polylogarithmic Rounds

In this paper, we study parallel algorithms for the correlation clusteri...
research
11/03/2017

Distributed Graph Clustering and Sparsification

Graph clustering is a fundamental computational problem with a number of...
research
09/14/2023

Massively-Parallel Heat Map Sorting and Applications To Explainable Clustering

Given a set of points labeled with k labels, we introduce the heat map s...
research
02/23/2021

Massively Parallel Correlation Clustering in Bounded Arboricity Graphs

Identifying clusters of similar elements in a set is a common objective ...

Please sign up or login with your details

Forgot password? Click here to reset