Faster Deterministic Approximation Algorithms for Correlation Clustering and Cluster Deletion

11/20/2021
by   Nate Veldt, et al.
0

Correlation clustering is a framework for partitioning datasets based on pairwise similarity and dissimilarity scores, and has been used for diverse applications in bioinformatics, social network analysis, and computer vision. Although many approximation algorithms have been designed for this problem, the best theoretical results rely on obtaining lower bounds via expensive linear programming relaxations. In this paper we prove new relationships between correlation clustering problems and edge labeling problems related to the principle of strong triadic closure. We use these connections to develop new approximation algorithms for correlation clustering that have deterministic constant factor approximation guarantees and avoid the canonical linear programming relaxation. Our approach also extends to a variant of correlation clustering called cluster deletion, that strictly prohibits placing negative edges inside clusters. Our results include 4-approximation algorithms for cluster deletion and correlation clustering, based on simplified linear programs with far fewer constraints than the canonical relaxations. More importantly, we develop faster techniques that are purely combinatorial, based on computing maximal matchings in certain auxiliary graphs and hypergraphs. This leads to a combinatorial 6-approximation for complete unweighted correlation clustering, which is the best deterministic result for any method that does not rely on linear programming. We also present the first combinatorial constant factor approximation for cluster deletion.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/08/2023

Faster Approximation Algorithms for Parameterized Graph Clustering and Edge Labeling

Graph clustering is a fundamental task in network analysis where the goa...
research
09/25/2018

Correlation Clustering Generalized

We present new results for LambdaCC and MotifCC, two recently introduced...
research
10/22/2019

Hypergraph clustering with categorical edge labels

Graphs and networks are a standard model for describing data or systems ...
research
01/30/2023

Fast Combinatorial Algorithms for Min Max Correlation Clustering

We introduce fast algorithms for correlation clustering with respect to ...
research
10/14/2019

Graph Clustering in All Parameter Regimes

Resolution parameters in graph clustering represent a size and quality t...
research
07/16/2020

A Tight Approximation Algorithm for the Cluster Vertex Deletion Problem

We give the first 2-approximation algorithm for the cluster vertex delet...
research
10/23/2017

Near-Optimal Clustering in the k-machine model

The clustering problem, in its many variants, has numerous applications ...

Please sign up or login with your details

Forgot password? Click here to reset