Robust Correlation Clustering with Asymmetric Noise

10/15/2021
by   Jimit Majmudar, et al.
0

Graph clustering problems typically aim to partition the graph nodes such that two nodes belong to the same partition set if and only if they are similar. Correlation Clustering is a graph clustering formulation which: (1) takes as input a signed graph with edge weights representing a similarity/dissimilarity measure between the nodes, and (2) requires no prior estimate of the number of clusters in the input graph. However, the combinatorial optimization problem underlying Correlation Clustering is NP-hard. In this work, we propose a novel graph generative model, called the Node Factors Model (NFM), which is based on generating feature vectors/embeddings for the graph nodes. The graphs generated by the NFM contain asymmetric noise in the sense that there may exist pairs of nodes in the same cluster which are negatively correlated. We propose a novel Correlation Clustering algorithm, called , using techniques from semidefinite programming. Using a combination of theoretical and computational results, we demonstrate that recovers nodes with sufficiently strong cluster membership in graph instances generated by the NFM, thereby making progress towards establishing the provable robustness of our proposed algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/11/2021

Local Correlation Clustering with Asymmetric Classification Errors

In the Correlation Clustering problem, we are given a complete weighted ...
research
11/23/2021

A Modular Framework for Centrality and Clustering in Complex Networks

The structure of many complex networks includes edge directionality and ...
research
08/11/2021

Correlation Clustering with Asymmetric Classification Errors

In the Correlation Clustering problem, we are given a weighted graph G w...
research
12/05/2014

An iterative step-function estimator for graphons

Exchangeable graphs arise via a sampling procedure from measurable funct...
research
08/29/2022

A Distributed Multilevel Memetic Algorithm for Signed Graph Clustering

In real-world applications, interactions between two entities can be usu...
research
01/28/2023

ClusterFuG: Clustering Fully connected Graphs by Multicut

We propose a graph clustering formulation based on multicut (a.k.a. weig...
research
03/02/2022

Near-Optimal Correlation Clustering with Privacy

Correlation clustering is a central problem in unsupervised learning, wi...

Please sign up or login with your details

Forgot password? Click here to reset