Communication-Optimal Distributed Dynamic Graph Clustering

11/14/2018
by   Chun Jiang Zhu, et al.

We consider the problem of clustering graph nodes over large-scale dynamic graphs, such as citation networks, images and web networks, when graph updates such as node/edge insertions/deletions are observed distributively. We propose communication-efficient algorithms for two well-established communication models, namely the message passing and the blackboard models. Given a graph with n nodes that is observed at s remote sites over time [1,t], the two proposed algorithms have communication costs Õ(ns) in the message passing model and Õ(n+s) in the blackboard model (Õ hides a polylogarithmic factor), almost matching the respective lower bounds Ω(ns) and Ω(n+s). More importantly, we prove that at each time point in [1,t] our algorithms produce a clustering whose quality is nearly as good as that obtained by centralizing all updates up to that time and then applying a standard centralized clustering algorithm. We conducted extensive experiments on both synthetic and real-life datasets, which confirmed the communication efficiency of our approach over baseline algorithms while achieving comparable clustering results.
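To get a feel for the gap between the two bounds stated above, the following minimal sketch evaluates only their leading terms, ns and n+s, for one illustrative choice of n and s. The function names and the sample values are assumptions made for illustration; the sketch ignores the polylogarithmic factors hidden by Õ and all constants, and it does not implement the clustering algorithms themselves.

```python
def message_passing_leading_term(n: int, s: int) -> int:
    """Leading term n*s of the O~(ns) message passing bound (polylog factors dropped)."""
    return n * s


def blackboard_leading_term(n: int, s: int) -> int:
    """Leading term n+s of the O~(n+s) blackboard bound (polylog factors dropped)."""
    return n + s


# Hypothetical sizes chosen for illustration only: one million nodes, 100 sites.
n, s = 1_000_000, 100

mp = message_passing_leading_term(n, s)
bb = blackboard_leading_term(n, s)

print(f"message passing ~ {mp:,}")
print(f"blackboard      ~ {bb:,}")
print(f"ratio           ~ {mp / bb:.1f}x")
```

Under these assumed sizes the message passing term is roughly s times larger than the blackboard term, which is the general pattern whenever n is much larger than s.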


research
02/19/2023

Communication-Efficient Distributed Graph Clustering and Sparsification under Duplication Models

In this paper, we consider the problem of clustering graph nodes and spa...
research
08/17/2023

Half-Hop: A graph upsampling approach for slowing down message passing

Message passing neural networks have shown a lot of success on graph-str...
research
11/14/2019

Graph Spanners in the Message-Passing Model

Graph spanners are sparse subgraphs which approximately preserve all pai...
research
04/29/2018

Learning Data Dependency with Communication Cost

In this paper, we consider the problem of recovering a graph that repres...
research
06/23/2020

Optimised allgatherv, reduce_scatter and allreduce communication in message-passing systems

Collective communications, namely the patterns allgatherv, reduce_scatte...
research
03/12/2018

Clustering with Simultaneous Local and Global View of Data: A message passing based approach

A good clustering algorithm should not only be able to discover clusters...
research
09/28/2016

StruClus: Structural Clustering of Large-Scale Graph Databases

We present a structural clustering algorithm for large-scale datasets of...
