BFS based distributed algorithm for parallel local directed sub-graph enumeration

01/06/2022
by   Itay Levinas, et al.
0

Estimating the frequency of sub-graphs is of importance for many tasks, including sub-graph isomorphism, kernel-based anomaly detection, and network structure analysis. While multiple algorithms were proposed for full enumeration or sampling-based estimates, these methods fail in very large graphs. Recent advances in parallelization allow for estimates of total sub-graphs counts in very large graphs. The task of counting the frequency of each sub-graph associated with each vertex also received excellent solutions for undirected graphs. However, there is currently no good solution for very large directed graphs. We here propose VDMC (Vertex specific Distributed Motif Counting) – a fully distributed algorithm to optimally count all the 3 and 4 vertices connected directed graphs (sub-graph motifs) associated with each vertex of a graph. VDMC counts each motif only once and its efficacy is linear in the number of counted motifs. It is fully parallelized to be efficient in GPU-based computation. VDMC is based on three main elements: 1) Ordering the vertices and only counting motifs containing increasing order vertices, 2) sub-ordering motifs based on the average length of the BFS composing the motif, and 3) removing isomorphisms only once for the entire graph. We here compare VDMC to analytical estimates of the expected number of motifs and show its accuracy. VDMC is available as a highly efficient CPU and GPU code with a novel data structure for efficient graph manipulation. We show the efficacy of VDMC and real-world graphs. VDMC allows for the precise analysis of sub-graph frequency around each vertex in large graphs and opens the way for the extension of methods until now limited to graphs of thousands of edges to graphs with millions of edges and above. GIT: https://github.com/louzounlab/graph-measures

READ FULL TEXT
research
03/26/2023

Asymptotic analysis and efficient random sampling of directed ordered acyclic graphs

Directed acyclic graphs (DAGs) are directed graphs in which there is no ...
research
01/29/2021

Counting orientations of graphs with no strongly connected tournaments

Let S_k(n) be the maximum number of orientations of an n-vertex graph G ...
research
04/20/2022

Scalable Motif Counting for Large-scale Temporal Graphs

One fundamental problem in temporal graph analysis is to count the occur...
research
05/04/2018

Connected greedy colouring in claw-free graphs

An ordering of the vertices of a graph is connected if every vertex (but...
research
05/27/2020

Counting graph orientations with no directed triangles

Alon and Yuster proved that the number of orientations of any n-vertex g...
research
02/12/2018

DiSLR: Distributed Sampling with Limited Redundancy For Triangle Counting in Graph Streams

Given a web-scale graph that grows over time, how should its edges be st...
research
04/18/2018

The Graph Exploration Problem with Advice

Moving an autonomous agent through an unknown environment is one of the ...

Please sign up or login with your details

Forgot password? Click here to reset