High-Performance Massive Subgraph Counting using Pipelined Adaptive-Group Communication

04/25/2018
by Langshi Chen, et al.

Subgraph counting aims to count the number of occurrences of a subgraph T (also known as a template) in a given graph G. The basic problem has found applications in diverse domains. It is known to be computationally challenging: the complexity grows as a function of both T and G. Recent applications have motivated solving such problems on massive networks with billions of vertices. In this chapter, we study the subgraph counting problem from a parallel computing perspective. We discuss efficient parallel algorithms that approximately solve subgraph counting problems by using the color-coding technique. We then present several system-level strategies that substantially improve the overall performance of the algorithm on massive subgraph counting problems. We propose: 1) a novel pipelined Adaptive-Group communication pattern to improve inter-node scalability, 2) a fine-grained pipeline design to effectively reduce the memory space of intermediate results, and 3) partitioning the neighbor lists of subgraph vertices to achieve better thread concurrency and workload balance. Experiments on an Intel Xeon E5 cluster show that our implementation achieves a 5x speedup over the state-of-the-art work while reducing peak memory utilization by a factor of 2 on large templates of 12 to 15 vertices and input graphs of 2 to 5 billion edges.
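
To make the color-coding idea mentioned above concrete, here is a minimal sequential sketch in Python. It is not the authors' implementation, and it involves none of the pipelined communication or neighbor-list partitioning described in the abstract. It assumes the simplest tree template, a path on k vertices, and the function names `count_colorful_paths` and `estimate_path_count` are hypothetical. Each trial colors the vertices uniformly at random with k colors, counts the "colorful" paths (all colors distinct) by dynamic programming over color sets, and rescales by k^k / k!, the inverse of the probability that a fixed k-vertex path is colorful.

```python
import random
from math import factorial


def count_colorful_paths(adj, colors, k):
    """Count undirected simple paths on k vertices whose colors are all distinct.

    adj    : dict mapping each vertex to a list of its neighbors (undirected graph)
    colors : mapping from vertex to a color in range(k)
    """
    # table[v][mask] = number of colorful paths that end at v and use
    # exactly the colors in the bitmask `mask`.
    table = {v: {1 << colors[v]: 1} for v in adj}
    for _ in range(k - 1):
        nxt = {v: {} for v in adj}
        for v in adj:
            bit = 1 << colors[v]
            for u in adj[v]:
                for mask, cnt in table[u].items():
                    if not mask & bit:  # the color of v is not yet on the path
                        nxt[v][mask | bit] = nxt[v].get(mask | bit, 0) + cnt
        table = nxt
    directed = sum(sum(m.values()) for m in table.values())
    # For k > 1 every undirected path is found once from each endpoint.
    return directed // 2 if k > 1 else directed


def estimate_path_count(adj, k, iterations=100, seed=0):
    """Color-coding estimate of the number of k-vertex paths in the graph."""
    rng = random.Random(seed)
    scale = k ** k / factorial(k)  # 1 / P(a fixed k-vertex path is colorful)
    total = 0
    for _ in range(iterations):
        colors = {v: rng.randrange(k) for v in adj}
        total += count_colorful_paths(adj, colors, k)
    return scale * total / iterations


if __name__ == "__main__":
    # Small hypothetical graph: a 4-cycle with one chord.
    graph = {0: [1, 3, 2], 1: [0, 2], 2: [1, 3, 0], 3: [2, 0]}
    print(estimate_path_count(graph, k=3, iterations=2000))
```

Distinct colors imply distinct vertices, so the dynamic program never needs to track which vertices a partial path has visited, only which colors it has used. That is what keeps the table size bounded by 2^k per vertex; the estimate is unbiased, and averaging over more iterations reduces its variance.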

