DwarvesGraph: A High-Performance Graph Mining System with Pattern Decomposition

08/21/2020
by   Jingji Chen, et al.

Graph mining tasks, which focus on extracting structural information from subgraphs, have been gaining increasing attention. The performance of recent systems (Arabesque, RStream, AutoMine, GraphZero, Pangolin, and Peregrine) has improved significantly over time; for example, Peregrine can be up to 1317x and 2016x faster than Arabesque and RStream, respectively. However, all current systems treat each pattern to be mined as a whole. Because the computation cost can increase rapidly as the pattern size grows, the key obstacle is the size of the patterns a system can handle; for example, AutoMine fails to mine all 6-motif patterns within one week on a small graph with 7k vertices. To overcome this challenge, this paper takes a drastically different approach that focuses on reducing pattern size. We present DwarvesGraph, the first graph mining system based on pattern decomposition, which first decomposes the target pattern into several sub-patterns and then computes the count of each. The result for the target pattern can then be calculated from the sub-pattern counts at very low additional cost. Although decomposition-based algorithms have been studied for years, this paper addresses several key system challenges. Specifically, we propose: 1) an efficient search method that jointly determines the decomposition of all concrete patterns of an application, considering both the computation cost and cross-pattern computation reuse; 2) a partial symmetry breaking technique that eliminates redundant enumeration for each sub-pattern while preserving the equivalence of computation semantics; and 3) a decomposition-centric programming model with efficient support for pattern existence queries and advanced graph mining applications such as FSM. Our extensive experiments show that DwarvesGraph is orders of magnitude faster than all existing state-of-the-art systems and is capable of mining large patterns that none of them can handle.
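The general idea of computing a target pattern's count from sub-pattern counts can be illustrated with a small, generic sketch. It is not DwarvesGraph's algorithm or API: the function names, the choice of pattern (a path on four vertices, counted as edge-induced embeddings), and the decomposition around the middle edge are all assumptions made for illustration. For each middle edge (b, c), the two sub-patterns are "an edge from b to a vertex other than c" and "an edge from c to a vertex other than b"; their counts are multiplied, and the combinations in which both sub-patterns pick the same vertex (which form triangles) are subtracted.

```python
from itertools import permutations


def build_adjacency(edges):
    """Build an undirected adjacency map {vertex: set(neighbors)}."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def count_triangles(adj):
    """Count each triangle once by enumerating u < v < w."""
    total = 0
    for u in adj:
        for v in adj[u]:
            if u < v:
                total += sum(1 for w in adj[u] & adj[v] if w > v)
    return total


def count_4paths_decomposed(adj):
    """Count paths a-b-c-d (4 distinct vertices) via decomposition.

    Illustrative decomposition (an assumption, not the paper's method):
    fix the middle edge (b, c); the left sub-pattern has deg(b) - 1
    matches and the right sub-pattern has deg(c) - 1 matches.
    """
    combined = 0
    for b in adj:
        for c in adj[b]:
            if b < c:  # visit each undirected edge once
                combined += (len(adj[b]) - 1) * (len(adj[c]) - 1)
    # Invalid combinations have a == d, i.e. a, b, c form a triangle.
    # Each triangle is hit once per choice of middle edge: 3 times.
    return combined - 3 * count_triangles(adj)


def count_4paths_bruteforce(adj):
    """Reference count that enumerates the whole pattern directly."""
    found = 0
    for a, b, c, d in permutations(adj, 4):
        if b in adj[a] and c in adj[b] and d in adj[c]:
            found += 1
    return found // 2  # each path is enumerated in both directions


if __name__ == "__main__":
    # A triangle 0-1-2 with a tail 2-3-4 (hypothetical toy graph).
    graph = build_adjacency([(0, 1), (1, 2), (2, 0), (2, 3), (3, 4)])
    print(count_4paths_decomposed(graph), count_4paths_bruteforce(graph))
```

The decomposed version only touches edges and triangles, while the brute-force version enumerates all ordered 4-tuples of vertices; that gap is the kind of saving the abstract attributes to working with smaller sub-patterns, though a real system has to choose the decomposition and the correction terms for arbitrary patterns.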
