A Reachability Index for Recursive Label-Concatenated Graph Queries

03/16/2022
by Chao Zhang, et al.

Reachability queries, which check whether a path exists from a source node to a target node, are fundamental operators for querying and processing graph data. Current approaches to index-based evaluation of reachability queries focus either on plain reachability or on constraint-based reachability with alternation only. In this paper, we study for the first time the problem of index-based processing of recursive label-concatenated reachability queries, referred to as RLC queries. These queries check the existence of a path that satisfies a constraint defined by a concatenation of at most k edge labels under the Kleene plus. RLC queries arise in many practical graph database and network analysis applications, yet their evaluation remains prohibitively expensive in current graph database engines. We introduce the RLC index, the first reachability index to efficiently process RLC queries. The RLC index checks whether the source vertex can reach an intermediate vertex that can also reach the target vertex under a recursive label-concatenated constraint. We propose an indexing algorithm to build the RLC index that guarantees the soundness and completeness of query execution and avoids recording redundant index entries. Comprehensive experiments on real-world graphs show that the RLC index significantly reduces both the offline processing cost and the memory overhead of the transitive closure, while speeding up query processing by up to six orders of magnitude over online traversals. Finally, our open-source implementation of the RLC index significantly outperforms current mainstream graph engines in evaluating RLC queries.
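To make the query semantics concrete, the following is a minimal Python sketch of the online-traversal baseline that the abstract compares against. It is not the RLC index or the authors' implementation. It assumes the graph is given as a list of `(source, label, target)` triples; the function name `rlc_reachable` and that edge representation are hypothetical choices made for illustration. The search runs a breadth-first traversal over pairs of a vertex and a position in the label sequence, and accepts only when the target is reached after a whole number of repetitions of the sequence, with at least one repetition as the Kleene plus requires.

```python
from collections import deque


def rlc_reachable(edges, source, target, labels):
    """Return True if some path from source to target spells `labels`
    repeated one or more times (the Kleene plus over the concatenation).

    Illustrative online traversal only; the names and the edge-triple
    representation are assumptions, not taken from the paper.
    """
    if not labels:
        raise ValueError("labels must be a non-empty sequence")

    # Adjacency map: (vertex, label) -> list of successor vertices.
    adj = {}
    for u, label, v in edges:
        adj.setdefault((u, label), []).append(v)

    k = len(labels)
    # A search state is (vertex, number of labels consumed modulo k).
    start = (source, 0)
    seen = {start}
    queue = deque([start])

    while queue:
        vertex, pos = queue.popleft()
        for nxt in adj.get((vertex, labels[pos]), []):
            nxt_pos = (pos + 1) % k
            # Accept only after an edge has been taken and the label
            # sequence has just been completed, so at least one full
            # repetition has been matched.
            if nxt == target and nxt_pos == 0:
                return True
            state = (nxt, nxt_pos)
            if state not in seen:
                seen.add(state)
                queue.append(state)
    return False


# Hypothetical toy graph used only to exercise the sketch.
edges = [
    ("a", "knows", "b"),
    ("b", "worksFor", "c"),
    ("c", "knows", "d"),
    ("d", "worksFor", "e"),
    ("a", "knows", "e"),
]
```

On this toy graph, `rlc_reachable(edges, "a", "e", ["knows", "worksFor"])` holds because the path a, b, c, d, e spells the label pair twice, whereas the same query with target `"d"` fails because every path from a to d stops partway through a repetition. Each query of this kind explores up to k states per vertex, which is the per-query cost that a precomputed index aims to avoid.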

