A Stitch in Time Saves Nine -- SPARQL querying of Property Graphs using Gremlin Traversals

01/09/2018
by   Harsh Thakkar, et al.
0

Knowledge graphs have become popular over the past decade and frequently rely on the Resource Description Framework (RDF) or Property Graph (PG) databases as data models. However, the query languages for these two data models -- SPARQL for RDF and the property graph traversal language Gremlin -- are lacking interoperability. We present Gremlinator, a novel SPARQL to Gremlin translator. Gremlinator translates SPARQL queries to Gremlin traversals for executing graph pattern matching queries over graph databases. This allows to access and query a wide variety of Graph Data Management Systems (DMSs) using the W3C standardized SPARQL and avoid the steep learning curve of a new Graph Query Language (GQL). Gremlin is a graph computing system agnostic traversal language (covering both OLTP graph database or OLAP graph processors), making it a desirable choice for supporting interoperability for querying Graph DMSs. We present an empirical evaluation for the validity of our approach by formalizing the graph pattern matching construct of Gremlin and illustrate its mapping to corresponding SPARQL queries. Moreover, we also present a proof-of-concept implementation of our approach, demonstrate its validity and applicability by executing SPARQL queries on top of leading Graph stores (Neo4J, Sparksee, and Apache TinkerGraph) and compare their performances with RDF stores (Openlink Virtuoso, 4Store, and JenaTDB). The results indicate that, for complex queries (such as Star-shaped), Gremlin pattern matching traversals out-perform their corresponding SPARQL queries significantly, including their translation time. Gremlinator currently covers a subset of the SPARQL 1.0 specification, specifically the SELECT queries.

READ FULL TEXT
research
01/25/2018

Killing Two Birds with One Stone -- Querying Property Graphs using SPARQL via GREMLINATOR

Knowledge graphs have become popular over the past decade and frequently...
research
10/22/2018

Fast Dual Simulation Processing of Graph Database Queries (Supplement)

Graph database query languages feature expressive, yet computationally e...
research
04/02/2023

S2CTrans: Building a bridge from SPARQL to Cypher

In graph data applications, data is primarily maintained using two model...
research
07/14/2023

PG-Triggers: Triggers for Property Graphs

Graph databases are emerging as the leading data management technology f...
research
10/26/2021

Graph? Yes! Which one? Help!

Amazon Neptune is a graph database service that supports two graph (meta...
research
11/10/2020

Answer Graph: Factorization Matters in Large Graphs

Our answer-graph method to evaluate SPARQL conjunctive queries (CQs) fin...
research
08/17/2019

Towards an Integrated Graph Algebra for Graph Pattern Matching with Gremlin (Extended Version)

Graph data management (also called NoSQL) has revealed beneficial charac...

Please sign up or login with your details

Forgot password? Click here to reset