Storage, Indexing, Query Processing, and Benchmarking in Centralized and Distributed RDF Engines: A Survey

09/22/2020
by   Waqas Ali, et al.
0

The recent advancements of the Semantic Web and Linked Data have changed the working of the traditional web. There is significant adoption of the Resource Description Framework (RDF) format for saving of web-based data. This massive adoption has paved the way for the development of various centralized and distributed RDF processing engines. These engines employ various mechanisms to implement critical components of the query processing engines such as data storage, indexing, language support, and query execution. All these components govern how queries are executed and can have a substantial effect on the query runtime. For example, the storage of RDF data in various ways significantly affects the data storage space required and the query runtime performance. The type of indexing approach used in RDF engines is critical for fast data lookup. The type of the underlying querying language (e.g., SPARQL or SQL) used for query execution is a crucial optimization component of the RDF storage solutions. Finally, query execution involving different join orders significantly affects the query response time. This paper provides a comprehensive review of centralized and distributed RDF engines in terms of storage, indexing, language support, and query execution.

READ FULL TEXT

page 3

page 12

page 16

page 20

page 28

research
02/25/2021

A Survey of RDF Stores SPARQL Engines for Querying Knowledge Graphs

Recent years have seen the growing adoption of non-relational data model...
research
07/20/2018

gSMat: A Scalable Sparse Matrix-based Join for SPARQL Query Processing

Resource Description Framework (RDF) has been widely used to represent i...
research
04/06/2021

DeepEverest: Accelerating Declarative Top-K Queries for Deep Neural Network Interpretation

We design, implement, and evaluate DeepEverest, a system for the efficie...
research
04/02/2021

An Empirical Evaluation of Cost-based Federated SPARQL Query Processing Engines

Finding a good query plan is key to the optimization of query runtime. T...
research
09/04/2022

Towards Adaptive Storage Views in Virtual Memory

Traditionally, DBMSs separate their storage layer from their indexing la...
research
12/12/2021

In-Memory Indexed Caching for Distributed Data Processing

Powerful abstractions such as dataframes are only as efficient as their ...
research
09/17/2023

Building a P2P RDF Store for Edge Devices

The Semantic Web technologies have been used in the Internet of Things (...

Please sign up or login with your details

Forgot password? Click here to reset