Hierarchical Characteristic Set Merging for Optimizing SPARQL Queries in Heterogeneous RDF

09/07/2018
by   Marios Meimaris, et al.
0

Characteristic sets (CS) organize RDF triples based on the set of properties characterizing their subject nodes. This concept is recently used in indexing techniques, as it can capture the implicit schema of RDF data. While most CS-based approaches yield significant improvements in space and query performance, they fail to perform well in the presence of schema heterogeneity, i.e., when the number of CSs becomes very large, resulting in a highly partitioned data organization. In this paper, we address this problem by introducing a novel technique, for merging CSs based on their hierarchical structure. Our technique employs a lattice to capture the hierarchical relationships between CSs, identifies dense CSs and merges dense CSs with their ancestors, thus reducing the size of the CSs as well as the links between them. We implemented our algorithm on top of a relational backbone, where each merged CS is stored in a relational table, and we performed an extensive experimental study to evaluate the performance and impact of merging to the storage and querying of RDF datasets, indicating significant improvements.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/26/2021

An Automatic Schema-Instance Approach for Merging Multidimensional Data Warehouses

Using data warehouses to analyse multidimensional data is a significant ...
research
09/01/2021

MORTAL: A Tool of Automatically Designing Relational Storage Schemas for Multi-model Data through Reinforcement Learning

Considering relational databases having powerful capabilities in handlin...
research
10/17/2019

An LSM-based Tuple Compaction Framework for Apache AsterixDB

Document database systems store self-describing records, such as JSON, "...
research
08/07/2023

A Polystore Architecture Using Knowledge Graphs to Support Queries on Heterogeneous Data Stores

Modern applications commonly need to manage dataset types composed of he...
research
04/04/2022

Adaptive Merging on Phase Change Memory

Indexing is a well-known database technique used to facilitate data acce...
research
11/26/2019

Prediction of Horizontal Data Partitioning Through Query Execution Cost Estimation

The excessively increased volume of data in modern data management syste...
research
10/22/2020

Accelerating computational modeling and design of high-entropy alloys

With huge design spaces for unique chemical and mechanical properties, w...

Please sign up or login with your details

Forgot password? Click here to reset