Load Balanced Semantic Aware Distributed RDF Graph

07/22/2021
by   Ami Pandat, et al.
0

The modern day semantic applications store data as Resource Description Framework (RDF) data.Due to Proliferation of RDF Data, the efficient management of huge RDF data has become essential. A number of approaches pertaining to both relational and graph-based have been devised to handle this huge data. As the relational approach suffers from query joins, we propose a semantic aware graph based partitioning method. The partitioned fragments are further allocated in a load balanced way. For efficient query processing, partial replication is implemented. It reduces Inter node Communication thereby accelerating queries on distributed RDF Graph. This approach has been demonstrated in two phases partitioning and Distribution of Linked Observation Data (LOD). The time complexity for partitioning and distribution of Load Balanced Semantic Aware RDF Graph (LBSD) is O(n) where n is the number of triples which is demonstrated by linear increment in algorithm execution time (AET) for LOD data scaled from 1x to 5x. LBSD has been found to behave well till 4x. LBSD is compared with the state of the art relational and graph-based partitioning techniques. LBSD records 71 four query types. For most frequent query types, Linear and Star, on an average 65 The optimal replication level has been found to be 12

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/12/2022

Query Complexity Based Optimal Processing of Raw Data

The paper aims to find an efficient way for processing large datasets ha...
research
02/11/2019

Accelerating Partial Evaluation in Distributed SPARQL Query Evaluation

Partial evaluation has recently been used for processing SPARQL queries ...
research
01/27/2020

Load-Balanced Bottleneck Objectives in Process Mapping

We propose a new problem formulation for graph partitioning that is tail...
research
05/30/2018

Q-Graph: Preserving Query Locality in Multi-Query Graph Processing

Arising user-centric graph applications such as route planning and perso...
research
02/16/2018

PRoST: Distributed Execution of SPARQL Queries Using Mixed Partitioning Strategies

The rapidly growing size of RDF graphs in recent years necessitates dist...
research
05/03/2018

Balanced k-means for Parallel Geometric Partitioning

Mesh partitioning is an indispensable tool for efficient parallel numeri...
research
11/03/2020

Distributing Sparse Matrix/Graph Applications in Heterogeneous Clusters – an Experimental Study

Many problems in scientific and engineering applications contain sparse ...

Please sign up or login with your details

Forgot password? Click here to reset