WawPart: Workload-Aware Partitioning of Knowledge Graphs

03/28/2022
by Amitabh Priyadarshi, et al.

Large-scale datasets in the form of knowledge graphs are used in numerous domains today. A knowledge graph's size often exceeds the capacity of a single computer system, especially if the graph must be stored in main memory. To overcome this, knowledge graphs can be partitioned into multiple sub-graphs and distributed as shards among many computing nodes. However, the performance of many common graph tasks, such as querying, suffers as a result, because graph edges that cross (cut) the partitions force distributed joins. In this paper, we propose a method of knowledge graph partitioning that takes into account a set of queries (the workload). The resulting partitioning aims to reduce the number of distributed joins and improve workload performance. Critical features identified in the query workload and the knowledge graph are used to cluster the queries and then partition the graph. Queries are rewritten to account for the graph partitioning. Our evaluation results demonstrate an improvement in workload processing time.
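
To make the idea of workload-aware partitioning concrete, here is a minimal Python sketch. It is not the authors' algorithm. It assumes, purely for illustration, that the only feature used is the predicate of each triple pattern and that a distributed join occurs whenever two patterns sharing a variable are placed on different shards. The workload, the predicate names, and all function names are hypothetical.

```python
from itertools import combinations

# Hypothetical workload: each query is a list of triple patterns
# (subject, predicate, object); terms starting with "?" are variables.
WORKLOAD = {
    "q1": [("?p", "employedBy", "?o"), ("?o", "locatedIn", "?c")],
    "q2": [("?p", "authorOf", "?b"), ("?b", "publishedBy", "?h")],
    "q3": [("?p", "employedBy", "?o"), ("?o", "locatedIn", "Athens")],
}


def variables(pattern):
    """Return the set of variables appearing in one triple pattern."""
    return {term for term in pattern if term.startswith("?")}


def distributed_joins(query, shard_of):
    """Count pattern pairs that share a variable but live on different shards."""
    count = 0
    for a, b in combinations(query, 2):
        if variables(a) & variables(b) and shard_of[a[1]] != shard_of[b[1]]:
            count += 1
    return count


def workload_cost(workload, shard_of):
    """Total number of distributed joins over all queries in the workload."""
    return sum(distributed_joins(q, shard_of) for q in workload.values())


def round_robin_partition(predicates, n_shards):
    """Workload-oblivious baseline: deal sorted predicates out to shards in turn."""
    return {p: i % n_shards for i, p in enumerate(sorted(predicates))}


def workload_aware_partition(workload, n_shards):
    """Cluster predicates that co-occur in queries; keep each cluster on one shard."""
    groups = []  # disjoint sets of predicates
    for query in workload.values():
        preds = {p for _, p, _ in query}
        # Merge every existing group that overlaps with this query's predicates.
        for group in [g for g in groups if g & preds]:
            preds |= group
            groups.remove(group)
        groups.append(preds)

    shard_of = {}
    load = [0] * n_shards
    # Place the largest clusters first, each on the least-loaded shard.
    for group in sorted(groups, key=len, reverse=True):
        target = load.index(min(load))
        for p in group:
            shard_of[p] = target
        load[target] += len(group)
    return shard_of


if __name__ == "__main__":
    predicates = {p for q in WORKLOAD.values() for _, p, _ in q}
    baseline = round_robin_partition(predicates, 2)
    aware = workload_aware_partition(WORKLOAD, 2)
    print("baseline joins:", workload_cost(WORKLOAD, baseline))
    print("workload-aware joins:", workload_cost(WORKLOAD, aware))
```

On this toy workload the round-robin baseline splits every joined pair of predicates across the two shards, while the clustered placement keeps each query on a single shard. A real system would also have to balance shard sizes against join locality and rewrite queries to target the chosen shards, which this sketch does not attempt.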

research
03/28/2022

AWAPart: Adaptive Workload-Aware Partitioning of Knowledge Graphs

Large-scale knowledge graphs are increasingly common in many domains. Th...
research
11/17/2017

Loom: Query-aware Partitioning of Online Graphs

As with general graph processing systems, partitioning data over a clust...
research
05/14/2019

Query Processing on Large Graphs: Approaches To Scalability and Response Time Trade Offs

With the advent of social networks and the web, the graph sizes have gro...
research
02/11/2019

Accelerating Partial Evaluation in Distributed SPARQL Query Evaluation

Partial evaluation has recently been used for processing SPARQL queries ...
research
11/26/2019

Prediction of Horizontal Data Partitioning Through Query Execution Cost Estimation

The excessively increased volume of data in modern data management syste...
research
04/18/2020

DGL-KE: Training Knowledge Graph Embeddings at Scale

Knowledge graphs have emerged as a key abstraction for organizing inform...
research
09/10/2021

Mixture-of-Partitions: Infusing Large Biomedical Knowledge Graphs into BERT

Infusing factual knowledge into pre-trained models is fundamental for ma...
