TripleID-Q: RDF Query Processing Framework using GPU

07/04/2018
by   Chantana Chantrapornchai, et al.
0

Resource Description Framework (RDF) data represents information linkage around the Internet. It uses Inter- nationalized Resources Identifier (IRI) which can be referred to external information. Typically, an RDF data is serialized as a large text file which contains millions of relationships. In this work, we propose a framework based on TripleID-Q, for query processing of large RDF data in a GPU. The key elements of the framework are 1) a compact representation suitable for a Graphics Processing Unit (GPU) and 2) its simple representation conversion method which optimizes the preprocessing overhead. Together with the framework, we propose parallel algorithms which utilize thousands of GPU threads to look for specific data for a given query as well as to perform basic query operations such as union, join, and filter. The TripleID representation is smaller than the original representation 3-4 times. Querying from TripleID using a GPU is up to 108 times faster than using the traditional RDF tool. The speedup can be more than 1,000 times over the traditional RDF store when processing a complex query with union and join of many subqueries.

READ FULL TEXT
research
04/25/2019

GPU-based Efficient Join Algorithms on Hadoop

The growing data has brought tremendous pressure for query processing an...
research
07/20/2018

gSMat: A Scalable Sparse Matrix-based Join for SPARQL Query Processing

Resource Description Framework (RDF) has been widely used to represent i...
research
12/21/2022

Resource Utilization Monitoring for Raw Data Query Processing

Scientific experiments, simulations, and modern applications generate la...
research
05/23/2018

GPU Accelerated Cascade Hashing Image Matching for Large Scale 3D Reconstruction

Image feature point matching is a key step in Structure from Motion(SFM)...
research
03/02/2023

Sampling over Union of Joins

Data scientists often draw on multiple relational data sources for analy...
research
10/04/2018

GPU based Parallel Optimization for Real Time Panoramic Video Stitching

Panoramic video is a sort of video recorded at the same point of view to...
research
12/26/2019

Efficient processing of raster and vector data

In this work, we propose a framework to store and manage spatial data, w...

Please sign up or login with your details

Forgot password? Click here to reset