RTIndeX: Exploiting Hardware-Accelerated GPU Raytracing for Database Indexing

03/02/2023
by   Justus Henneberg, et al.
0

Data management on GPUs has become increasingly relevant due to a tremendous rise in processing power and available GPU memory. Just like in the CPU world, there is a need for performant GPU-resident index structures to speed up query processing. Unfortunately, mapping indexes efficiently to the highly parallel and hard-to-program hardware is challenging and often fails to yield the desired performance and flexibility. Therefore, we advocate to take a different route. Instead of proposing yet another hand-tailored index, we investigate whether we can exploit an indexing mechanism that is already built into modern GPUs: The raytracing hardware accelerator provided by NVIDIA RTX cards. To do so, we re-phrase database indexing as a raytracing problem, where we express the dataset to be indexed as primitives in a scene, and queries as rays. In this combination, coined RX in the following, lookups are performed as intersection tests in hardware by dedicated raytracing cores. To analyze the pros, cons, and usefulness of the raytracing pipeline for database indexing, we carefully evaluate RX along twelve dimensions: We first identify the optimal setup of the pipeline by evaluating different options regarding (1) how to express keys, (2) which primitives to use for that, (3) how to formulate point/range queries against these, and (4) how to update the index. Continuing with the best setup, we compare RX against a set of software-implemented GPU-resident index structures while varying (5) the number of inserts and queries, (6) the order of the data, (7) the batch size, (8) the hit/miss ratio, (9) the selectivity of range queries, (10) the key size, and (11) the distribution of inserts and queries. Finally, we evaluate (12) the impact of three different GPU generations on the performance. Our results show that RX is competitive against traditional counterparts in a variety of situations.

READ FULL TEXT

page 7

page 8

page 9

page 10

page 11

research
10/25/2019

Overlay Indexes: Efficiently Supporting Aggregate Range Queries and Authenticated Data Structures in Off-the-Shelf Databases

Commercial off-the-shelf DataBase Management Systems (DBMSes) are highly...
research
03/27/2022

GPU-Powered Spatial Database Engine for Commodity Hardware: Extended Version

Given the massive growth in the volume of spatial data, there is a great...
research
01/11/2018

Multidimensional Range Queries on Modern Hardware

Range queries over multidimensional data are an important part of databa...
research
06/27/2019

State-of-the-Art on Query Transaction Processing Acceleration

The vast amount of processing power and memory bandwidth provided by mod...
research
05/18/2020

A Two-level Spatial In-Memory Index

Very large volumes of spatial data increasingly become available and dem...
research
09/12/2022

Robust and Scalable Content-and-Structure Indexing (Extended Version)

Frequent queries on semi-structured hierarchical data are Content-and-St...
research
10/10/2018

Technical Report: KNN Joins Using a Hybrid Approach: Exploiting CPU/GPU Workload Characteristics

This paper studies finding the K nearest neighbors (KNN) of all points i...

Please sign up or login with your details

Forgot password? Click here to reset