Accelerating Range Minimum Queries with Ray Tracing Cores

06/05/2023
by Enzo Meneses, et al.

Over the last decade, GPU technology has shifted from pure general-purpose computation toward the inclusion of application-specific integrated circuits (ASICs), such as Tensor Cores and Ray Tracing (RT) cores. Although these special-purpose cores were designed to accelerate specific fields such as AI and real-time rendering, recent research has exploited them to accelerate other tasks that typically relied on regular GPU computing. In this work we present RTXRMQ, a new approach that computes range minimum queries (RMQs) with RT cores. The main contribution is a geometric solution for RMQ in which elements become triangles that are placed and shaped according to the element's value and position in the array, respectively, such that the closest hit of a ray launched from a point given by the query parameters corresponds to the result of that query. Experimental results show that RTXRMQ is currently best suited for query ranges that are small relative to the problem size, achieving speedups of up to 5× and 2.3× over state-of-the-art CPU (HRMQ) and GPU (LCA) approaches, respectively. Although RTXRMQ is currently surpassed by LCA for medium and large query ranges, it remains competitive, being 2.5× and 4× faster than HRMQ, which is a highly parallel CPU approach. Furthermore, performance scaling experiments across the latest RTX GPU architectures show that, if the current RT scaling trend continues, RTXRMQ's performance would scale at a higher rate than that of HRMQ and LCA, making the approach even more relevant for future high-performance applications that employ batches of RMQs.
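
To make the geometric idea concrete, the following is a minimal CPU-only sketch in Python. It is not the RTXRMQ implementation and uses no ray tracing API. It assumes one possible layout: each element `i` is treated as a shape that covers every query point `(l, r)` with `l <= i <= r` and sits at a depth equal to its value, so that a ray through `(l, r)` along the value axis first hits the smallest covered value. The function names `rmq_bruteforce` and `rmq_closest_hit` are hypothetical and chosen only for this illustration.

```python
from typing import List, Optional


def rmq_bruteforce(a: List[float], l: int, r: int) -> int:
    """Reference RMQ: index of the minimum of a[l..r], inclusive."""
    return min(range(l, r + 1), key=lambda i: a[i])


def rmq_closest_hit(a: List[float], l: int, r: int) -> Optional[int]:
    """Emulate a closest-hit search for the query point (l, r).

    Assumption for this sketch: element i's shape covers the region
    {(l, r) : l <= i <= r} of the query plane and lies at depth a[i].
    A ray through (l, r) therefore intersects exactly the elements
    inside the query range, and the nearest intersection is the minimum.
    """
    closest: Optional[int] = None
    for i, depth in enumerate(a):
        ray_hits_shape = l <= i <= r  # coverage test in the (l, r) plane
        if ray_hits_shape and (closest is None or depth < a[closest]):
            closest = i  # keep the nearest hit found so far
    return closest


if __name__ == "__main__":
    data = [5, 2, 7, 1, 9, 3]
    for l, r in [(0, 2), (2, 5), (4, 5)]:
        print((l, r), rmq_closest_hit(data, l, r), rmq_bruteforce(data, l, r))
```

The linear scan here only stands in for the hardware traversal. On RT cores, the closest-hit search would be performed over an acceleration structure built on the triangles rather than by visiting every element.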


research
03/16/2023

RT-DBSCAN: Accelerating DBSCAN using Ray Tracing Hardware

General Purpose computing on Graphical Processing Units (GPGPU) has resu...
research
05/12/2023

Combining GPU Tracing Methods within a Single Ray Query

A recent trend in real-time rendering is the utilization of the new hard...
research
08/24/2022

A Scalable and Energy Efficient GPU Thread Map for m-Simplex Domains

This work proposes a new GPU thread map for m-simplex domains, that scal...
research
05/26/2023

RT-kNNS Unbound: Using RT Cores to Accelerate Unrestricted Neighbor Search

The problem of identifying the k-Nearest Neighbors (kNNS) of a point has...
research
02/16/2021

cuFINUFFT: a load-balanced GPU library for general-purpose nonuniform FFTs

Nonuniform fast Fourier transforms dominate the computational cost in ma...
research
05/16/2022

Accelerating X-Ray Tracing for Exascale Systems using Kokkos

The upcoming exascale computing systems Frontier and Aurora will draw mu...
research
02/24/2022

Point Containment Queries on Ray Tracing Cores for AMR Flow Visualization

Modern GPUs come with dedicated hardware to perform ray/triangle interse...
