Fast GPU bounding boxes on tree-structured scenes

05/23/2022
by Raph Levien, et al.

Computation of bounding boxes is a fundamental problem in high-performance rendering, as it is an input to visibility culling and binning operations. In a scene description structured as a tree, clip nodes and blend nodes entail intersection and union of bounding boxes, respectively. These are straightforward to compute on the CPU using a sequential algorithm, but an efficient parallel GPU algorithm is more elusive. This paper presents a fast and practical solution, with a new algorithm for the classic parentheses matching problem at its core. The core algorithm is first presented abstractly (in terms of a PRAM model), then with a concrete mapping to the thread, workgroup, and dispatch levels of real GPU hardware. The algorithm is implemented portably using compute shaders. Performance results show a dramatic speedup over a sequential CPU version and reach a reasonable fraction of the maximum theoretical throughput of the GPU hardware. The immediate motivating application is 2D rendering, but the algorithms generalize to other domains, and the core parentheses matching problem has other applications, including parsing.
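To make the problem statement concrete, the following is a minimal sketch of the sequential CPU baseline the abstract refers to, not the paper's GPU algorithm. It assumes axis-aligned boxes stored as `(x0, y0, x1, y1)` tuples and a hypothetical `Node` type with `leaf`, `blend`, and `clip` kinds; all names here are illustrative. It also shows the classic stack-based parentheses matching, which is the sequential counterpart of the problem the paper solves in parallel.

```python
from dataclasses import dataclass, field
from math import inf
from typing import List, Optional, Tuple

BBox = Tuple[float, float, float, float]  # (x0, y0, x1, y1), an assumed layout
EMPTY: BBox = (inf, inf, -inf, -inf)      # identity element for union


def union(a: BBox, b: BBox) -> BBox:
    """Smallest box containing both a and b."""
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))


def intersect(a: BBox, b: BBox) -> BBox:
    """Overlap of a and b, or EMPTY if they do not overlap."""
    r = (max(a[0], b[0]), max(a[1], b[1]), min(a[2], b[2]), min(a[3], b[3]))
    return r if r[0] < r[2] and r[1] < r[3] else EMPTY


@dataclass
class Node:
    """Hypothetical scene-tree node: kind is 'leaf', 'blend', or 'clip'."""
    kind: str
    bbox: Optional[BBox] = None           # leaf extent, or the clip region
    children: List["Node"] = field(default_factory=list)


def bounding_box(node: Node) -> BBox:
    """Sequential recursive pass: blend = union, clip = intersection."""
    if node.kind == "leaf":
        return node.bbox
    content = EMPTY
    for child in node.children:
        content = union(content, bounding_box(child))
    if node.kind == "clip":
        return intersect(node.bbox, content)
    return content


def match_parens(s: str) -> List[int]:
    """For each index of a balanced string, return its partner's index."""
    result = [-1] * len(s)
    stack: List[int] = []
    for i, ch in enumerate(s):
        if ch == "(":
            stack.append(i)
        else:
            if not stack:
                raise ValueError("unbalanced input")
            j = stack.pop()
            result[i], result[j] = j, i
    if stack:
        raise ValueError("unbalanced input")
    return result


scene = Node("clip", (0, 0, 10, 10), [
    Node("blend", children=[
        Node("leaf", (2, 3, 5, 6)),
        Node("leaf", (4, 1, 12, 8)),
    ]),
])
print(bounding_box(scene))     # (2, 1, 10, 8)
print(match_parens("(()())"))  # [5, 2, 1, 4, 3, 0]
```

Both routines depend on a sequential stack (the call stack in `bounding_box`, an explicit one in `match_parens`), which is what makes a direct GPU port difficult: each step needs the state left by the previous one. If the tree is flattened into a stream with begin and end markers for clip and blend nodes, finding each node's extent reduces to matching those markers, which is one way to see why parentheses matching sits at the core of the problem.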

research
10/25/2021

Accelerating Compact Fractals with Tensor Core GPUs

This work presents a GPU thread mapping approach that allows doing fast ...
research
12/30/2009

Computing Principal Components Dynamically

In this paper we present closed-form solutions for efficiently updating ...
research
06/10/2021

Gaussian Bounding Boxes and Probabilistic Intersection-over-Union for Object Detection

Most object detection methods use bounding boxes to encode and represent...
research
01/03/2022

Squeeze: Efficient Compact Fractals for Tensor Core GPUs

This work presents Squeeze, an efficient compact fractal processing sche...
research
03/29/2020

Detection of 3D Bounding Boxes of Vehicles Using Perspective Transformation for Accurate Speed Measurement

Detection and tracking of vehicles captured by traffic surveillance came...
research
11/08/2018

Fast, High Precision Ray/Fiber Intersection using Tight, Disjoint Bounding Volumes

Analyzing and identifying the shortcomings of current subdivision method...
research
04/25/2020

Efficient GPU Thread Mapping on Embedded 2D Fractals

This work proposes a new approach for mapping GPU threads onto a family ...