Efficient GPU Thread Mapping on Embedded 2D Fractals

04/25/2020
by   Cristóbal A. Navarro, et al.
0

This work proposes a new approach for mapping GPU threads onto a family of discrete embedded 2D fractals. A block-space map λ: Z_E^2Z_F^2 is proposed, from Euclidean parallel space E to embedded fractal space F, that maps in O(log_2 log_2(n)) time and uses no more than O(n^H) threads with H being the Hausdorff dimension of the fractal, making it parallel space efficient. When compared to a bounding-box (BB) approach, λ(ω) offers a sub-exponential improvement in parallel space and a monotonically increasing speedup n > n_0. The Sierpinski gasket fractal is used as a particular case study and the experimental performance results show that λ(ω) reaches up to 9× of speedup over the bounding-box approach. A tensor-core based implementation of λ(ω) is also proposed for modern GPUs, providing up to ∼40% of extra performance. The results obtained in this work show that doing efficient GPU thread mapping on fractal domains can significantly improve the performance of several applications that work with this type of geometry.

READ FULL TEXT
research
10/25/2021

Accelerating Compact Fractals with Tensor Core GPUs

This work presents a GPU thread mapping approach that allows doing fast ...
research
08/24/2022

A Scalable and Energy Efficient GPU Thread Map for m-Simplex Domains

This work proposes a new GPU thread map for m-simplex domains, that scal...
research
01/03/2022

Squeeze: Efficient Compact Fractals for Tensor Core GPUs

This work presents Squeeze, an efficient compact fractal processing sche...
research
09/22/2021

GPU4S: Embedded GPUs in Space – Latest Project Updates

Following the trend of other safety-critical industries like automotive ...
research
01/15/2020

GPU Tensor Cores for fast Arithmetic Reductions

This work proposes a GPU tensor core approach that encodes the arithmeti...
research
05/23/2022

Fast GPU bounding boxes on tree-structured scenes

Computation of bounding boxes is a fundamental problem in high performan...
research
10/19/2017

A Fast and Generic GPU-Based Parallel Reduction Implementation

Reduction operations are extensively employed in many computational prob...

Please sign up or login with your details

Forgot password? Click here to reset