Fast, feature-rich weakly-compressible SPH on GPU: coding strategies and compiler choices

07/22/2022
by   Giuseppe Bilotta, et al.
0

GPUSPH was the first implementation of the weakly-compressible Smoothed Particle Hydrodynamics method to run entirely on GPU using CUDA. Version 5, released in June 2018, features a radical restructuring of the code, offering a more structured implementation of several features and specialized optimization of most heavy-duty computational kernels. While these improvements have led to a measurable performance boost (ranging from 15% to 30% depending on the test case and hardware configuration), it has also uncovered some of the limitations of the official CUDA compiler () offered by NVIDIA, especially in regard to developer friendliness. This has led to an effort to support alternative compilers, particularly Clang, with surprising performance gains.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/24/2018

Improving OpenCL Performance by Specializing Compiler Phase Selection and Ordering

Automatic compiler phase selection/ordering has traditionally been focus...
research
07/18/2022

Implementation of a Didactic Compiler for a superset of PL/0

This article describes the features of a compiler for a superset languag...
research
04/20/2023

Backporting RISC-V Vector assembly

Leveraging vectorisation, the ability for a CPU to apply operations to m...
research
11/23/2021

Generating GPU Compiler Heuristics using Reinforcement Learning

GPU compilers are complex software programs with many optimizations spec...
research
07/06/2017

Cooperative Kernels: GPU Multitasking for Blocking Algorithms (Extended Version)

There is growing interest in accelerating irregular data-parallel algori...
research
08/28/2022

ECP SOLLVE: Validation and Verification Testsuite Status Update and Compiler Insight for OpenMP

The OpenMP language continues to evolve with every new specification rel...
research
11/20/2020

AZP: Automatic Specialization for Zero Values in Gaming Applications

Recent research has shown that dynamic zeros in shader programs of gamin...

Please sign up or login with your details

Forgot password? Click here to reset