A performance portable implementation of the semi-Lagrangian algorithm in six dimensions

03/10/2023
by   Nils Schild, et al.
0

In this paper, we describe our approach to develop a simulation software application for the fully kinetic Vlasov equation which will be used to explore physics beyond the gyrokinetic model. Simulating the fully kinetic Vlasov equation requires efficient utilization of compute and storage capabilities due to the high dimensionality of the problem. In addition, the implementation needs to be extensibility regarding the physical model and flexible regarding the hardware for production runs. We start on the algorithmic background to simulate the 6-D Vlasov equation using a semi-Lagrangian algorithm. The performance portable software stack, which enables production runs on pure CPU as well as AMD or Nvidia GPU accelerated nodes, is presented. The extensibility of our implementation is guaranteed through the described software architecture of the main kernel, which achieves a memory bandwidth of almost 500 GB/s on a V100 Nvidia GPU and around 100 GB/s on an Intel Xeon Gold CPU using a single code base. We provide performance data on multiple node level architectures discussing utilized and further available hardware capabilities. Finally, the network communication bottleneck of 6-D grid based algorithms is quantified. A verification of physics beyond gyrokinetic theory for the example of ion Bernstein waves concludes the work.

READ FULL TEXT
research
07/18/2019

Semi-Lagrangian Vlasov simulation on GPUs

In this paper, our goal is to efficiently solve the Vlasov equation on G...
research
11/06/2018

GPU Acceleration of an Established Solar MHD Code using OpenACC

GPU accelerators have had a notable impact on high-performance computing...
research
10/14/2020

Performance Analysis of a Quantum Monte Carlo Application on Multiple Hardware Architectures Using the HPX Runtime

This paper describes how we successfully used the HPX programming model ...
research
06/28/2021

Leveraging GPU batching for scalable nonlinear programming through massive Lagrangian decomposition

We present the implementation of a trust-region Newton algorithm ExaTron...
research
03/04/2020

Multichannel Analysis of Surface Waves Accelerated (MASWAccelerated): Software for Efficient Surface Wave Inversion Using MPI and GPUs

Multichannel Analysis of Surface Waves (MASW) is a technique frequently ...
research
09/14/2020

PDFFlow: parton distribution functions on GPU

We present PDFFlow, a new software for fast evaluation of parton distrib...
research
12/19/2018

Targeting GPUs with OpenMP Directives on Summit: A Simple and Effective Fortran Experience

We use OpenMP directives to target hardware accelerators (GPUs) on Summi...

Please sign up or login with your details

Forgot password? Click here to reset