A GPU-Accelerated Barycentric Lagrange Treecode

03/03/2020
by   Nathan Vaughn, et al.
0

We present an MPI + OpenACC implementation of the kernel-independent barycentric Lagrange treecode (BLTC) for fast summation of particle interactions on GPUs. The distributed memory parallelization uses recursive coordinate bisection for domain decomposition and MPI remote memory access to build locally essential trees on each rank. The particle interactions are organized into target batch/source cluster interactions which efficiently map onto the GPU; target batching provides an outer level of parallelism, while the direct sum form of the barycentric particle-cluster approximation provides an inner level of parallelism. The GPU-accelerated BLTC performance is demonstrated on several test cases up to 1 billion particles interacting via the Coulomb potential and Yukawa potential.

READ FULL TEXT

page 4

page 9

research
12/13/2020

A GPU-Accelerated Fast Summation Method Based on Barycentric Lagrange Interpolation and Dual Tree Traversal

We present the barycentric Lagrange dual tree traversal (BLDTT) fast sum...
research
09/13/2023

GPU Scheduler for De Novo Genome Assembly with Multiple MPI Processes

De Novo Genome assembly is one of the most important tasks in computatio...
research
10/06/2022

Memory-Efficient Recursive Evaluation of 3-Center Gaussian Integrals

To improve the efficiency of Gaussian integral evaluation on modern acce...
research
03/24/2020

Gadget3 on GPUs with OpenACC

We present preliminary results of a GPU porting of all main Gadget3 modu...
research
07/12/2023

Cornerstone: Octree Construction Algorithms for Scalable Particle Simulations

This paper presents an octree construction method, called Cornerstone, t...
research
03/05/2023

Acceleration of a production Solar MHD code with Fortran standard parallelism: From OpenACC to `do concurrent'

There is growing interest in using standard language constructs for acce...
research
10/27/2020

A GPU-accelerated adaptive FSAI preconditioner for massively parallel simulations

The solution of linear systems of equations is a central task in a numbe...

Please sign up or login with your details

Forgot password? Click here to reset