Cornerstone: Octree Construction Algorithms for Scalable Particle Simulations

07/12/2023
by Sebastian Keller, et al.

This paper presents an octree construction method, called Cornerstone, that facilitates global domain decomposition and interactions between particles in mesh-free numerical simulations. Our method is based on algorithms developed for 3D computer graphics, which we extend to distributed high-performance computing (HPC) systems. Cornerstone yields global and locally essential octrees and is able to operate on all levels of tree hierarchies in parallel. The resulting octrees are suitable for supporting the computation of various kinds of short- and long-range interactions in N-body methods, such as Barnes-Hut and the Fast Multipole Method (FMM). While we provide a CPU implementation, Cornerstone may run entirely on GPUs. This results in significantly faster tree construction compared to execution on CPUs and serves as a powerful building block for the design of simulation codes that move beyond an offloading approach, where only numerically intensive tasks are dispatched to GPUs. With data residing exclusively in GPU memory, Cornerstone eliminates data movement between CPUs and GPUs. As an example, we employ Cornerstone to generate locally essential octrees for a Barnes-Hut treecode running on almost the full LUMI-G system with up to 8 trillion particles.
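To make the idea of a graphics-style octree build concrete, here is a minimal sketch in Python. It is not the Cornerstone implementation. The abstract does not say which space-filling curve or tree layout is used, so the Morton (Z-order) keys, the representation of leaves as a sorted list of key boundaries, and the names `morton_key`, `build_leaves`, `bits`, and `bucket_size` are all assumptions made for illustration. Each refinement pass inspects every leaf independently, which is the property that lets this kind of scheme run in parallel on a GPU.

```python
from bisect import bisect_left


def morton_key(x, y, z, bits=10):
    """Interleave the bits of three coordinates in [0, 1) into one Z-order key.

    Assumption: Morton ordering is used here only as a familiar example of a
    space-filling-curve key; the abstract does not name the curve.
    """
    scale = 1 << bits
    ix = min(int(x * scale), scale - 1)
    iy = min(int(y * scale), scale - 1)
    iz = min(int(z * scale), scale - 1)
    key = 0
    for b in range(bits):
        key |= ((ix >> b) & 1) << (3 * b + 2)
        key |= ((iy >> b) & 1) << (3 * b + 1)
        key |= ((iz >> b) & 1) << (3 * b)
    return key


def build_leaves(keys, bits, bucket_size):
    """Return sorted leaf boundaries such that each leaf holds at most
    bucket_size keys (or cannot be subdivided any further).

    Leaf i covers the key range [leaves[i], leaves[i + 1]).
    """
    keys = sorted(keys)
    leaves = [0, 1 << (3 * bits)]  # a single root leaf covering all keys
    while True:
        refined = []
        changed = False
        # Every leaf is examined independently of the others in this pass.
        for lo, hi in zip(leaves[:-1], leaves[1:]):
            count = bisect_left(keys, hi) - bisect_left(keys, lo)
            if count > bucket_size and hi - lo >= 8:
                step = (hi - lo) // 8  # split into eight equal octants
                refined.extend(lo + i * step for i in range(8))
                changed = True
            else:
                refined.append(lo)
        refined.append(leaves[-1])
        leaves = refined
        if not changed:
            return leaves


if __name__ == "__main__":
    points = [(0.1, 0.2, 0.3), (0.11, 0.21, 0.31), (0.9, 0.9, 0.9)]
    particle_keys = [morton_key(*p, bits=4) for p in points]
    print(build_leaves(particle_keys, bits=4, bucket_size=1))
```

In this sketch the tree is stored only as leaf boundaries along the curve, so a domain decomposition would amount to cutting that sorted list into contiguous pieces, one per rank. Whether Cornerstone distributes work this way is not stated in the abstract.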

research
03/21/2023

Efficient and scalable hybrid fluid-particle simulations with geometrically resolved particles on heterogeneous CPU-GPU architectures

In recent years, it has become increasingly popular to accelerate numeri...
research
09/19/2017

Magnus integrators on multicore CPUs and GPUs

In the present paper we consider numerical methods to solve the Schrödin...
research
12/13/2020

A GPU-Accelerated Fast Summation Method Based on Barycentric Lagrange Interpolation and Dual Tree Traversal

We present the barycentric Lagrange dual tree traversal (BLDTT) fast sum...
research
10/07/2019

Finding Neighbors in a Forest: A b-tree for Smoothed Particle Hydrodynamics Simulations

Finding the exact close neighbors of each fluid element in mesh-free com...
research
10/10/2017

SoAx: A generic C++ Structure of Arrays for handling Particles in HPC Codes

The numerical study of physical problems often requires integrating the d...
research
03/03/2020

A GPU-Accelerated Barycentric Lagrange Treecode

We present an MPI + OpenACC implementation of the kernel-independent bar...
research
03/09/2021

Fast tree-based algorithms for DBSCAN on GPUs

DBSCAN is a well-known density-based clustering algorithm to discover cl...