Alternative quadrant representations with Morton index and AVX2 vectorization for AMR algorithms within the p4est software library

08/24/2023
by   Mikhail Kirilin, et al.
0

We present a technical enhancement within the p4est software for parallel adaptive mesh refinement. In p4est primitives are stored as octants in three and quadrants in two dimensions. While, classically, they are encoded by the native approach using its spatial and refinement level, any other mathematically equivalent encoding might be used instead. Recognizing this, we add two alternative representations to the classical, explicit version, based on a long monotonic index and 128-bit AVX quad integers, respectively. The first one requires changes in logic for low-level quadrant manipulating algorithms, while the other exploits data level parallelism and requires algorithms to be adapted to SIMD instructions. The resultant algorithms and data structures lead to higher performance and lesser memory usage in comparison with the standard baseline. We benchmark selected algorithms on a cluster with two Intel(R) Xeon(R) Gold 6130 Skylake family CPUs per node, which provides support for AVX2 extensions, 192 GB RAM per node, and up to 512 computational cores in total.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/13/2018

Scalable Algorithms for Parallel Tree-based Adaptive Mesh Refinement with General Element Types

In this thesis, we develop, discuss and implement algorithms for scalabl...
research
02/22/2017

Enhancing speed and scalability of the ParFlow simulation code

Regional hydrology studies are often supported by high resolution simula...
research
06/28/2021

Combinatorial BLAS 2.0: Scaling combinatorial algorithms on distributed-memory systems

Combinatorial algorithms such as those that arise in graph analysis, mod...
research
12/07/2021

Parallel Discrete Convolutions on Adaptive Particle Representations of Images

We present data structures and algorithms for native implementations of ...
research
07/08/2019

A generic finite element framework on parallel tree-based adaptive meshes

In this work we formally derive and prove the correctness of the algorit...
research
04/12/2016

BoxLib with Tiling: An AMR Software Framework

In this paper we introduce a block-structured adaptive mesh refinement (...
research
01/31/2018

Cataloging the Visible Universe through Bayesian Inference at Petascale

Astronomical catalogs derived from wide-field imaging surveys are an imp...

Please sign up or login with your details

Forgot password? Click here to reset