Distributed Parallelization of xPU Stencil Computations in Julia

11/28/2022

∙

We present a straightforward approach for distributed parallelization of stencil-based xPU applications on a regular staggered grid, which is instantiated in the package ImplicitGlobalGrid.jl. The approach allows to leverage remote direct memory access and enables close to ideal weak scaling of real-world applications on thousands of GPUs. The communication costs can be easily hidden behind computation.

READ FULL TEXT

Distributed Parallelization of xPU Stencil Computations in Julia

Sign in with Google

Consider DeepAI Pro