Massively scalable stencil algorithm

04/07/2022
by   Mathias Jacquelin, et al.
0

Stencil computations lie at the heart of many scientific and industrial applications. Unfortunately, stencil algorithms perform poorly on machines with cache based memory hierarchy, due to low re-use of memory accesses. This work shows that for stencil computation a novel algorithm that leverages a localized communication strategy effectively exploits the Cerebras WSE-2, which has no cache hierarchy. This study focuses on a 25-point stencil finite-difference method for the 3D wave equation, a kernel frequently used in earth modeling as numerical simulation. In essence, the algorithm trades memory accesses for data communication and takes advantage of the fast communication fabric provided by the architecture. The algorithm – historically memory bound – becomes compute bound. This allows the implementation to achieve near perfect weak scaling, reaching up to 503 TFLOPs on WSE-2, a figure that only full clusters can eventually yield.

READ FULL TEXT
research
07/07/2017

Duty to Delete on Non-Volatile Memory

We firstly suggest new cache policy applying the duty to delete invalid ...
research
12/28/2021

Casper: Accelerating Stencil Computation using Near-cache Processing

Stencil computation is one of the most used kernels in a wide variety of...
research
09/30/2016

GPU Acceleration of Hermite Methods for the Simulation of Wave Propagation

The Hermite methods of Goodrich, Hagstrom, and Lorenz (2006) use Hermite...
research
03/22/2010

Proficient Pair of Replacement Algorithms on L1 and L2 Cache for Merge Sort

Memory hierarchy is used to compete the processors speed. Cache memory i...
research
04/21/2023

Massively Distributed Finite-Volume Flux Computation

Designing large-scale geological carbon capture and storage projects and...
research
01/24/2021

A Survey of Novel Cache Hierarchy Designs for High Workloads

Traditional on-die, three-level cache hierarchy design is very commonly ...
research
10/01/2020

Modelling the earth's geomagnetic environment on Cray machines using PETSc and SLEPc

The British Geological Survey's global geomagnetic model, Model of the E...

Please sign up or login with your details

Forgot password? Click here to reset