Temporal Vectorization for Stencils

10/10/2020
by Liang Yuan, et al.

Stencil computations are a very common class of nested loops in scientific and engineering applications, and exploiting the vector units of modern CPUs is crucial to achieving peak performance. Previous vectorization approaches usually work in the data space, in particular along the innermost unit-stride loop. This leads to the well-known data alignment conflict problem: vector loads overlap because consecutive stencil computations share data. This paper proposes a novel temporal vectorization scheme for stencils. It vectorizes the stencil computation in the iteration space and assembles points with different time coordinates into one vector. Temporal vectorization requires only a small, fixed number of vector reorganizations, independent of the vector length, stencil order, and dimension. It is also applicable to Gauss-Seidel stencils, whose vectorization has not been well studied. The effectiveness of temporal vectorization is demonstrated on various Jacobi and Gauss-Seidel stencils.
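The idea of placing points with different time coordinates in one vector can be illustrated with a small sketch. It is not the authors' implementation: the function names, the lane count `vl`, and the space stride `s` are hypothetical, the stencil is assumed to be a 1D 3-point Jacobi with fixed boundary values, and every time level is kept in a full array for readability rather than in vector registers. Lane `k` updates position `x - k*s` from time level `k` to `k + 1`; with `s >= 2`, every input of a step was produced in an earlier step, so the lanes of one step are independent and could in principle be executed as a single vector operation.

```python
def jacobi_reference(a, steps):
    """Plain time-stepped 1D 3-point Jacobi; the two end points stay fixed."""
    cur = list(a)
    for _ in range(steps):
        nxt = list(cur)
        for x in range(1, len(cur) - 1):
            nxt[x] = (cur[x - 1] + cur[x] + cur[x + 1]) / 3.0
        cur = nxt
    return cur


def jacobi_temporal_lanes(a, vl=4, s=2):
    """Advance `a` by `vl` time steps in one skewed sweep over space.

    Hypothetical illustration: lane k of the emulated vector holds the point
    at position x - k*s and moves it from time level k to time level k + 1.
    """
    if s < 2:
        raise ValueError("space stride s must be >= 2 so lanes are independent")
    n = len(a)
    # level[k] stores time level k. Boundary values never change, so copying
    # `a` into every level initializes them; interior values are overwritten
    # before any lane reads them.
    level = [list(a) for _ in range(vl + 1)]
    last_step = (n - 2) + (vl - 1) * s
    for x in range(1, last_step + 1):
        # Lanes whose position falls inside the interior during this step.
        active = [(k, x - k * s) for k in range(vl) if 1 <= x - k * s <= n - 2]
        # Gather all inputs first, emulating one vector load per neighbor.
        inputs = [(level[k][p - 1], level[k][p], level[k][p + 1]) for k, p in active]
        # Compute and scatter: one emulated vector operation over all lanes.
        for (k, p), (left, center, right) in zip(active, inputs):
            level[k + 1][p] = (left + center + right) / 3.0
    return level[vl]


if __name__ == "__main__":
    grid = [float((i * i) % 7) for i in range(16)]
    print(jacobi_temporal_lanes(grid, vl=4, s=2) == jacobi_reference(grid, 4))
```

Both functions apply the same arithmetic to the same operands, so the sketch only changes the order in which points are visited. It does not model the vector reorganizations between steps that the abstract refers to.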
