Enhancing data locality of the conjugate gradient method for high-order matrix-free finite-element implementations

05/18/2022
by   Martin Kronbichler, et al.
0

This work investigates a variant of the conjugate gradient (CG) method and embeds it into the context of high-order finite-element schemes with fast matrix-free operator evaluation and cheap preconditioners like the matrix diagonal. Relying on a data-dependency analysis and appropriate enumeration of degrees of freedom, we interleave the vector updates and inner products in a CG iteration with the matrix-vector product with only minor organizational overhead. As a result, around 90 vectors of the CG method are transferred from slow RAM memory exactly once per iteration, with all additional access hitting fast cache memory. Node-level performance analyses and scaling studies on up to 147k cores show that the CG method with the proposed performance optimizations is around two times faster than a standard CG solver as well as optimized pipelined CG and s-step CG methods for large sizes that exceed processor caches, and provides similar performance near the strong scaling limit.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/01/2019

Algorithms and data structures for matrix-free finite element operators with MPI-parallel sparse multi-vectors

Traditional solution approaches for problems in quantum mechanics scale ...
research
09/20/2017

A stencil scaling approach for accelerating matrix-free finite element implementations

We present a novel approach to fast on-the-fly low order finite element ...
research
08/11/2023

Conforming Finite Element Function Spaces in Four Dimensions, Part 1: Foundational Principles and the Tesseract

The stability, robustness, accuracy, and efficiency of space-time finite...
research
08/18/2023

Performant low-order matrix-free finite element kernels on GPU architectures

Numerical methods such as the Finite Element Method (FEM) have been succ...
research
10/21/2022

End-to-end GPU acceleration of low-order-refined preconditioning for high-order finite element discretizations

In this paper, we present algorithms and implementations for the end-to-...
research
10/24/2017

Implicit Low-Order Unstructured Finite-Element Multiple Simulation Enhanced by Dense Computation using OpenACC

In this paper, we develop a low-order three-dimensional finite-element s...
research
02/02/2020

Condensed Generalized Finite Element Method (CGFEM)

Generalized or extended finite element methods (GFEM/XFEM) are in genera...

Please sign up or login with your details

Forgot password? Click here to reset