Accelerating Geometric Multigrid Preconditioning with Half-Precision Arithmetic on GPUs

07/15/2020
by   Kyaw L. Oo, et al.
0

With the hardware support for half-precision arithmetic on NVIDIA V100 GPUs, high-performance computing applications can benefit from lower precision at appropriate spots to speed up the overall execution time. In this paper, we investigate a mixed-precision geometric multigrid method to solve large sparse systems of equations stemming from discretization of elliptic PDEs. While the final solution is always computed with high-precision accuracy, an iterative refinement approach with multigrid preconditioning in lower precision and residuum scaling is employed. We compare the FP64 baseline for Poisson's equation to purely FP16 multigrid preconditioning and to the employment of FP16-FP32-FP64 combinations within a mesh hierarchy. While the iteration count is almost not affected by using lower accuracy, the solver runtime is considerably decreased due to the reduced memory transfer and a speedup of up to 2.5x is gained for the overall solver. We investigate the performance of selected kernels with the hierarchical Roofline model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/02/2021

A Study of Mixed Precision Strategies for GMRES on GPUs

Support for lower precision computation is becoming more common in accel...
research
08/03/2022

A Hybrid Factorization Algorithm for Sparse Matrix with Mixed Precision Arithmetic

A new hybrid algorithm for LDU-factorization for large sparse matrix com...
research
05/16/2021

Experimental Evaluation of Multiprecision Strategies for GMRES on GPUs

Support for lower precision computation is becoming more common in accel...
research
03/30/2021

Mixed-precision for Linear Solvers in Global Geophysical Flows

Semi-implicit time-stepping schemes for atmosphere and ocean models requ...
research
09/25/2020

Compressed Basis GMRES on High Performance GPUs

Krylov methods provide a fast and highly parallel numerical tool for the...
research
04/23/2021

tcFFT: Accelerating Half-Precision FFT through Tensor Cores

Fast Fourier Transform (FFT) is an essential tool in scientific and engi...
research
12/24/2020

Perturbed Runge-Kutta methods for mixed precision applications

In this work we consider a mixed precision approach to accelerate the im...

Please sign up or login with your details

Forgot password? Click here to reset