Performance Analysis and Optimal Node-Aware Communication for Enlarged Conjugate Gradient Methods

03/11/2022
by   Shelby Lockhart, et al.
0

Krylov methods are a key way of solving large sparse linear systems of equations, but suffer from poor strong scalabilty on distributed memory machines. This is due to high synchronization costs from large numbers of collective communication calls alongside a low computational workload. Enlarged Krylov methods address this issue by decreasing the total iterations to convergence, an artifact of splitting the initial residual and resulting in operations on block vectors. In this paper, we present a performance study of an Enlarged Krylov Method, Enlarged Conjugate Gradients (ECG), noting the impact of block vectors on parallel performance at scale. Most notably, we observe the increased overhead of point-to-point communication as a result of denser messages in the sparse matrix-block vector multiplication kernel. Additionally, we present models to analyze expected performance of ECG, as well as, motivate design decisions. Most importantly, we introduce a new point-to-point communication approach based on node-aware communication techniques that increases efficiency of the method at scale.

READ FULL TEXT
research
12/23/2016

Node Aware Sparse Matrix-Vector Multiplication

The sparse matrix-vector multiply (SpMV) operation is a key computationa...
research
06/06/2018

Improving Performance Models for Irregular Point-to-Point Communication

Parallel applications are often unable to take full advantage of emergin...
research
12/26/2019

Strategies for the vectorized Block Conjugate Gradients method

Block Krylov methods have recently gained a lot of attraction. Due to th...
research
05/09/2022

The Role of Idle Waves, Desynchronization, and Bottleneck Evasion in the Performance of Parallel Programs

The performance of highly parallel applications on distributed-memory sy...
research
09/13/2022

Characterizing the Performance of Node-Aware Strategies for Irregular Point-to-Point Communication on Heterogeneous Architectures

Supercomputer architectures are trending toward higher computational thr...
research
04/06/2021

Hardware-Oriented Krylov Methods for High-Performance Computing

Krylov subspace methods are an essential building block in numerical sim...
research
02/22/2016

Preconditioning Kernel Matrices

The computational and storage complexity of kernel machines presents the...

Please sign up or login with your details

Forgot password? Click here to reset