Low-Synch Gram-Schmidt with Delayed Reorthogonalization for Krylov Solvers

04/02/2021
by Daniel Bielich, et al.

The parallel strong scaling of Krylov iterative methods is largely determined by the number of global reductions required at each iteration. The GMRES and Krylov-Schur algorithms employ the Arnoldi algorithm for nonsymmetric matrices. The underlying orthogonalization scheme is left-looking and processes one column at a time, so at least one global reduction is required per iteration. The traditional algorithm for generating the orthogonal Krylov basis vectors for the Krylov-Schur algorithm is classical Gram-Schmidt applied twice, that is, with reorthogonalization (CGS2), which requires three global reductions per step. A new variant of CGS2 that requires only one reduction per iteration is applied to the Arnoldi-QR iteration. Strong-scaling results are presented for finding eigenvalue-pairs of nonsymmetric matrices. A preliminary attempt to derive a similar algorithm (one reduction per Arnoldi iteration with a robust orthogonalization scheme) was presented by Hernandez et al. (2007). Unlike our approach, their method is not forward stable for eigenvalues.
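To make the reduction count concrete, the following is a minimal serial sketch of the traditional CGS2 Arnoldi step described above, not the one-reduction variant the paper proposes. The function name `arnoldi_cgs2` and the `reductions` counter are illustrative assumptions. The counter marks each point where a distributed-memory implementation would need a global reduction: two projections and one norm per step.

```python
import numpy as np


def arnoldi_cgs2(A, v0, m):
    """Arnoldi with classical Gram-Schmidt applied twice (CGS2).

    Hypothetical helper for illustration only. Returns the basis Q
    (n x (m+1)), the Hessenberg matrix H ((m+1) x m), and the number of
    points inside the loop where a distributed run would need a global
    reduction (the initial normalization is not counted).
    """
    n = v0.shape[0]
    Q = np.zeros((n, m + 1))
    H = np.zeros((m + 1, m))
    reductions = 0

    Q[:, 0] = v0 / np.linalg.norm(v0)
    for j in range(m):
        w = A @ Q[:, j]           # matrix-vector product, no reduction
        Qj = Q[:, : j + 1]        # basis vectors computed so far

        # First CGS pass: all inner products at once (reduction 1).
        h = Qj.T @ w
        reductions += 1
        w = w - Qj @ h

        # Second pass, the reorthogonalization (reduction 2).
        c = Qj.T @ w
        reductions += 1
        w = w - Qj @ c
        h = h + c

        # Normalization of the new basis vector (reduction 3).
        beta = np.linalg.norm(w)
        reductions += 1
        if beta == 0.0:
            # Assumption: breakdown is not handled in this sketch.
            raise np.linalg.LinAlgError("Arnoldi breakdown")

        H[: j + 1, j] = h
        H[j + 1, j] = beta
        Q[:, j + 1] = w / beta

    return Q, H, reductions


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    A = rng.standard_normal((50, 50))
    Q, H, reductions = arnoldi_cgs2(A, rng.standard_normal(50), 10)
    print("reductions per step:", reductions / 10)
    print("loss of orthogonality:", np.linalg.norm(Q.T @ Q - np.eye(11)))
```

In this sketch the three synchronization points are separated by data dependencies: each pass needs the result of the previous one. A low-synchronization variant has to remove or defer those dependencies to reach one reduction per iteration.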


research
10/19/2021

Performance of Low Synchronization Orthogonalization Methods in Anderson Accelerated Fixed Point Solvers

Anderson Acceleration (AA) is a method to accelerate the convergence of ...
research
01/23/2023

Augmented Block-Arnoldi Recycling CFD Solvers

One of the limitations of recycled GCRO methods is the large amount of c...
research
05/16/2022

Post-Modern GMRES

The GMRES algorithm of Saad and Schultz (1986) for nonsymmetric linear s...
research
12/11/2018

A Non-iterative Parallelizable Eigenbasis Algorithm for Johnson Graphs

We present a new O(k^2 \binom{n}{k}^2) method for generating an orthogonal basis o...
research
06/03/2019

A Direct Õ(1/ε) Iteration Parallel Algorithm for Optimal Transport

Optimal transportation, or computing the Wasserstein or “earth mover's” ...
research
05/04/2019

New communication hiding conjugate gradient variants

The conjugate gradient algorithm suffers from communication bottlenecks ...
research
11/18/2021

ILU Smoothers for Low Mach Navier-Stokes Pressure Solvers

Incomplete LU (ILU) smoothers are effective in the algebraic multigrid (...
