Subdomain Deflation Combined with Local AMG: a Case Study Using AMGCL Library

10/11/2017
by   Denis Demidov, et al.
0

The paper proposes a combination of the subdomain deflation method and local algebraic multigrid as a scalable distributed memory preconditioner that is able to solve large linear systems of equations. The implementation of the algorithm is made available for the community as part of an open source AMGCL library. The solution targets both homogeneous (CPU-only) and heterogeneous (CPU/GPU) systems, employing hybrid MPI/OpenMP approach in the former and a combination of MPI, OpenMP, and CUDA in the latter cases. The use of OpenMP minimizes the number of MPI processes, thus reducing the communication overhead of the deflation method and improving both weak and strong scalability of the preconditioner. The examples of scalar, Poisson-like, systems as well as non-scalar problems, stemming out of the discretization of the Navier-Stokes equations, are considered in order to estimate performance of the implemented algorithm. A comparison with a traditional global AMG preconditioner based on a well-established Trilinos ML package is provided.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/11/2017

Subdomain Deflation and Algebraic Multigrid: Combining Multiscale with Multilevel

The paper proposes a combination of the subdomain deflation method and l...
research
11/23/2015

Developing a High Performance Software Library with MPI and CUDA for Matrix Computations

Nowadays, the paradigm of parallel computing is changing. CUDA is now a ...
research
03/04/2023

Multi-GPU aggregation-based AMG preconditioner for iterative linear solvers

We present and release in open source format a sparse linear solver whic...
research
05/15/2019

Improving strong scaling of the Conjugate Gradient method for solving large linear systems using global reduction pipelining

This paper presents performance results comparing MPI-based implementati...
research
03/25/2020

A Hybrid MPI+Threads Approach to Particle Group Finding Using Union-Find

The Friends-of-Friends (FoF) algorithm is a standard technique used in c...
research
10/26/2020

Parallelizing multiple precision Taylor series method for integrating the Lorenz system

A hybrid MPI+OpenMP strategy for parallelizing multiple precision Taylor...
research
05/21/2022

MapReduce for Counting Word Frequencies with MPI and GPUs

In this project, the goal was to use the Julia programming language and ...

Please sign up or login with your details

Forgot password? Click here to reset