A Two-level GPU-Accelerated Incomplete LU Preconditioner for General Sparse Linear Systems

03/15/2023
by Tianshi Xu, et al.

This paper presents a parallel preconditioning approach based on incomplete LU (ILU) factorizations in the framework of Domain Decomposition (DD) for general sparse linear systems. We focus on distributed-memory parallel architectures, specifically those equipped with graphics processing units (GPUs). In addition to block Jacobi, we present general-purpose two-level ILU Schur complement-based approaches, with several strategies for solving the coarse-level reduced system. These strategies are combined with modified ILU methods in the construction of the coarse-level operator in order to effectively remove smooth errors. We leverage available GPU-based sparse matrix kernels to accelerate the setup and solve phases of the proposed ILU preconditioner. We evaluate the efficiency of the proposed methods as a smoother for algebraic multigrid (AMG) and as a preconditioner for Krylov subspace methods, on challenging anisotropic diffusion problems and a collection of general sparse matrices.
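To make the two-level idea concrete, the sketch below solves a tiny system by eliminating the subdomain-interior unknowns and solving a reduced (Schur complement) system on the interface unknowns. It is an illustration under stated assumptions, not the authors' implementation: the block names `B`, `F`, `E`, `C`, the helper functions, and the 1D Laplacian test problem are all hypothetical, exact dense solves stand in for the ILU factorizations, and nothing here runs on a GPU.

```python
# Illustrative two-level Schur complement solve (pure Python, dense, exact).
# Assumption: unknowns are reordered so that A = [[B, F], [E, C]], where B couples
# subdomain-interior unknowns (block diagonal) and C couples interface unknowns.
# In an ILU-based preconditioner the solves with B and S would be approximate.

def solve(M, rhs):
    """Solve M x = rhs by Gaussian elimination with partial pivoting."""
    n = len(M)
    aug = [list(M[i]) + [rhs[i]] for i in range(n)]
    for k in range(n):
        p = max(range(k, n), key=lambda i: abs(aug[i][k]))
        aug[k], aug[p] = aug[p], aug[k]
        for i in range(k + 1, n):
            factor = aug[i][k] / aug[k][k]
            for j in range(k, n + 1):
                aug[i][j] -= factor * aug[k][j]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = aug[i][n] - sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = s / aug[i][i]
    return x

def matvec(M, v):
    """Dense matrix-vector product."""
    return [sum(a * b for a, b in zip(row, v)) for row in M]

def two_level_solve(B, F, E, C, f, g):
    """Solve [[B, F], [E, C]] [u; y] = [f; g] via S = C - E B^{-1} F."""
    m = len(C)
    # One interior solve per interface unknown gives the columns of B^{-1} F.
    cols = [solve(B, [row[j] for row in F]) for j in range(m)]
    # e_cols[j] is column j of E B^{-1} F.
    e_cols = [matvec(E, cols[j]) for j in range(m)]
    S = [[C[i][j] - e_cols[j][i] for j in range(m)] for i in range(m)]
    # Reduced right-hand side: g - E B^{-1} f.
    b_inv_f = solve(B, f)
    g_hat = [gi - ei for gi, ei in zip(g, matvec(E, b_inv_f))]
    # Coarse-level (interface) solve, then back-substitution into the interiors.
    y = solve(S, g_hat)
    u = solve(B, [fi - ti for fi, ti in zip(f, matvec(F, y))])
    return u, y

# 1D Laplacian on 5 nodes; node 2 is the interface between interiors {0, 1} and {3, 4}.
B = [[2.0, -1.0, 0.0, 0.0],
     [-1.0, 2.0, 0.0, 0.0],
     [0.0, 0.0, 2.0, -1.0],
     [0.0, 0.0, -1.0, 2.0]]
F = [[0.0], [-1.0], [-1.0], [0.0]]
E = [[0.0, -1.0, -1.0, 0.0]]
C = [[2.0]]
f = [1.0, 0.0, 0.0, 1.0]  # right-hand side chosen so the exact solution is all ones
g = [0.0]

u, y = two_level_solve(B, F, E, C, f, g)
print(u, y)
```

Because `B` is block diagonal, the interior solves are independent per subdomain, which is what makes this structure attractive for parallel execution. Dropping the coupling blocks `F` and `E` entirely would reduce the scheme to a block Jacobi solve.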

research
04/10/2023

An Experimental Study of Two-Level Schwarz Domain Decomposition Preconditioners on GPUs

The generalized Dryja–Smith–Widlund (GDSW) preconditioner is a two-level...
research
09/01/2021

Accelerating an Iterative Eigensolver for Nuclear Structure Configuration Interaction Calculations on GPUs using OpenACC

To accelerate the solution of large eigenvalue problems arising from man...
research
10/10/2019

DBCSR: A Library for Dense Matrix Multiplications on Distributed GPU-Accelerated Systems

Most, if not all the modern scientific simulation packages utilize matri...
research
02/12/2020

Algebraic multigrid block preconditioning for multi-group radiation diffusion equations

The paper focuses on developing and studying efficient block preconditio...
research
05/04/2022

parGeMSLR: A Parallel Multilevel Schur Complement Low-Rank Preconditioning and Solution Package for General Sparse Matrices

This paper discusses parGeMSLR, a C++/MPI software library for the solut...
research
05/21/2021

Contention-Aware GPU Partitioning and Task-to-Partition Allocation for Real-Time Workloads

In order to satisfy timing constraints, modern real-time applications re...
research
10/14/2021

Tuning Spectral Element Preconditioners for Parallel Scalability on GPUs

The Poisson pressure solve resulting from the spectral element discretiz...
