Decoupled Block-Wise ILU(k) Preconditioner on GPU

03/03/2017
by Hui Liu, et al.

This research investigates the implementation of a block-wise ILU(k) preconditioner on GPU. The block-wise ILU(k) algorithm treats both the level k and the block size as variables. The decoupled ILU(k) algorithm consists of a symbolic phase and a factorization phase. In the symbolic phase, an ILU(k) nonzero pattern is established from the point-wise structure extracted from the block-wise matrix. In the factorization phase, the block-wise matrix with a variable block size is factorized into a block lower triangular matrix and a block upper triangular matrix. A further diagonal factorization must then be performed on the block upper triangular matrix to adapt it to a parallel triangular solver on GPU. We also present numerical experiments that study the behavior of the preconditioner for different levels k and block sizes.
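
The symbolic phase described above can be illustrated with a minimal sketch. It applies the textbook level-of-fill rule, lev(i, j) = min(lev(i, j), lev(i, p) + lev(p, j) + 1), to a sparsity pattern in which each position stands for one block, so the pattern is independent of the block size. The function name `symbolic_iluk` and its parameters are hypothetical, the code runs sequentially on the CPU, and it assumes a structurally nonzero diagonal; it is not the authors' GPU implementation.

```python
def symbolic_iluk(pattern, n, k):
    """Return {(i, j): level} for an ILU(k) fill pattern (illustrative sketch).

    pattern: iterable of (row, col) positions of structurally nonzero blocks.
    n:       number of block rows.
    k:       maximum fill level to keep.
    """
    # One dict per row, mapping column -> fill level.
    # Original nonzeros have level 0.
    rows = [dict() for _ in range(n)]
    for i, j in pattern:
        rows[i][j] = 0
    for i in range(n):
        rows[i].setdefault(i, 0)  # assumption: diagonal is structurally nonzero

    for i in range(n):
        row = rows[i]
        done = set()
        while True:
            # Pivot columns left of the diagonal, taken in ascending order.
            # Fill created by one pivot can become a later pivot.
            candidates = [c for c in row if c < i and c not in done]
            if not candidates:
                break
            p = min(candidates)
            done.add(p)
            for j, level_pj in rows[p].items():
                if j <= p:
                    continue  # only the upper part of the pivot row contributes
                new_level = row[p] + level_pj + 1
                # Keep the entry only if its level does not exceed k.
                if new_level <= k and new_level < row.get(j, k + 1):
                    row[j] = new_level

    return {(i, j): lev for i in range(n) for j, lev in rows[i].items()}


# Small 4 x 4 block pattern: diagonal plus three off-diagonal blocks.
example = [(0, 0), (1, 1), (2, 2), (3, 3), (1, 0), (0, 3), (2, 1)]
levels_k0 = symbolic_iluk(example, 4, 0)
levels_k1 = symbolic_iluk(example, 4, 1)
levels_k2 = symbolic_iluk(example, 4, 2)
```

In this small example, ILU(0) keeps only the original positions, ILU(1) adds a fill block at (1, 3), and ILU(2) additionally adds (2, 3). Because the pattern depends only on k and the block structure, it can be computed once and reused by a factorization phase for any block size, which is the separation the decoupled algorithm relies on.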


research
01/02/2019

BMF: Block matrix approach to factorization of large scale data

Matrix Factorization (MF) on large scale matrices is computationally as ...
research
10/20/2020

On Application of Block Kaczmarz Methods in Matrix Factorization

Matrix factorization techniques compute low-rank product approximations ...
research
08/12/2022

Parallel QR Factorization of Block Low-Rank Matrices

We present two new algorithms for Householder QR factorization of Block ...
research
02/26/2018

Symmetric indefinite triangular factorization revealing the rank profile matrix

We present a novel recursive algorithm for reducing a symmetric matrix t...
research
07/02/2020

GSoFa: Scalable Sparse LU Symbolic Factorization on GPUs

Decomposing a matrix A into a lower matrix L and an upper matrix U, whic...
research
02/21/2023

LU-Net: Invertible Neural Networks Based on Matrix Factorization

LU-Net is a simple and fast architecture for invertible neural networks ...
research
01/22/2016

Task Parallel Incomplete Cholesky Factorization using 2D Partitioned-Block Layout

We introduce a task-parallel algorithm for sparse incomplete Cholesky fa...
