Hierarchical Block Multi-Color Ordering: A New Parallel Ordering Method for Vectorization and Parallelization of the Sparse Triangular Solver in the ICCG Method

08/02/2019
by Takeshi Iwashita, et al.

In this paper, we propose a new parallel ordering method, called hierarchical block multi-color ordering, to vectorize and parallelize the sparse triangular solver. With this method, the parallel forward and backward substitutions can be vectorized while preserving the advantages of block multi-color ordering, namely fast convergence and fewer thread synchronizations. To evaluate the proposed method in a parallel ICCG (Incomplete Cholesky Conjugate Gradient) solver, numerical tests were conducted using five test matrices on three types of computational nodes. The numerical results indicate that the proposed method outperforms the conventional block and nodal multi-color ordering methods in 13 out of 15 test cases, which supports the effectiveness of the method.
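For readers unfamiliar with the baseline mentioned in the abstract, the following is a minimal sketch of conventional nodal multi-color ordering applied to a forward substitution. It is not the hierarchical block method proposed in the paper. The function names, the dict-of-rows sparse format, and the 1D Laplacian test matrix are assumptions chosen for illustration. The sketch shows why coloring enables parallelism: unknowns of the same color are not coupled, so each inner loop over one color could be executed in parallel or vectorized.

```python
def greedy_coloring(rows):
    """Assign each unknown the smallest color not used by its neighbors.

    rows[i] is a dict {column index: value} for row i of a symmetric
    sparse matrix (hypothetical storage format for this sketch).
    """
    color = [-1] * len(rows)
    for i in range(len(rows)):
        used = {color[j] for j in rows[i] if j != i and color[j] >= 0}
        c = 0
        while c in used:
            c += 1
        color[i] = c
    return color


def colorwise_forward_substitution(rows, b, color):
    """Solve L y = b, where L is the lower triangle of the matrix after
    reordering the unknowns by color.

    After reordering, the off-diagonal lower-triangular entries of row i
    are exactly the couplings to unknowns with a smaller color.
    """
    n = len(b)
    y = [0.0] * n
    for c in range(max(color) + 1):      # colors are processed sequentially
        for i in range(n):               # unknowns within a color are independent
            if color[i] != c:
                continue
            s = sum(v * y[j] for j, v in rows[i].items() if color[j] < c)
            y[i] = (b[i] - s) / rows[i][i]
    return y


def laplacian_1d(n):
    """Tridiagonal (-1, 2, -1) matrix, used only as a small test case."""
    rows = []
    for i in range(n):
        row = {i: 2.0}
        if i > 0:
            row[i - 1] = -1.0
        if i < n - 1:
            row[i + 1] = -1.0
        rows.append(row)
    return rows


A = laplacian_1d(4)
colors = greedy_coloring(A)              # red-black pattern for a 1D chain
y = colorwise_forward_substitution(A, [1.0, 1.0, 1.0, 1.0], colors)
```

In this nodal scheme, one synchronization is needed per color. As the abstract notes, block multi-color ordering reduces the number of synchronizations and improves convergence, and the proposed hierarchical variant additionally makes the substitutions vectorizable.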


research
12/02/2019

An asynchronous incomplete block LU preconditioner for computational fluid dynamics on unstructured grids

We present a study of the effectiveness of asynchronous incomplete LU fa...
research
10/31/2017

Beyond Shared Hierarchies: Deep Multitask Learning through Soft Layer Ordering

Existing deep multitask learning (MTL) approaches align layers shared be...
research
07/12/2019

Equal bi-Vectorized (EBV) method to high performance on GPU

Due to importance of reducing of time solution in numerical codes, we pr...
research
03/07/2022

Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences

Parallelizing Gated Recurrent Unit (GRU) networks is a challenging task,...
research
02/27/2020

Usual stochastic ordering results for series and parallel systems with components having Exponentiated Chen distribution

In this paper, we have discussed the usual stochastic ordering relations...
research
08/18/2023

On Block Cholesky Decomposition for Sparse Inverse Covariance Estimation

The modified Cholesky decomposition is popular for inverse covariance es...
research
07/31/2019

Testing performance with and without Block Low Rank Compression in MUMPS and the new PaStiX 6.0 for JOREK nonlinear MHD simulations

The interface to the MUMPS solver was updated in the JOREK MHD code to s...
