GLU3.0: Fast GPU-based Parallel Sparse LU Factorization for Circuit Simulation

08/01/2019
by   Shaoyi Peng, et al.
0

In this article, we propose a new GPU-based sparse LU factorization method, called GLU3.0, solves the aforementioned problems. First, it introduces a much more efficient double-U dependency detection algorithm to make the detection much simpler. Second, we observe that the potential parallelism is different as the matrix factorization goes on. We then develop three different modes of GPU kernel to adapt to different stages to accommodate the computing task changes in the factorization. As a result, the new GLU can dynamically allocate GPU blocks and wraps based on the number of columns in a level to better balance the computing demands and resources during the LU factorization process. Experimental results on circuit matrices from University of Florida Sparse Matrix Collection (UFL) show that the GLU3.0 can deliver 2-3 orders of magnitude speedup over GLU2.0 for the data dependency detection. Furthermore, GLU3.0 achieve 13.0X (arithmetic mean) and 6.7X (geometric mean) speedup over GLU2.0 and 7.1X (arithmetic mean) and 4.8X (geometric mean) over the recently proposed enhanced GLU2.0 sparse LU solver on the same set of circuit matrices.

READ FULL TEXT

page 1

page 5

research
01/22/2016

Task Parallel Incomplete Cholesky Factorization using 2D Partitioned-Block Layout

We introduce a task-parallel algorithm for sparse incomplete Cholesky fa...
research
02/03/2016

Inv-ASKIT: A Parallel Fast Diret Solver for Kernel Matrices

We present a parallel algorithm for computing the approximate factorizat...
research
03/10/2023

Multivariate to Bivariate Reduction for Noncommutative Polynomial Factorization

Based on a theorem of Bergman we show that multivariate noncommutative p...
research
08/23/2022

Scalable Linear Time Dense Direct Solver for 3-D Problems Without Trailing Sub-Matrix Dependencies

Factorization of large dense matrices are ubiquitous in engineering and ...
research
08/03/2022

A Hybrid Factorization Algorithm for Sparse Matrix with Mixed Precision Arithmetic

A new hybrid algorithm for LDU-factorization for large sparse matrix com...
research
03/07/2013

On the minimum FLOPs problem in the sparse Cholesky factorization

Prior to computing the Cholesky factorization of a sparse, symmetric pos...
research
06/12/2022

A Graph Transformation Strategy for Optimizing SpTRSV

Sparse triangular solve (SpTRSV) is an extensively studied computational...

Please sign up or login with your details

Forgot password? Click here to reset