Parallelization and scalability analysis of inverse factorization using the Chunks and Tasks programming model

01/23/2019
by   Anton G. Artemov, et al.
0

We present three methods for distributed memory parallel inverse factorization of block-sparse Hermitian positive definite matrices. The three methods are a recursive variant of the AINV inverse Cholesky algorithm, iterative refinement, and localized inverse factorization, respectively. All three methods are implemented using the Chunks and Tasks programming model, building on the distributed sparse quad-tree matrix representation and parallel matrix-matrix multiplication in the publicly available Chunks and Tasks Matrix Library (CHTML). Although the algorithms are generally applicable, this work was mainly motivated by the need for efficient and scalable inverse factorization of the basis set overlap matrix in large scale electronic structure calculations. We perform various computational tests on overlap matrices for quasi-linear Glutamic Acid-Alanine molecules and three-dimensional water clusters discretized using the standard Gaussian basis set STO-3G with up to more than 10 million basis functions. We show that for such matrices the computational cost increases only linearly with system size for all the three methods. We show both theoretically and in numerical experiments that the methods based on iterative refinement and localized inverse factorization outperform previous parallel implementations in weak scaling tests where the system size is increased in direct proportion to the number of processes. We show also that compared to the method based on pure iterative refinement the localized inverse factorization requires much less communication.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/19/2019

Sparse approximate matrix multiplication in a fully recursive distributed task-based parallel framework

In this paper we consider parallel implementations of approximate multip...
research
11/23/2020

The Chunks and Tasks Matrix Library 2.0

We present a C++ header-only parallel sparse matrix library, based on sp...
research
11/08/2020

LDU factorization

LU-factorization of matrices is one of the fundamental algorithms of lin...
research
05/20/2020

Sparse approximate matrix-matrix multiplication with error control

We propose a method for strict error control in sparse approximate matri...
research
02/18/2022

Optimization of the Sparse Multi-Threaded Cholesky Factorization for A64FX

Sparse linear algebra routines are fundamental building blocks of a larg...
research
07/01/2016

Distributed-memory Hierarchical Interpolative Factorization

The hierarchical interpolative factorization (HIF) offers an efficient w...
research
10/11/2019

Optimal Approximation of Doubly Stochastic Matrices

We consider the least-squares approximation of a matrix C in the set of ...

Please sign up or login with your details

Forgot password? Click here to reset