A New High Performance and Scalable SVD algorithm on Distributed Memory Systems

06/16/2018
by   Shengguo Li, et al.
0

This paper introduces a high performance implementation of Zolo-SVD algorithm on distributed memory systems, which is based on the polar decomposition (PD) algorithm via the Zolotarev's function (Zolo-PD), originally proposed by Nakatsukasa and Freund [SIAM Review, 2016]. Our implementation highly relies on the routines of ScaLAPACK and therefore it is portable. Compared with the other PD algorithms such as the QR-based dynamically weighted Halley method (QDWH-PD), Zolo-PD is naturally parallelizable and has better scalability though performs more floating-point operations. When using many processes, Zolo-PD is usually 1.20 times faster than QDWH-PD algorithm, and Zolo-SVD can be about two times faster than the ScaLAPACK routine PDGESVD. These numerical experiments are performed on Tianhe-2 supercomputer, one of the fastest supercomputers in the world, and the tested matrices include some sparse matrices from particular applications and some randomly generated dense matrices with different dimensions. Our QDWH-SVD and Zolo-SVD implementations are freely available at https://github.com/shengguolsg/Zolo-SVD.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/25/2017

Computing low-rank approximations of large-scale matrices with the Tensor Network randomized SVD

We propose a new algorithm for the computation of a singular value decom...
research
04/29/2021

High-Performance Partial Spectrum Computation for Symmetric eigenvalue problems and the SVD

Current dense symmetric eigenvalue (EIG) and singular value decompositio...
research
08/17/2022

Distributed Out-of-Memory SVD on CPU/GPU Architectures

We propose an efficient, distributed, out-of-memory implementation of th...
research
09/10/2022

A mixed precision Jacobi SVD algorithm

We propose a mixed precision Jacobi algorithm for computing the singular...
research
10/22/2021

GPU-Accelerated Forward-Backward algorithm with Application to Lattice-Free MMI

We propose to express the forward-backward algorithm in terms of operati...
research
05/01/2019

High-performance sampling of generic Determinantal Point Processes

Determinantal Point Processes (DPPs) were introduced by Macchi as a mode...
research
12/11/2022

Orthogonal SVD Covariance Conditioning and Latent Disentanglement

Inserting an SVD meta-layer into neural networks is prone to make the co...

Please sign up or login with your details

Forgot password? Click here to reset