MSREP: A Fast yet Light Sparse Matrix Framework for Multi-GPU Systems

09/15/2022
by   Jieyang Chen, et al.
0

Sparse linear algebra kernels play a critical role in numerous applications, covering from exascale scientific simulation to large-scale data analytics. Offloading linear algebra kernels on one GPU will no longer be viable in these applications, simply because the rapidly growing data volume may exceed the memory capacity and computing power of a single GPU. Multi-GPU systems nowadays being ubiquitous in supercomputers and data-centers present great potentials in scaling up large sparse linear algebra kernels. In this work, we design a novel sparse matrix representation framework for multi-GPU systems called MSREP, to scale sparse linear algebra operations based on our augmented sparse matrix formats in a balanced pattern. Different from dense operations, sparsity significantly intensifies the difficulty of distributing the computation workload among multiple GPUs in a balanced manner. We enhance three mainstream sparse data formats – CSR, CSC, and COO, to enable fine-grained data distribution. We take sparse matrix-vector multiplication (SpMV) as an example to demonstrate the efficiency of our MSREP framework. In addition, MSREP can be easily extended to support other sparse linear algebra kernels based on the three fundamental formats (i.e., CSR, CSC and COO).

READ FULL TEXT

page 3

page 6

page 9

page 10

research
11/16/2020

Indirection Stream Semantic Register Architecture for Efficient Sparse-Dense Linear Algebra

Sparse-dense linear algebra is crucial in many domains, but challenging ...
research
12/13/2020

Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures

Designing efficient and scalable sparse linear algebra kernels on modern...
research
11/17/2020

Audience Creation for Consumables – Simple and Scalable Precision Merchandising for a Growing Marketplace

Consumable categories, such as grocery and fast-moving consumer goods, a...
research
09/17/2018

AlSub: Fully Parallel and Modular Subdivision

In recent years, mesh subdivision---the process of forging smooth free-f...
research
12/10/2022

Optimized Sparse Matrix Operations for Reverse Mode Automatic Differentiation

Sparse matrix representations are ubiquitous in computational science an...
research
06/26/2021

GSmart: An Efficient SPARQL Query Engine Using Sparse Matrix Algebra – Full Version

Efficient execution of SPARQL queries over large RDF datasets is a topic...
research
09/17/2018

AlSub: Fully Parallel Subdivision for Modeling and Rendering

Mesh subdivision is a key geometric modeling task which forges smooth, s...

Please sign up or login with your details

Forgot password? Click here to reset