Feature-based SpMV Performance Analysis on Contemporary Devices

02/08/2023
by   Panagiotis Mpakos, et al.
0

The SpMV kernel is characterized by high performance variation per input matrix and computing platform. While GPUs were considered State-of-the-Art for SpMV, with the emergence of advanced multicore CPUs and low-power FPGA accelerators, we need to revisit its performance and energy efficiency. This paper provides a high-level SpMV performance analysis based on structural features of matrices related to common bottlenecks of memory-bandwidth intensity, low ILP, load imbalance and memory latency overheads. Towards this, we create a wide artificial matrix dataset that spans these features and study the performance of different storage formats in nine modern HPC platforms; five CPUs, three GPUs and an FPGA. After validating our proposed methodology using real-world matrices, we analyze our extensive experimental results and draw key insights on the competitiveness of different target architectures for SpMV and the impact of each feature/bottleneck on its performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/19/2022

Virtual Screening on FPGA: Performance and Energy versus Effort

With their widespread availability, FPGA-based accelerators cards have b...
research
10/31/2019

Direct N-body application on low-power and energy-efficient parallel architectures

The aim of this work is to quantitatively evaluate the impact of computa...
research
10/20/2020

Revisiting Huffman Coding: Toward Extreme Performance on Modern GPU Architectures

Today's high-performance computing (HPC) applications are producing vast...
research
03/04/2019

Performance Analysis of Effective Symbolic Methods for Solving Band Matrix SLAEs

This paper presents an experimental performance study of implementations...
research
10/18/2020

Optimizing Memory Performance of Xilinx FPGAs under Vitis

Plenty of research efforts have been devoted to FPGA-based acceleration,...
research
05/16/2023

Case Study for Running Memory-Bound Kernels on RISC-V CPUs

The emergence of a new, open, and free instruction set architecture, RIS...
research
05/09/2020

Benchmarking High Bandwidth Memory on FPGAs

FPGAs are starting to be enhanced with High Bandwidth Memory (HBM) as a ...

Please sign up or login with your details

Forgot password? Click here to reset