Benchmarking a Proof-of-Concept Performance Portable SYCL-based Fast Fourier Transformation Library

03/17/2022
by   Vincent R. Pascuzzi, et al.

In this paper, we present an early version of a SYCL-based FFT library capable of running on all major vendor hardware, including CPUs and GPUs from AMD, ARM, Intel and NVIDIA. Although preliminary, this work aims to seed further development of a rich set of features for calculating FFTs. Its advantage over existing portable FFT libraries is that it is single-source, which removes the complexity that arises from heavy use of preprocessor macros and auto-generated kernels to target different architectures. We exercise two SYCL-enabled compilers, Codeplay ComputeCpp and Intel's open-source LLVM project, to evaluate the performance portability of our SYCL-based FFT on various heterogeneous architectures. The library is currently limited to single-dimension FFTs of up to 2^11 in length and to base-2 input sequences. We compare our results with highly optimized vendor-specific FFT libraries and provide a detailed analysis to demonstrate a fair level of performance, as well as potential sources of performance bottlenecks.
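To make the stated limits concrete, the following is a minimal host-only C++ sketch of a one-dimensional radix-2 FFT. It is not the authors' implementation and contains no SYCL code. It assumes that "base-2 input sequences" means power-of-two lengths, and the names `fft_radix2`, `is_power_of_two` and `kMaxLength` are hypothetical, introduced only for illustration.

```cpp
#include <cmath>
#include <complex>
#include <cstddef>
#include <stdexcept>
#include <utility>
#include <vector>

using cplx = std::complex<double>;

// Hypothetical cap mirroring the 2^11 maximum length quoted in the abstract.
constexpr std::size_t kMaxLength = std::size_t{1} << 11;

// True when n is a power of two (n >= 1).
bool is_power_of_two(std::size_t n) { return n != 0 && (n & (n - 1)) == 0; }

// In-place iterative radix-2 decimation-in-time forward FFT (unnormalized).
// Assumption: lengths that are not a power of two, or exceed kMaxLength,
// are rejected rather than padded.
void fft_radix2(std::vector<cplx>& a) {
    const std::size_t n = a.size();
    if (!is_power_of_two(n) || n > kMaxLength) {
        throw std::invalid_argument("length must be a power of two, at most 2^11");
    }

    // Reorder the input into bit-reversed index order.
    for (std::size_t i = 1, j = 0; i < n; ++i) {
        std::size_t bit = n >> 1;
        for (; j & bit; bit >>= 1) j ^= bit;
        j ^= bit;
        if (i < j) std::swap(a[i], a[j]);
    }

    // Butterfly stages: combine sub-transforms of length len/2 into length len.
    const double pi = std::acos(-1.0);
    for (std::size_t len = 2; len <= n; len <<= 1) {
        const double angle = -2.0 * pi / static_cast<double>(len);
        const cplx wlen(std::cos(angle), std::sin(angle));
        for (std::size_t start = 0; start < n; start += len) {
            cplx w(1.0, 0.0);
            for (std::size_t k = 0; k < len / 2; ++k) {
                const cplx u = a[start + k];
                const cplx v = a[start + k + len / 2] * w;
                a[start + k] = u + v;
                a[start + k + len / 2] = u - v;
                w *= wlen;
            }
        }
    }
}
```

Calling `fft_radix2` on a `std::vector<cplx>` of length 8 transforms it in place, while a length of 3 or 4096 throws `std::invalid_argument`. A SYCL version would typically move the butterfly loops into device kernels; that step is omitted here.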


research
03/18/2021

Porting a sparse linear algebra math library to Intel GPUs

With the announcement that the Aurora Supercomputer will be composed of ...
research
04/18/2017

LibOPT: An Open-Source Platform for Fast Prototyping Soft Optimization Techniques

Optimization techniques play an important role in several scientific and...
research
08/01/2016

A survey of sparse matrix-vector multiplication performance on large matrices

We contribute a third-party survey of sparse matrix-vector (SpMV) produc...
research
10/18/2017

Wilson and Domainwall Kernels on Oakforest-PACS

We report the performance of Wilson and Domainwall Kernels on a new Inte...
research
08/16/2019

ArborX: A Performance Portable Search Library

Searching for geometric objects that are close in space is a fundamental...
research
04/25/2022

Investigating Black-Box Function Recognition Using Hardware Performance Counters

This paper presents new methods and results for learning information abo...
research
08/16/2019

ArborX: A Performance Portable Geometric Search Library

Searching for geometric objects that are close in space is a fundamental...
