Lightweight Parallel Foundations: a model-compliant communication layer

06/07/2019
by   Wijnand Suijlen, et al.
0

We present the Lightweight Parallel Foundations (LPF), an interoperable and model-compliant communication layer adhering to a strict performance model of parallel computations. LPF consists of twelve primitives, each with strict performance guarantees, two of which enable interoperability. We argue that the principles of interoperability and model compliance suffice for the practical use of immortal algorithms: algorithms that are proven optimal once, and valid forever. These are ideally also implemented once, and usable from a wide range of sequential and parallel environments. This paradigm is evaluated by implementing an immortal fast Fourier transform (FFT) using LPF, and compared to state-of-the-art FFT implementations. We find it performs on par to Intel MKL FFT while consistently outperforming FFTW, thus showing model compliance can be achieved without sacrificing performance. Interoperability encourages the propagation of immortal algorithms as widely as possible. We evaluate this by integrating an LPF PageRank into Spark, without changing any PageRank nor Spark source codes, and while requiring only a minimal interface layer.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/12/2020

CROFT: A scalable three-dimensional parallel Fast Fourier Transform (FFT) implementation for High Performance Clusters

The FFT of three-dimensional (3D) input data is an important computation...
research
02/09/2020

Large-Scale Discrete Fourier Transform on TPUs

In this work, we present two parallel algorithms for the large-scale dis...
research
11/27/2017

2D Image Convolution using Three Parallel Programming Models on the Xeon Phi

Image convolution is widely used for sharpening, blurring and edge detec...
research
08/30/2020

Low-Depth Parallel Algorithms for the Binary-Forking Model without Atomics

The binary-forking model is a parallel computation model, formally defin...
research
12/14/2020

Group Communication with Context Codec for Ultra-Lightweight Source Separation

Ultra-lightweight model design is an important topic for the deployment ...
research
07/08/2023

Parallel Algorithms Align with Neural Execution

Neural algorithmic reasoners are parallel processors. Teaching them sequ...
research
10/20/2020

Optimising the Performance of Convolutional Neural Networks across Computing Systems using Transfer Learning

The choice of convolutional routines (primitives) to implement neural ne...

Please sign up or login with your details

Forgot password? Click here to reset