CROFT: A scalable three-dimensional parallel Fast Fourier Transform (FFT) implementation for High Performance Clusters

02/12/2020
by   Vivek Gavane, et al.
0

The FFT of three-dimensional (3D) input data is an important computational kernel of numerical simulations and is widely used in High Performance Computing (HPC) codes running on a large number of processors. Performance of many scientific applications such as Molecular Dynamic simulations depends on the underlying 3D parallel FFT library being used. In this paper, we present C-DACs three-dimensional Fast Fourier Transform (CROFT) library which implements three-dimensional parallel FFT using pencil decomposition. To exploit the hyperthreading capabilities of processor cores without affecting performance, CROFT is designed to use multithreading along with MPI. CROFT implementation has an innovative feature of overlapping compute and memory-I/O with MPI communication using multithreading. As opposed to other 3D FFT implementations, CROFT uses only two threads where one thread is dedicated for communication so that it can be effectively overlapped with computations. Thus, depending on the number of processes used, CROFT achieves performance improvement of about 51

READ FULL TEXT

page 5

page 7

page 8

page 11

page 21

page 22

page 23

research
09/06/2022

A Multi-FPGA High Performance Computing System for 3D FFT-based Numerical Simulations

In the field of High Performance Computing, communications among process...
research
02/09/2020

Large-Scale Discrete Fourier Transform on TPUs

In this work, we present two parallel algorithms for the large-scale dis...
research
05/07/2019

P3DFFT: a framework for parallel computations of Fourier transforms in three dimensions

Fourier and related transforms is a family of algorithms widely employed...
research
08/29/2022

Breaking Down the Parallel Performance of GROMACS, a High-Performance Molecular Dynamics Software

GROMACS is one of the most widely used HPC software packages using the M...
research
03/22/2022

Minimizing communication in the multidimensional FFT

We present a parallel algorithm for the fast Fourier transform (FFT) in ...
research
11/14/2022

FLUPS – a flexible and performant massively parallel Fourier transform library

Massively parallel Fourier transforms are widely used in computational s...
research
06/07/2019

Lightweight Parallel Foundations: a model-compliant communication layer

We present the Lightweight Parallel Foundations (LPF), an interoperable ...

Please sign up or login with your details

Forgot password? Click here to reset