P3DFFT: a framework for parallel computations of Fourier transforms in three dimensions

05/07/2019
by Dmitry Pekurovsky, et al.

Fourier and related transforms are a family of algorithms widely employed in diverse areas of computational science, yet they are notoriously difficult to scale on high-performance parallel computers with a large number of processing elements (cores). This paper introduces P3DFFT, a popular software package that implements Fast Fourier Transforms (FFT) in three dimensions (3D) in a highly efficient and scalable way. It overcomes a well-known scalability bottleneck of 3D FFT implementations by using two-dimensional domain decomposition. Designed for portable performance, P3DFFT achieves excellent timings for a number of systems and problem sizes. On a Cray XT5 system, P3DFFT attains 45% efficiency in weak scaling from 128 to 65,536 computational cores. Library features include Fourier and Chebyshev transforms, Fortran and C interfaces, in-place and out-of-place transforms, uneven data grids, and single and double precision. P3DFFT is available as open source at http://code.google.com/p/p3dfft/. This paper discusses the implementation and performance of P3DFFT in a way that helps users make optimal choices for the parameters of their runs.
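
To make the two-dimensional ("pencil") decomposition mentioned in the abstract concrete, here is a minimal Python sketch. It is not P3DFFT's API: the function names, the rank-to-grid mapping, and the choice of which dimensions are split are all assumptions made for illustration. It shows how an nx x ny x nz grid might be divided over an m1 x m2 process grid, including uneven grids, and why splitting two dimensions raises the limit on usable ranks for an N^3 grid from N (one-dimensional "slab" decomposition) to N^2.

```python
def split_even(n, parts):
    """Split n grid points over `parts` ranks as evenly as possible.

    The first (n % parts) ranks get one extra point, which is one common
    way to handle grids that do not divide evenly (an assumption here).
    """
    base, extra = divmod(n, parts)
    return [base + (1 if i < extra else 0) for i in range(parts)]


def pencil_shape(nx, ny, nz, m1, m2, rank):
    """Local extent of an X-aligned pencil for `rank` on an m1 x m2 grid.

    Hypothetical layout: X stays whole on every rank, Y is split over m1
    and Z over m2. The rank-to-coordinate mapping is also an assumption.
    """
    if not 0 <= rank < m1 * m2:
        raise ValueError("rank is outside the m1 x m2 process grid")
    r1, r2 = rank % m1, rank // m1
    return (nx, split_even(ny, m1)[r1], split_even(nz, m2)[r2])


def max_ranks(n, decomposition):
    """Upper bound on ranks that can each own part of an n^3 grid."""
    if decomposition == "slab":      # one dimension split
        return n
    if decomposition == "pencil":    # two dimensions split
        return n * n
    raise ValueError("decomposition must be 'slab' or 'pencil'")


if __name__ == "__main__":
    # Uneven example: a 10^3 grid on a 3 x 2 process grid.
    for rank in range(6):
        print(rank, pencil_shape(10, 10, 10, 3, 2, rank))
    print(max_ranks(4096, "slab"), max_ranks(4096, "pencil"))
```

With a slab decomposition, a 4096^3 grid can use at most 4096 ranks, while the pencil decomposition in this sketch admits up to 4096^2. That removes the bottleneck at the cost of a second transpose between pencil orientations.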


