Can Fortran's 'do concurrent' replace directives for accelerated computing?

10/18/2021
by   Miko M. Stulajter, et al.
0

Recently, there has been growing interest in using standard language constructs (e.g. C++'s Parallel Algorithms and Fortran's do concurrent) for accelerated computing as an alternative to directive-based APIs (e.g. OpenMP and OpenACC). These constructs have the potential to be more portable, and some compilers already (or have plans to) support such standards. Here, we look at the current capabilities, portability, and performance of replacing directives with Fortran's do concurrent using a mini-app that currently implements OpenACC for GPU-acceleration and OpenMP for multi-core CPU parallelism. We replace as many directives as possible with do concurrent, testing various configurations and compiler options within three major compilers: GNU's gfortran, NVIDIA's nvfortran, and Intel's ifort. We find that with the right compiler versions and flags, many directives can be replaced without loss of performance or portability, and, in the case of nvfortran, they can all be replaced. We discuss limitations that may apply to more complicated codes and future language additions that may mitigate them. The software and Singularity containers are publicly provided to allow the results to be reproduced.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/05/2023

Acceleration of a production Solar MHD code with Fortran standard parallelism: From OpenACC to `do concurrent'

There is growing interest in using standard language constructs for acce...
research
07/01/2022

High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs

While parallelism remains the main source of performance, architectural ...
research
11/07/2022

Stardust: Compiling Sparse Tensor Algebra to a Reconfigurable Dataflow Architecture

We introduce Stardust, a compiler that compiles sparse tensor algebra to...
research
02/11/2018

GeoMFree3D: An Under-Development Meshfree Software Package for Geomechanics

This paper briefly reports the GeoMFree3D, a meshfree / meshless softwar...
research
12/23/2019

Parallel Computing With R: A Brief Review

Parallel computing has established itself as another standard method for...
research
09/22/2021

Intel Optane DCPMM and Serverless Computing

This report describes 1) how we use Intel's Optane DCPMM in the memory M...
research
05/31/2023

ReDSEa: Automated Acceleration of Triangular Solver on Supercloud Heterogeneous Systems

When utilized effectively, Supercloud heterogeneous systems have the pot...

Please sign up or login with your details

Forgot password? Click here to reset