Performance assessment of CUDA and OpenACC in large scale combustion simulations

07/24/2021
by   Guillermo Oyarzun, et al.
0

GPUs have climbed up to the top of supercomputer systems making life harder to many legacy scientific codes. Nowadays, many recipes are being used in such code's portability, without any clarity of which is the best option. We present a comparative analysis of the two most common approaches, CUDA and OpenACC, into the multi-physics CFD code Alya. Our focus is the combustion problems which are one of the most computing demanding CFD simulations. The most computing-intensive parts of the code were analyzed in detail. New data structures for the matrix assembly step have been created to facilitate a SIMD execution that benefits vectorization in the CPU and stream processing in the GPU. As a result, the CPU code has improved its performance by up to 25 GPU execution, CUDA has proven to be up to 2 times faster than OpenACC for the assembly of the matrix. On the contrary, similar performance has been obtained in the kernels related to vector operations used in the linear solver, where there is minimal memory reuse.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/06/2017

Simple and efficient GPU parallelization of existing H-Matrix accelerated BEM code

In this paper, we demonstrate how GPU-accelerated BEM routines can be us...
research
11/30/2022

GPU-Accelerated DNS of Compressible Turbulent Flows

This paper explores strategies to transform an existing CPU-based high-p...
research
05/12/2020

Heterogeneous CPU/GPU co-execution of CFD simulations on the POWER9 architecture: Application to airplane aerodynamics

High fidelity Computational Fluid Dynamics simulations are generally ass...
research
06/09/2023

Efficient parallelization strategy for real-time FE simulations

This paper introduces an efficient and generic framework for finite-elem...
research
10/21/2022

A portable coding strategy to exploit vectorization on combustion simulations

The complexity of combustion simulations demands the latest high-perform...
research
04/08/2022

Performance portable ice-sheet modeling with MALI

High resolution simulations of polar ice-sheets play a crucial role in t...
research
07/16/2021

Refactoring the MPS/University of Chicago Radiative MHD(MURaM) Model for GPU/CPU Performance Portability Using OpenACC Directives

The MURaM (Max Planck University of Chicago Radiative MHD) code is a sol...

Please sign up or login with your details

Forgot password? Click here to reset