New High Performance GPGPU Code Transformation Framework Applied to Large Production Weather Prediction Code

02/16/2018
by   Michel Müller, et al.
0

We introduce "Hybrid Fortran", a new approach that allows a high performance GPGPU port for structured grid Fortran codes. This technique only requires minimal changes for a CPU targeted codebase, which is a significant advancement in terms of productivity. It has been successfully applied to both dynamical core and physical processes of ASUCA, a Japanese mesoscale weather prediction model with more than 150k lines of code. By means of a minimal weather application that resembles ASUCA's code structure, Hybrid Fortran is compared to both a performance model as well as today's commonly used method, OpenACC. As a result, the Hybrid Fortran implementation is shown to deliver the same or better performance than OpenACC and its performance agrees with the model both on CPU and GPU. In a full scale production run, using an ASUCA grid with 1581 x 1301 x 58 cells and real world weather data in 2km resolution, 24 NVIDIA Tesla P100 running the Hybrid Fortran based GPU port are shown to replace more than 50 18-core Intel Xeon Broadwell E5-2695 v4 running the reference implementation - an achievement comparable to more invasive GPGPU rewrites of other weather models.

READ FULL TEXT

page 5

page 11

page 12

page 15

page 16

page 25

page 27

page 40

research
10/24/2017

Hybrid Fortran: High Productivity GPU Porting Framework Applied to Japanese Weather Prediction Model

In this work we use the GPU porting task for the operative Japanese weat...
research
09/27/2020

A directive based hybrid Met Office NERC Cloud model

Large Eddy Simulation is a critical modelling tool for the investigation...
research
11/06/2018

GPU Acceleration of an Established Solar MHD Code using OpenACC

GPU accelerators have had a notable impact on high-performance computing...
research
10/27/2017

Performance optimizations for scalable CFD applications on hybrid CPU+MIC heterogeneous computing system with millions of cores

For computational fluid dynamics (CFD) applications with a large number ...
research
02/12/2018

GPU implementation of algorithm SIMPLE-TS for calculation of unsteady, viscous, compressible and heat-conductive gas flows

The recent trend of using Graphics Processing Units (GPU's) for high per...
research
12/10/2015

Grid: A next generation data parallel C++ QCD library

In this proceedings we discuss the motivation, implementation details, a...
research
09/19/2018

LFRic: Meeting the challenges of scalability and performance portability in Weather and Climate models

This paper describes LFRic: the new weather and climate modelling system...

Please sign up or login with your details

Forgot password? Click here to reset