Performance optimizations for scalable CFD applications on hybrid CPU+MIC heterogeneous computing system with millions of cores

10/27/2017
by   Yong-Xian Wang, et al.
0

For computational fluid dynamics (CFD) applications with a large number of grid points/cells, parallel computing is a common efficient strategy to reduce the computational time. How to achieve the best performance in the modern supercomputer system, especially with heterogeneous computing resources such as hybrid CPU+GPU, or a CPU + Intel Xeon Phi (MIC) co-processors, is still a great challenge. An in-house parallel CFD code capable of simulating three dimensional structured grid applications is developed and tested in this study. Several methods of parallelization, performance optimization and code tuning both in the CPU-only homogeneous system and in the heterogeneous system are proposed based on identifying potential parallelism of applications, balancing the work load among all kinds of computing devices, tuning the multi-thread code toward better performance in intra-machine node with hundreds of CPU/MIC cores, and optimizing the communication among inter-nodes, inter-cores, and between CPUs and MICs. Some benchmark cases from model and/or industrial CFD applications are tested on the Tianhe-1A and Tianhe-2 supercomputer to evaluate the performance. Among these CFD cases, the maximum number of grid cells reached 780 billion. The tuned solver successfully scales up to half of the entire Tianhe-2 supercomputer system with over 1.376 million of heterogeneous cores. The test results and performance analysis are discussed in detail.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/29/2023

CPU-GPU Heterogeneous Code Acceleration of a Finite Volume Computational Fluid Dynamics Solver

This work deals with the CPU-GPU heterogeneous code acceleration of a fi...
research
05/03/2021

[Re] Three-dimensional wake topology and propulsive performance of low-aspect-ratio pitching-rolling plates

This article reports on a full replication study in computational fluid ...
research
10/20/2020

Modeling Data Movement Performance on Heterogeneous Architectures

The cost of data movement on parallel systems varies greatly with machin...
research
05/12/2020

Heterogeneous CPU/GPU co-execution of CFD simulations on the POWER9 architecture: Application to airplane aerodynamics

High fidelity Computational Fluid Dynamics simulations are generally ass...
research
02/16/2018

New High Performance GPGPU Code Transformation Framework Applied to Large Production Weather Prediction Code

We introduce "Hybrid Fortran", a new approach that allows a high perform...
research
05/31/2023

ReDSEa: Automated Acceleration of Triangular Solver on Supercloud Heterogeneous Systems

When utilized effectively, Supercloud heterogeneous systems have the pot...
research
11/01/2017

Deep and Shallow convections in Atmosphere Models on Intel Xeon Phi Coprocessor Systems

Deep and shallow convection calculations occupy significant times in atm...

Please sign up or login with your details

Forgot password? Click here to reset