Performance analysis and optimization of the JOREK code for many-core CPUs

10/10/2018
by T. B. Fehér et al.

This report investigates the performance of the JOREK code on the Intel Knights Landing and Skylake processor architectures. The OpenMP scaling of the matrix construction part of the code was analyzed, and improved synchronization methods were implemented. A new switch was added to control the number of threads used by the linear equation solver independently of the other parts of the code. The matrix construction subroutine was vectorized, and its data locality was improved. Together, these steps led to a factor-of-two speedup of the matrix construction.
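
The abstract does not say which synchronization methods were used, so the following is only a minimal sketch of the general idea, not the JOREK implementation. It assumes a hypothetical toy problem (a 1D chain of two-node elements accumulating into a shared array) and compares an atomic-protected assembly loop with a two-colour variant that needs no synchronization inside the loop. It also shows one possible way to give a solver-like kernel its own thread count, here through the OpenMP `num_threads` clause. All function names and the `solver_threads` parameter are made up for illustration.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical toy problem: element e couples nodes e and e+1, and each
   element adds a contribution of 1.0 to both of its nodes.
   diag must have room for n_elem + 1 entries. */

/* Baseline: every update of the shared array is protected by an atomic. */
void assemble_atomic(double *diag, size_t n_elem) {
    #pragma omp parallel for
    for (long e = 0; e < (long)n_elem; ++e) {
        #pragma omp atomic
        diag[e] += 1.0;
        #pragma omp atomic
        diag[e + 1] += 1.0;
    }
}

/* Alternative (an assumption, not the report's method): two-colour the
   elements so that elements of the same colour never share a node.
   Each colour is then processed in parallel without atomics. */
void assemble_colored(double *diag, size_t n_elem) {
    for (int color = 0; color < 2; ++color) {
        #pragma omp parallel for
        for (long e = color; e < (long)n_elem; e += 2) {
            diag[e] += 1.0;
            diag[e + 1] += 1.0;
        }
    }
}

/* Stand-in for a solver kernel: a dot product whose thread count is set
   independently of the assembly loops via the num_threads clause.
   solver_threads must be positive. */
double dot_with_threads(const double *x, const double *y, size_t n,
                        int solver_threads) {
    double sum = 0.0;
    #pragma omp parallel for num_threads(solver_threads) reduction(+:sum)
    for (long i = 0; i < (long)n; ++i) {
        sum += x[i] * y[i];
    }
    return sum;
}
```

Compiled without OpenMP support, the pragmas are ignored and the functions run serially with the same results. With OpenMP enabled, the coloured loop trades per-update atomics for one implicit barrier per colour.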


research
10/19/2017

DD-αAMG on QPACE 3

We describe our experience porting the Regensburg implementation of the ...
research
07/27/2022

Performance of an Astrophysical Radiation Hydrodynamics Code under Scalable Vector Extension Optimization

We present results of a performance study of an astrophysical radiation ...
research
07/12/2017

Benchmarking Data Analysis and Machine Learning Applications on the Intel KNL Many-Core Processor

Knights Landing (KNL) is the code name for the second-generation Intel X...
research
05/04/2023

Notes on Refactoring Exponential Macros in Common Lisp

I recently consulted for a very big Common Lisp project having more than...
research
03/27/2018

Extreme Scale FMM-Accelerated Boundary Integral Equation Solver for Wave Scattering

Algorithmic and architecture-oriented optimizations are essential for ac...
research
12/21/2018

Correct by construction

Matrix code allows one to discover algorithms and to render them in code...
research
07/12/2019

Simulating Nonlinear Neutrino Oscillations on Next-Generation Many-Core Architectures

In this work an astrophysical simulation code, XFLAT, is developed to st...
