Investigating Applications on the A64FX

09/24/2020
by   Adrian Jackson, et al.
0

The A64FX processor from Fujitsu, being designed for computational simulation and machine learning applications, has the potential for unprecedented performance in HPC systems. In this paper, we evaluate the A64FX by benchmarking against a range of production HPC platforms that cover a number of processor technologies. We investigate the performance of complex scientific applications across multiple nodes, as well as single node and mini-kernel benchmarks. This paper finds that the performance of the A64FX processor across our chosen benchmarks often significantly exceeds other platforms, even without specific application optimisations for the processor instruction set or hardware. However, this is not true for all the benchmarks we have undertaken. Furthermore, the specific configuration of applications can have an impact on the runtime and performance experienced.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/12/2017

Benchmarking Data Analysis and Machine Learning Applications on the Intel KNL Many-Core Processor

Knights Landing (KNL) is the code name for the second-generation Intel X...
research
01/10/2023

Exploring the Use of WebAssembly in HPC

Containerization approaches based on namespaces offered by the Linux ker...
research
01/12/2018

Effect of Meltdown and Spectre Patches on the Performance of HPC Applications

In this work we examine how the updates addressing Meltdown and Spectre ...
research
07/27/2023

Benchmarking Performance of Deep Learning Model for Material Segmentation on Two HPC Systems

Performance Benchmarking of HPC systems is an ongoing effort that seeks ...
research
04/21/2023

STaKTAU: profiling HPC applications' operating system usage

This paper presents a approach for measuring the time spent by HPC appli...
research
08/14/2014

Cortical Processing with Thermodynamic-RAM

AHaH computing forms a theoretical framework from which a biologically-i...
research
10/13/2019

Hardware/Software Codesign for Training/Testing Multiple Neural Networks on Multiple FPGAs

Most neural network designs for FPGAs are inflexible. In this paper, we ...

Please sign up or login with your details

Forgot password? Click here to reset