Lattice QCD on a novel vector architecture

01/21/2020
by   Benjamin Huth, et al.

The SX-Aurora TSUBASA PCIe accelerator card is the newest model of NEC's SX architecture family. Its multi-core vector processor features a vector length of 16 kbits and interfaces with up to 48 GB of HBM2 memory in the current models, which have been available since 2018. Peak compute performance is up to 2.45 TFlop/s in double precision, and peak memory throughput is up to 1.2 TB/s. New models with improved performance characteristics have been announced for the near future. In this contribution we discuss key aspects of the SX-Aurora and describe how we enabled the architecture in the Grid Lattice QCD framework.
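
As a rough illustration of what the quoted peak figures imply, the following sketch derives the number of elements per vector register and the ratio of peak compute to peak memory throughput. It uses only the numbers from the abstract. The unit conventions (1 kbit = 1024 bits, 1 TB = 10^12 bytes) and all function and constant names are assumptions made for this example, not values or identifiers taken from the paper or from Grid.

```python
# Back-of-the-envelope figures derived from the abstract's peak numbers.
# Unit conventions below are assumptions, not statements from the paper.

VECTOR_LENGTH_BITS = 16 * 1024   # "16 kbits", assuming 1 kbit = 1024 bits
PEAK_FLOPS_DP = 2.45e12          # 2.45 TFlop/s peak, double precision
PEAK_BANDWIDTH = 1.2e12          # 1.2 TB/s peak, assuming 1 TB = 10**12 bytes


def elements_per_vector(bits_per_element: int) -> int:
    """Number of elements of the given width that fit in one vector."""
    return VECTOR_LENGTH_BITS // bits_per_element


def machine_balance() -> float:
    """Peak floating-point operations per byte of peak memory traffic."""
    return PEAK_FLOPS_DP / PEAK_BANDWIDTH


if __name__ == "__main__":
    print("double-precision elements per vector:", elements_per_vector(64))
    print("single-precision elements per vector:", elements_per_vector(32))
    print("peak flop per byte: %.2f" % machine_balance())
```

Under these assumptions, one vector holds 256 double-precision values, and a kernel needs roughly two floating-point operations per byte moved to or from memory before the peak compute rate, rather than the peak memory throughput, becomes the limit.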


