
StreamBrain: An HPC Framework for Brain-like Neural Networks on CPUs, GPUs and FPGAs
The modern deep learning method based on backpropagation has surged in p...

Benchmarking the Nvidia GPU Lineage
For many, Graphics Processing Units (GPUs) provide a source of reliable...

Accelerating Radiation Therapy Dose Calculation with Nvidia GPUs
Radiation Treatment Planning (RTP) is the process of planning the approp...

Matrix Engines for High Performance Computing: A Paragon of Performance or Grasping at Straws?
Matrix engines or units, in different forms and affinities, are becoming...

High-Performance Spectral Element Methods on Field-Programmable Gate Arrays
Improvements in computer systems have historically relied on two well-kn...

Automatic Particle Trajectory Classification in Plasma Simulations
Numerical simulations of plasma flows are crucial for advancing our unde...

sputniPIC: an Implicit Particle-in-Cell Code for Multi-GPU Systems
Large-scale simulations of plasmas are essential for advancing our under...

tf-Darshan: Understanding Fine-grained I/O Performance in Machine Learning Workloads
Machine Learning applications on HPC systems have been gaining popularit...

Optimization of Tensor-product Operations in Nekbone on GPUs
In the CFD solver Nek5000, the computation is dominated by the evaluatio...

White Paper from Workshop on Large-scale Parallel Numerical Computing Technology (LSPANC 2020): HPC and Computer Arithmetic toward Minimal-Precision Computing
In numerical computations, precision of floating-point computations is a...

A Survey on Coarse-Grained Reconfigurable Architectures from a Performance Perspective
With the end of both Dennard's scaling and Moore's law, computer users a...

High-Performance High-Order Stencil Computation on FPGAs Using OpenCL
In this paper we evaluate the performance of FPGAs for high-order stenci...

Double-precision FPUs in High-Performance Computing: an Embarrassment of Riches?
Among the (uncontended) common wisdom in High-Performance Computing (HPC...

Combined Spatial and Temporal Blocking for High-Performance Stencil Computation on FPGAs Using OpenCL
Recent developments in High Level Synthesis tools have attracted softwar...
Artur Podobas