Enabling GPU Accelerated Computing in the SUNDIALS Time Integration Library

11/25/2020
by   Cody J. Balos, et al.
0

As part of the Exascale Computing Project (ECP), a recent focus of development efforts for the SUite of Nonlinear and DIfferential/ALgebraic equation Solvers (SUNDIALS) has been to enable GPU-accelerated time integration in scientific applications at extreme scales. This effort has resulted in several new GPU-enabled implementations of core SUNDIALS data structures, support for programming paradigms which are aware of the heterogeneous architectures, and the introduction of utilities to provide new points of flexibility. In this paper, we discuss our considerations, both internal and external, when designing these new features and present the features themselves. We also present performance results for several of the features on the Summit supercomputer and early access hardware for the Frontier supercomputer, which demonstrate negligible performance overhead resulting from the additional infrastructure and significant speedups when using both NVIDIA and AMD GPUs.

READ FULL TEXT

page 4

page 9

page 10

research
11/19/2020

Enabling New Flexibility in the SUNDIALS Suite of Nonlinear and Differential/Algebraic Equation Solvers

In recent years, the SUite of Nonlinear and DIfferential/ALgebraic equat...
research
11/02/2020

Toward Performance-Portable PETSc for GPU-based Exascale Systems

The Portable Extensible Toolkit for Scientific computation (PETSc) libra...
research
09/27/2019

SUNDIALS Multiphysics+MPIManyVector Performance Testing

In this report we document performance test results on a SUNDIALS-based ...
research
04/20/2018

CUDA Support in GNA Data Analysis Framework

Usage of GPUs as co-processors is a well-established approach to acceler...
research
05/30/2023

Automating GPU Scalability for Complex Scientific Models: Phonon Boltzman Transport Equation

Heterogeneous computing environments combining CPU and GPU resources pro...
research
09/14/2021

Measurement and Analysis of GPU-accelerated Applications with HPCToolkit

To address the challenge of performance analysis on the US DOE's forthco...
research
01/11/2022

High Throughput Multidimensional Tridiagonal Systems Solvers on FPGAs

We present a design space exploration for synthesizing optimized, high-t...

Please sign up or login with your details

Forgot password? Click here to reset