CUDACLAW: A high-performance programmable GPU framework for the solution of hyperbolic PDEs

05/21/2018
by H. Gorune Ohannessian, et al.

We present cudaclaw, a CUDA-based, high-performance, data-parallel framework for solving multidimensional systems of hyperbolic partial differential equations (PDEs), the equations that describe wave motion. cudaclaw allows computational scientists to solve such systems on GPUs without writing CUDA code or managing thread and block details, data layout, and data movement between the levels of the memory hierarchy. The user defines the set of PDEs to be solved via a CUDA-independent serial Riemann solver, and the framework orchestrates the computations and data transfers to maximize arithmetic throughput. cudaclaw treats the spatial dimensions separately so that suitable block sizes and dimensions can be used in each direction, and it includes a number of optimizations that minimize access to global memory.
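
The division of labor described in the abstract can be illustrated with a small CPU-only sketch. It is not cudaclaw's API: every name here (`advection_riemann`, `sweep_1d`, `step_2d`) is hypothetical, the equation is scalar advection chosen for brevity, and the first-order upwind update with dimension-by-dimension sweeps is an assumption about how a pointwise Riemann solver might be driven, not a description of the framework's actual kernels. The point is only that the user-supplied function sees two neighbouring cell values and nothing about the grid, while the driver decides how to traverse the grid in each direction.

```python
# Illustrative sketch only; all names are hypothetical, not cudaclaw's API.

def advection_riemann(q_left, q_right, speed):
    """Pointwise Riemann solver for q_t + speed * q_x = 0.

    Returns the jump across one interface and the speed at which it
    travels. It knows nothing about grids, threads, or memory layout.
    """
    return q_right - q_left, speed


def sweep_1d(q, riemann, speed, dt_over_dx):
    """One first-order upwind update along a single periodic row."""
    new_q = list(q)
    for i in range(len(q)):
        # Interface between cell i-1 and cell i (index -1 wraps around).
        wave, s = riemann(q[i - 1], q[i], speed)
        if s > 0:
            new_q[i] -= dt_over_dx * s * wave      # right-going wave updates cell i
        else:
            new_q[i - 1] -= dt_over_dx * s * wave  # left-going wave updates cell i-1
    return new_q


def step_2d(q, riemann, speed_x, speed_y, dt_over_dx, dt_over_dy):
    """Dimensionally split step: sweep all rows in x, then all columns in y."""
    rows = [sweep_1d(row, riemann, speed_x, dt_over_dx) for row in q]
    cols = [sweep_1d(list(col), riemann, speed_y, dt_over_dy)
            for col in zip(*rows)]
    # Transpose back so the result is indexed q[y][x] like the input.
    return [list(row) for row in zip(*cols)]


if __name__ == "__main__":
    grid = [[0, 0, 0],
            [0, 1, 0],
            [0, 0, 0]]
    for row in step_2d(grid, advection_riemann, 1, 1, 1, 1):
        print(row)
```

In this sketch each row or column sweep is independent of the others, which is the kind of structure a GPU framework could map to separate thread blocks per direction; how cudaclaw actually does so is not shown here.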

research
11/18/2021

Error analysis of first time to a threshold value for partial differential equations

We develop an a posteriori error analysis for a novel quantity of intere...
research
10/22/2019

Learning Partial Differential Equations from Data Using Neural Networks

We develop a framework for estimating unknown partial differential equat...
research
07/16/2020

Numerical approximation of port-Hamiltonian systems for hyperbolic or parabolic PDEs with boundary control

The present manuscript concerns the design of structure-preserving discr...
research
04/01/2021

The Two-Dimensional Swept Rule Applied on Heterogeneous Architectures

The partial differential equations describing compressible fluid flows c...
research
05/20/2019

ExaHyPE: An Engine for Parallel Dynamically Adaptive Simulations of Wave Problems

ExaHyPE ("An Exascale Hyperbolic PDE Engine") is a software engine for s...
research
04/19/2021

Multigrid Reduction in Time for non-linear hyperbolic equations

Time-parallel algorithms seek greater concurrency by decomposing the tem...
research
09/06/2016

Accelerating Nuclear Configuration Interaction Calculations through a Preconditioned Block Iterative Eigensolver

We describe a number of recently developed techniques for improving the ...
