DeepAI AI Chat
Log In Sign Up

GPU Acceleration of Hermite Methods for the Simulation of Wave Propagation

by   Arturo Vargas, et al.

The Hermite methods of Goodrich, Hagstrom, and Lorenz (2006) use Hermite interpolation to construct high order numerical methods for hyperbolic initial value problems. The structure of the method has several favorable features for parallel computing. In this work, we propose algorithms that take advantage of the many-core architecture of Graphics Processing Units. The algorithm exploits the compact stencil of Hermite methods and uses data structures that allow for efficient data load and stores. Additionally the highly localized evolution operator of Hermite methods allows us to combine multi-stage time-stepping methods within the new algorithms incurring minimal accesses of global memory. Using a scalar linear wave equation, we study the algorithm by considering Hermite interpolation and evolution as individual kernels and alternatively combined them into a monolithic kernel. For both approaches we demonstrate strategies to increase performance. Our numerical experiments show that although a two kernel approach allows for better performance on the hardware, a monolithic kernel can offer a comparable time to solution with less global memory usage.


page 1

page 2

page 3

page 4


Removing numerical dispersion from linear evolution equations

In this paper we describe a method for removing the numerical errors in ...

The perfectly matched layer (PML) for hyperbolic wave propagation problems: A review

It is well-known that reliable and efficient domain truncation is crucia...

Accelerating wave-propagation algorithms with adaptive mesh refinement using the Graphics Processing Unit (GPU)

Clawpack is a library for solving nonlinear hyperbolic partial different...

Massively scalable stencil algorithm

Stencil computations lie at the heart of many scientific and industrial ...

Operator approximation of the wave equation based on deep learning of Green's function

Deep operator networks (DeepONets) have demonstrated their capability of...

Fast mass lumped multiscale wave propagation modelling

In this paper, we investigate the use of a mass lumped fully explicit ti...

The Megopolis Resampler: Memory Coalesced Resampling on GPUs

The resampling process employed in widely used methods such as Importanc...