Automating GPU Scalability for Complex Scientific Models: Phonon Boltzman Transport Equation

05/30/2023
by   Eric Heisler, et al.
0

Heterogeneous computing environments combining CPU and GPU resources provide a great boost to large-scale scientific computing applications. Code generation utilities that partition the work into CPU and GPU tasks while considering data movement costs allow researchers to more quickly and easily develop high-performance solutions, and make these resources accessible to a larger user base. We present developments for a domain-specific language (DSL) and code generation framework for solving partial differential equations (PDEs). These enhancements facilitate GPU-accelerated solution of the Boltzmann transport equation (BTE) for phonons, which is the governing equation for simulating thermal transport in semiconductor materials at sub-micron scales. The solution of the BTE involves thousands of coupled PDEs as well as complicated boundary conditions and nonlinear processing at each time step. These developments enable the DSL to generate configurable hybrid GPU/CPU code that couples accelerated kernels with user-defined code. We observed performance improvements of around 18X compared to a CPU-only version produced by this same DSL with minimal additional programming effort.

READ FULL TEXT

page 5

page 7

page 9

research
07/16/2023

ProtoX: A First Look

We present a first look at ProtoX, a code generation framework for stenc...
research
08/08/2018

Accelerating wave-propagation algorithms with adaptive mesh refinement using the Graphics Processing Unit (GPU)

Clawpack is a library for solving nonlinear hyperbolic partial different...
research
07/18/2017

GPU accelerated computation of Polarized Subsurface BRDF for Flat Particulate Layers

BRDF of most real world materials has two components, the surface BRDF d...
research
11/25/2020

Enabling GPU Accelerated Computing in the SUNDIALS Time Integration Library

As part of the Exascale Computing Project (ECP), a recent focus of devel...
research
03/28/2021

Application of Graphics Processing Units for self-consistent modelling of shallow water dynamics and sediment transport

In this paper, we describe a numerical algorithm for the self-consistent...
research
08/20/2014

Code Generation for High-Level Synthesis of Multiresolution Applications on FPGAs

Multiresolution Analysis (MRA) is a mathematical method that is based on...
research
01/27/2022

IMEXLBM 1.0: A Proxy Application based on the Lattice Boltzmann Method for solving Computational Fluid Dynamic problems on GPUs

The US Department of Energy launched the Exascale Computing Project (ECP...

Please sign up or login with your details

Forgot password? Click here to reset