Accelerating GPU-Based Out-of-Core Stencil Computation with On-the-Fly Compression

09/12/2021
by   Jingcheng Shen, et al.
0

Stencil computation is an important class of scientific applications that can be efficiently executed by graphics processing units (GPUs). Out-of-core approach helps run large scale stencil codes that process data with sizes larger than the limited capacity of GPU memory. However, the performance of the GPU-based out-of-core stencil computation is always limited by the data transfer between the CPU and GPU. Many optimizations have been explored to reduce such data transfer, but the study on the use of on-the-fly compression techniques is far from sufficient. In this study, we propose a method that accelerates the GPU-based out-of-core stencil computation with on-the-fly compression. We introduce a novel data compression approach that solves the data dependency between two contiguous decomposed data blocks. We also modify a widely used GPU-based compression library to support pipelining that overlaps CPU/GPU data transfer with GPU computation. Experimental results show that the proposed method achieved a speedup of 1.2x compared the method without compression. Moreover, although the precision loss involved by compression increased with the number of time steps, the precision loss was trivial up to 4,320 time steps, demonstrating the usefulness of the proposed method.

READ FULL TEXT
research
04/24/2022

Compression-Based Optimizations for Out-of-Core GPU Stencil Computation

An out-of-core stencil computation code handles large data whose size is...
research
04/19/2020

GPU-Accelerated Compression and Visualization of Large-Scale Vessel Trajectories in Maritime IoT Industries

The automatic identification system (AIS), an automatic vessel-tracking ...
research
07/11/2019

Profiling based Out-of-core Hybrid Method for Large Neural Networks

GPUs are widely used to accelerate deep learning with NNs (NNs). On the ...
research
06/21/2011

Accelerating Lossless Data Compression with GPUs

Huffman compression is a statistical, lossless, data compression algorit...
research
10/05/2015

GPU-Based Computation of 2D Least Median of Squares with Applications to Fast and Robust Line Detection

The 2D Least Median of Squares (LMS) is a popular tool in robust regress...
research
05/25/2021

Providing Meaningful Data Summarizations Using Exemplar-based Clustering in Industry 4.0

Data summarizations are a valuable tool to derive knowledge from large d...
research
11/18/2021

QGTC: Accelerating Quantized Graph Neural Networks via GPU Tensor Core

Over the most recent years, quantized graph neural network (QGNN) attrac...

Please sign up or login with your details

Forgot password? Click here to reset