DAG-based Scheduling with Resource Sharing for Multi-task Applications in a Polyglot GPU Runtime

12/17/2020
by   Alberto Parravicini, et al.
0

GPUs are readily available in cloud computing and personal devices, but their use for data processing acceleration has been slowed down by their limited integration with common programming languages such as Python or Java. Moreover, using GPUs to their full capabilities requires expert knowledge of asynchronous programming. In this work, we present a novel GPU run time scheduler for multi-task GPU computations that transparently provides asynchronous execution, space-sharing, and transfer-computation overlap without requiring in advance any information about the program dependency structure. We leverage the GrCUDA polyglot API to integrate our scheduler with multiple high-level languages and provide a platform for fast prototyping and easy GPU acceleration. We validate our work on 6 benchmarks created to evaluate task-parallelism and show an average of 44 slowdown compared to hand-optimized host code written using the C++ CUDA Graphs API.

READ FULL TEXT
POST COMMENT

Comments

There are no comments yet.

Authors

page 1

page 9

10/26/2018

Integration of CUDA Processing within the C++ library for parallelism and concurrency (HPX)

Experience shows that on today's high performance systems the utilizatio...
10/30/2020

Transparent Compiler and Runtime Specializations for Accelerating Managed Languages on FPGAs

In recent years, heterogeneous computing has emerged as the vital way to...
06/14/2019

A Performance Study of the 2D Ising Model on GPUs

The simulation of the two-dimensional Ising model is used as a benchmark...
09/15/2020

Term Rewriting on GPUs

We present a way to implement term rewriting on a GPU. We do this by let...
01/04/2021

Implementing CUDA Streams into AstroAccelerate – A Case Study

To be able to run tasks asynchronously on NVIDIA GPUs a programmer must ...
04/13/2021

ZMCintegral-v5.1: Support for Multi-function Integrations on GPUs

In this new version of ZMCintegral, we have added the functionality of m...
02/20/2021

A Python Framework for Fast Modelling and Simulation of Cellular Nonlinear Networks and other Finite-difference Time-domain Systems

This paper introduces and evaluates a freely available cellular nonlinea...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.