Exploring Fully Offloaded GPU Stream-Aware Message Passing

06/27/2023
by   Naveen Namashivayam, et al.
0

Modern heterogeneous supercomputing systems are comprised of CPUs, GPUs, and high-speed network interconnects. Communication libraries supporting efficient data transfers involving memory buffers from the GPU memory typically require the CPU to orchestrate the data transfer operations. A new offload-friendly communication strategy, stream-triggered (ST) communication, was explored to allow offloading the synchronization and data movement operations from the CPU to the GPU. A Message Passing Interface (MPI) one-sided active target synchronization based implementation was used as an exemplar to illustrate the proposed strategy. A latency-sensitive nearest neighbor microbenchmark was used to explore the various performance aspects of the implementation. The offloaded implementation shows significant on-node performance advantages over standard MPI active RMA (36 multi-node improvement is less (23 slower than point-to-point), but plans are in progress to purse further improvements.

READ FULL TEXT
research
08/09/2022

Exploring GPU Stream-Aware Message Passing using Triggered Operations

Modern heterogeneous supercomputing systems are comprised of compute bla...
research
01/21/2021

Efficient MPI-based Communication for GPU-Accelerated Dask Applications

Dask is a popular parallel and distributed computing framework, which ri...
research
08/29/2022

MPIX Stream: An Explicit Solution to Hybrid MPI+X Programming

The hybrid MPI+X programming paradigm, where X refers to threads or GPUs...
research
07/01/2020

Simulation and Analysis of Distributed Wireless Sensor Network using Message Passing Interface

Wireless Sensor Networks (WSN) are used by many industries from environm...
research
10/20/2020

Modeling Data Movement Performance on Heterogeneous Architectures

The cost of data movement on parallel systems varies greatly with machin...
research
07/01/2020

Efficient Generation of Mandelbrot Set using Message Passing Interface

With the increasing need for safer and reliable systems, Mandelbrot Set'...
research
06/23/2020

Optimised allgatherv, reduce_scatter and allreduce communication in message-passing systems

Collective communications, namely the patterns allgatherv, reduce_scatte...

Please sign up or login with your details

Forgot password? Click here to reset