Interconnect Bandwidth Heterogeneity on AMD MI250x and Infinity Fabric

02/28/2023
by   Carl Pearson, et al.
0

Demand for low-latency and high-bandwidth data transfer between GPUs has driven the development of multi-GPU nodes. Physical constraints on the manufacture and integration of such systems has yielded heterogeneous intra-node interconnects, where not all devices are connected equally. The next generation of supercomputing platforms are expected to feature AMD CPUs and GPUs. This work characterizes the extent to which interconnect heterogeneity is visible through GPU programming APIs on a system with four AMD MI250x GPUs, and provides several insights for users of such systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/20/2013

Multi-GPU Training of ConvNets

In this work we evaluate different approaches to parallelize computation...
research
04/15/2021

Performance Analysis and Optimization Opportunities for NVIDIA Automotive GPUs

Advanced Driver Assistance Systems (ADAS) and Autonomous Driving (AD) br...
research
11/28/2022

Development of an Equation-based Parallelization Method for Multiphase Particle-in-Cell Simulations

Manufacturers have been developing new graphics processing unit (GPU) no...
research
10/20/2018

Learning-based Application-Agnostic 3D NoC Design for Heterogeneous Manycore Systems

The rising use of deep learning and other big-data algorithms has led to...
research
12/11/2020

Trash Talk: Accelerating Garbage Collection on Integrated GPUs is Worthless

Systems integrating heterogeneous processors with unified memory provide...
research
09/03/2023

FusionAI: Decentralized Training and Deploying LLMs with Massive Consumer-Level GPUs

The rapid growth of memory and computation requirements of large languag...
research
05/14/2022

A Low-latency Communication Design for Brain Simulations

Brain simulation, as one of the latest advances in artificial intelligen...

Please sign up or login with your details

Forgot password? Click here to reset