MAPA: Multi-Accelerator Pattern Allocation Policy for Multi-Tenant GPU Servers

10/07/2021
by   Kiran Ranganath, et al.
0

Multi-accelerator servers are increasingly being deployed in shared multi-tenant environments (such as in cloud data centers) in order to meet the demands of large-scale compute-intensive workloads. In addition, these accelerators are increasingly being inter-connected in complex topologies and workloads are exhibiting a wider variety of inter-accelerator communication patterns. However, existing allocation policies are ill-suited for these emerging use-cases. Specifically, this work identifies that multi-accelerator workloads are commonly fragmented leading to reduced bandwidth and increased latency for inter-accelerator communication. We propose Multi-Accelerator Pattern Allocation (MAPA), a graph pattern mining approach towards providing generalized allocation support for allocating multi-accelerator workloads on multi-accelerator servers. We demonstrate that MAPA is able to improve the execution time of multi-accelerator workloads and that MAPA is able to provide generalized benefits across various accelerator topologies. Finally, we demonstrate a speedup of 12.4 execution time reduced by up to 35

READ FULL TEXT

page 9

page 12

research
03/08/2021

AVEC: Accelerator Virtualization in Cloud-Edge Computing for Deep Learning Libraries

Edge computing offers the distinct advantage of harnessing compute capab...
research
07/23/2023

MARS: Exploiting Multi-Level Parallelism for DNN Workloads on Adaptive Multi-Accelerator Systems

Along with the fast evolution of deep neural networks, the hardware syst...
research
06/19/2023

A multithread AES accelerator for Cyber-Physical Systems

Computing elements of CPSs must be flexible to ensure interoperability; ...
research
06/26/2018

Improving tasks throughput on accelerators using OpenCL command concurrency

A heterogeneous architecture composed by a host and an accelerator must ...
research
12/09/2022

Mining CryptoNight-Haven on the Varium C1100 Blockchain Accelerator Card

Cryptocurrency mining is an energy-intensive process that presents a pri...
research
03/17/2022

Beauty and the beast: A case study on performance prototyping of data-intensive containerized cloud applications

Data-intensive container-based cloud applications have become popular wi...
research
11/25/2021

A Dense Tensor Accelerator with Data Exchange Mesh for DNN and Vision Workloads

We propose a dense tensor accelerator called VectorMesh, a scalable, mem...

Please sign up or login with your details

Forgot password? Click here to reset