Ridgeline: A 2D Roofline Model for Distributed Systems

09/03/2022
by   Fabio Checconi, et al.
0

In this short paper, we introduce the Ridgeline model, an extension of the Roofline model [4] for distributed systems. The Roofline model targets shared memory systems, bounding the performance of a kernel based on its operational intensity, and the peak compute throughput and memory bandwidth of the execution system. In a distributed setting, with multiple communicating compute entities, the network must be taken into account to model the system behavior accurately. The Ridgeline aggregates information on compute, memory, and network limits in one 2D plot to show, in an intuitive way, which of the resources is the expected bottleneck. We show the applicability of the Ridgeline in a case study based on a data-parallel Multi-Layer Perceptron (MLP) instance.

READ FULL TEXT

page 1

page 3

research
07/07/2022

The Case for Distributed Shared-Memory Databases with RDMA-Enabled Memory Disaggregation

Memory disaggregation (MD) allows for scalable and elastic data center d...
research
01/10/2018

DuctTeip: An efficient programming model for distributed task based parallel computing

Current high-performance computer systems used for scientific computing ...
research
04/02/2019

DeLTA: GPU Performance Model for Deep Learning Applications with In-depth Memory System Traffic Analysis

Training convolutional neural networks (CNNs) requires intense compute t...
research
09/16/2018

Analysis of Dynamic Memory Bandwidth Regulation in Multi-core Real-Time Systems

One of the primary sources of unpredictability in modern multi-core embe...
research
02/28/2021

Performance Optimization of SU3_Bench on Xeon and Programmable Integrated Unified Memory Architecture

SU3_Bench is a microbenchmark developed to explore performance portabili...
research
09/30/2020

Communication-Optimal Parallel Standard and Karatsuba Integer Multiplication in the Distributed Memory Model

We present COPSIM a parallel implementation of standard integer multipli...
research
06/15/2021

Modeling memory bandwidth patterns on NUMA machines with performance counters

Computers used for data analytics are often NUMA systems with multiple s...

Please sign up or login with your details

Forgot password? Click here to reset