Design and Evaluation of Scalable Representations of Communication in Gantt Charts for Large-scale Execution Traces

06/30/2021
by   Connor Scully-Allison, et al.
0

Gantt charts are frequently used to explore execution traces of large-scale parallel programs found in high-performance computing (HPC). In these visualizations, each parallel processor is assigned a row showing the computation state of a processor at a particular time. Lines are drawn between rows to show communication between these processors. When drawn to align equivalent calls across rows, structures can emerge reflecting communication patterns employed by the executing code. However, though these structures have the same definition at any scale, they are obscured by the density of rendered lines when displaying more than a few hundred processors. A more scalable metaphor is necessary to aid HPC experts in understanding communication in large-scale traces. To address this issue, we first conduct an exploratory study to identify what visual features are critical for determining similarity between structures shown at different scales. Based on these findings, we design a set of glyphs for displaying these structures in dense charts. We then conduct a pre-registered user study evaluating how well people interpret communication using our new representation versus their base depictions in large-scale Gantt charts. Through our evaluation, we find that our representation enables users to more accurately identify communication patterns compared to full renderings of dense charts. We discuss the results of our evaluation and findings regarding the design of metaphors for extensible structures.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

page 7

page 8

research
09/28/2021

A Look at Communication-Intensive Performance in Julia

The Julia programming language continues to gain popularity both for its...
research
08/08/2023

NSF RESUME HPC Workshop: High-Performance Computing and Large-Scale Data Management in Service of Epidemiological Modeling

The NSF-funded Robust Epidemic Surveillance and Modeling (RESUME) projec...
research
12/12/2018

Real-time cortical simulations - Energy and interconnect scaling on distributed systems

We profile the impact of computation and inter-processor communication o...
research
08/13/2018

CUBE: A scalable framework for large-scale industrial simulations

Writing high performance solvers for engineering applications is a delic...
research
09/14/2017

TraceTracker: Hardware/Software Co-Evaluation for Large-Scale I/O Workload Reconstruction

Block traces are widely used for system studies, model verifications, an...
research
07/10/2018

Understanding Differences among Executions with Variational Traces

One of the main challenges of debugging is to understand why the program...
research
03/07/2023

Notable: On-the-fly Assistant for Data Storytelling in Computational Notebooks

Computational notebooks are widely used for data analysis. Their interle...

Please sign up or login with your details

Forgot password? Click here to reset