A Workload Analysis of NSF's Innovative HPC Resources Using XDMoD

01/12/2018
by   Nikolay A. Simakov, et al.
0

Workload characterization is an integral part of performance analysis of high performance computing (HPC) systems. An understanding of workload properties sheds light on resource utilization and can be used to inform performance optimization both at the software and system configuration levels. It can provide information on how computational science usage modalities are changing that could potentially aid holistic capacity planning for the wider HPC ecosystem. Here, we report on the results of a detailed workload analysis of the portfolio of supercomputers comprising the NSF Innovative HPC program in order to characterize its past and current workload and look for trends to understand the nature of how the broad portfolio of computational science research is being supported and how it is changing over time. The workload analysis also sought to illustrate a wide variety of usage patterns and performance requirements for jobs running on these systems. File system performance, memory utilization and the types of parallelism employed by users (MPI, threads, etc) were also studied for all systems for which job level performance data was available.

READ FULL TEXT

page 14

page 15

page 17

page 18

page 20

page 29

page 32

page 37

research
06/18/2018

AccaSim: a Customizable Workload Management Simulator for Job Dispatching Research in HPC Systems

We present AccaSim, a simulator for workload management in HPC systems. ...
research
01/12/2023

Analyzing Resource Utilization in an HPC System: A Case Study of NERSC Perlmutter

The resource demands of HPC applications vary significantly. However, it...
research
01/09/2023

Efficient Intra-Rack Resource Disaggregation for HPC Using Co-Packaged DWDM Photonics

The diversity of workload requirements and increasing hardware heterogen...
research
12/14/2020

WISE: A Computer System Performance Index Scoring Framework

The performance levels of a computing machine running a given workload c...
research
10/21/2021

Three Practical Workflow Schedulers for Easy Maximum Parallelism

Runtime scheduling and workflow systems are an increasingly popular algo...
research
05/10/2019

Channels, Learning, Queueing and Remote Estimation Systems With A Utilization-Dependent Component

In this article, we survey the main models, techniques, concepts, and re...
research
04/12/2022

The MIT Supercloud Workload Classification Challenge

High-Performance Computing (HPC) centers and cloud providers support an ...

Please sign up or login with your details

Forgot password? Click here to reset