Analyzing Performance Properties Collected by the PerSyst Scalable HPC Monitoring Tool

09/13/2020
by   David Brayford, et al.
0

The ability to understand how a scientific application is executed on a large HPC system is of great importance in allocating resources within the HPC data center. In this paper, we describe how we used system performance data to identify: execution patterns, possible code optimizations and improvements to the system monitoring. We also identify candidates for employing machine learning techniques to predict the performance of similar scientific codes.

READ FULL TEXT
research
01/20/2023

ARcode: HPC Application Recognition Through Image-encoded Monitoring Data

Knowing HPC applications of jobs and analyzing their performance behavio...
research
05/06/2020

Catch Me If You Can: Using Power Analysis to Identify HPC Activity

Monitoring users on large computing platforms such as high performance c...
research
09/10/2021

An Execution Fingerprint Dictionary for HPC Application Recognition

Applications running on HPC systems waste time and energy if they: (a) u...
research
02/15/2019

Studying the Impact of Power Capping on MapReduce-based, Data-intensive Mini-applications on Intel KNL and KNM Architectures

In this poster, we quantitatively measure the impacts of data movement o...
research
07/07/2020

Analytics of Longitudinal System Monitoring Data for Performance Prediction

In recent years, several HPC facilities have started continuous monitori...
research
12/18/2018

A Preliminary Study of Neural Network-based Approximation for HPC Applications

Machine learning, as a tool to learn and model complicated (non)linear r...
research
08/31/2019

Detecting Covert Cryptomining using HPC

Cybercriminals have been exploiting cryptocurrencies to commit various u...

Please sign up or login with your details

Forgot password? Click here to reset