Analyzing Performance Properties Collected by the PerSyst Scalable HPC Monitoring Tool

09/13/2020
by   David Brayford, et al.
0

The ability to understand how a scientific application is executed on a large HPC system is of great importance in allocating resources within the HPC data center. In this paper, we describe how we used system performance data to identify: execution patterns, possible code optimizations and improvements to the system monitoring. We also identify candidates for employing machine learning techniques to predict the performance of similar scientific codes.

READ FULL TEXT
POST COMMENT

Comments

There are no comments yet.

Authors

page 2

05/06/2020

Catch Me If You Can: Using Power Analysis to Identify HPC Activity

Monitoring users on large computing platforms such as high performance c...
09/10/2021

An Execution Fingerprint Dictionary for HPC Application Recognition

Applications running on HPC systems waste time and energy if they: (a) u...
02/15/2019

Studying the Impact of Power Capping on MapReduce-based, Data-intensive Mini-applications on Intel KNL and KNM Architectures

In this poster, we quantitatively measure the impacts of data movement o...
07/07/2020

Analytics of Longitudinal System Monitoring Data for Performance Prediction

In recent years, several HPC facilities have started continuous monitori...
12/18/2018

A Preliminary Study of Neural Network-based Approximation for HPC Applications

Machine learning, as a tool to learn and model complicated (non)linear r...
04/18/2022

A Taxonomy of Error Sources in HPC I/O Machine Learning Models

I/O efficiency is crucial to productivity in scientific computing, but t...
08/31/2019

Detecting Covert Cryptomining using HPC

Cybercriminals have been exploiting cryptocurrencies to commit various u...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.