JobPruner: A Machine Learning Assistant for Exploring Parameter Spaces in HPC Applications

02/03/2018
by   Bruno Silva, et al.
0

High Performance Computing (HPC) applications are essential for scientists and engineers to create and understand models and their properties. These professionals depend on the execution of large sets of computational jobs that explore combinations of parameter values. Avoiding the execution of unnecessary jobs brings not only speed to these experiments, but also reductions in infrastructure usage---particularly important due to the shift of these applications to HPC cloud platforms. Our hypothesis is that data generated by these experiments can help users in identifying such jobs. To address this hypothesis we need to understand the similarity levels among multiple experiments necessary for job elimination decisions and the steps required to automate this process. In this paper we present a study and a machine learning-based tool called JobPruner to support parameter exploration in HPC experiments. The tool was evaluated with three real-world use cases from different domains including seismic analysis and agronomy. We observed the tool reduced 93 scenarios. In addition, reduction in job executions was possible even considering past experiments with low correlations.

READ FULL TEXT

page 4

page 8

page 10

research
07/15/2023

PSI/J: A Portable Interface for Submitting, Monitoring, and Managing Jobs

It is generally desirable for high-performance computing (HPC) applicati...
research
03/24/2021

Towards Accommodating Real-time Jobs on HPC Platforms

Increasing data volumes in scientific experiments necessitate the use of...
research
12/18/2018

A Preliminary Study of Neural Network-based Approximation for HPC Applications

Machine learning, as a tool to learn and model complicated (non)linear r...
research
04/18/2022

A Taxonomy of Error Sources in HPC I/O Machine Learning Models

I/O efficiency is crucial to productivity in scientific computing, but t...
research
05/09/2019

parasweep: A template-based utility for generating, dispatching, and post-processing of parameter sweeps

We introduce parasweep, a free and open-source utility for facilitating ...
research
12/20/2016

CannyFS: Opportunistically Maximizing I/O Throughput Exploiting the Transactional Nature of Batch-Mode Data Processing

We introduce a user mode file system, CannyFS, that hides latency by ass...
research
01/12/2018

Effect of Meltdown and Spectre Patches on the Performance of HPC Applications

In this work we examine how the updates addressing Meltdown and Spectre ...

Please sign up or login with your details

Forgot password? Click here to reset