Toward an End-to-End Auto-tuning Framework in HPC PowerStack

08/14/2020
by   Xingfu Wu, et al.
0

Efficiently utilizing procured power and optimizing performance of scientific applications under power and energy constraints are challenging. The HPC PowerStack defines a software stack to manage power and energy of high-performance computing systems and standardizes the interfaces between different components of the stack. This survey paper presents the findings of a working group focused on the end-to-end tuning of the PowerStack. First, we provide a background on the PowerStack layer-specific tuning efforts in terms of their high-level objectives, the constraints and optimization goals, layer-specific telemetry, and control parameters, and we list the existing software solutions that address those challenges. Second, we propose the PowerStack end-to-end auto-tuning framework, identify the opportunities in co-tuning different layers in the PowerStack, and present specific use cases and solutions. Third, we discuss the research opportunities and challenges for collective auto-tuning of two or more management layers (or domains) in the PowerStack. This paper takes the first steps in identifying and aggregating the important R D challenges in streamlining the optimization efforts across the layers of the PowerStack.

READ FULL TEXT
research
05/16/2020

Toward End-to-End, Full-Stack 6G Terahertz Networks

Recent evolutions in semiconductors and photonics have brought the terah...
research
06/26/2023

LM4HPC: Towards Effective Language Model Application in High-Performance Computing

In recent years, language models (LMs), such as GPT-4, have been widely ...
research
12/10/2021

(R)SE challenges in HPC

We discuss some specific software engineering challenges in the field of...
research
08/01/2018

Container solutions for HPC Systems: A Case Study of Using Shifter on Blue Waters

Software container solutions have revolutionized application development...
research
10/28/2018

FFT, FMM, and Multigrid on the Road to Exascale: performance challenges and opportunities

FFT, FMM, and multigrid methods are widely used fast and highly scalable...
research
02/15/2018

Input-Aware Auto-Tuning of Compute-Bound HPC Kernels

Efficient implementations of HPC applications for parallel architectures...
research
12/28/2020

SimBricks: End-to-End Network System Evaluation with Modular Simulation

Full system "end-to-end" measurements in physical testbeds are the gold ...

Please sign up or login with your details

Forgot password? Click here to reset