A Reinforcement Learning Approach for Performance-aware Reduction in Power Consumption of Data Center Compute Nodes

08/15/2023
by   Akhilesh Raj, et al.
0

As Exascale computing becomes a reality, the energy needs of compute nodes in cloud data centers will continue to grow. A common approach to reducing this energy demand is to limit the power consumption of hardware components when workloads are experiencing bottlenecks elsewhere in the system. However, designing a resource controller capable of detecting and limiting power consumption on-the-fly is a complex issue and can also adversely impact application performance. In this paper, we explore the use of Reinforcement Learning (RL) to design a power capping policy on cloud compute nodes using observations on current power consumption and instantaneous application performance (heartbeats). By leveraging the Argo Node Resource Management (NRM) software stack in conjunction with the Intel Running Average Power Limit (RAPL) hardware control mechanism, we design an agent to control the maximum supplied power to processors without compromising on application performance. Employing a Proximal Policy Optimization (PPO) agent to learn an optimal policy on a mathematical model of the compute nodes, we demonstrate and evaluate using the STREAM benchmark how a trained agent running on actual hardware can take actions by balancing power consumption and application performance.

READ FULL TEXT
research
07/06/2021

Sustaining Performance While Reducing Energy Consumption: A Control Theory Approach

Production high-performance computing systems continue to grow in comple...
research
10/07/2022

PMT: Power Measurement Toolkit

Efficient use of energy is essential for today's supercomputing systems,...
research
03/22/2021

Power Modeling for Effective Datacenter Planning and Compute Management

Datacenter power demand has been continuously growing and is the key dri...
research
03/05/2019

The power disaggregation algorithms and their applications to demand dispatch

We were interested in solving a power disaggregation problem which comes...
research
06/09/2021

IChannels: Exploiting Current Management Mechanisms to Create Covert Channels in Modern Processors

To operate efficiently across a wide range of workloads with varying pow...
research
09/06/2016

A Hardware-Efficient Approach to Computing the Rotation Matrix from a Quaternion

In this paper, we have proposed a novel VLSI-oriented approach to comput...
research
09/26/2019

A Simulation of UAV Power Optimization via Reinforcement Learning

This paper demonstrates a reinforcement learning approach to the optimiz...

Please sign up or login with your details

Forgot password? Click here to reset