Prediction-Based Power Oversubscription in Cloud Platforms

by   Alok Kumbhare, et al.

Datacenter designers rely on conservative estimates of IT equipment power draw to provision resources. This leaves resources underutilized and requires more datacenters to be built. Prior work has used power capping to shave the rare power peaks and add more servers to the datacenter, thereby oversubscribing its resources and lowering capital costs. This works well when the workloads and their server placements are known. Unfortunately, these factors are unknown in public clouds, forcing providers to limit the oversubscription so that performance is never impacted. In this paper, we argue that providers can use predictions of workload performance criticality and virtual machine (VM) resource utilization to increase oversubscription. This poses many challenges, such as identifying the performance-critical workloads from black-box VMs, creating support for criticality-aware power management, and increasing oversubscription while limiting the impact of capping. We address these challenges for the hardware and software infrastructures of Microsoft Azure. The results show that we enable a 2x increase in oversubscription with minimum impact to critical workloads.



There are no comments yet.


page 10


Understanding Cloud Workloads Performance in a Production like Environment

Understanding inter-VM interference is of paramount importance to provid...

IOTune: A G-states Driver for Elastic Performance of Block Storage

Imagining a disk which provides baseline performance at a relatively low...

FECBench: A Holistic Interference-aware Approach for Application Performance Modeling

Services hosted in multi-tenant cloud platforms often encounter performa...

Compiler-Guided Throughput Scheduling for Many-core Machines

Modern ARM-based servers such as ThunderX and ThunderX2 offer a tremendo...

Disaggregating Non-Volatile Memory for Throughput-Oriented Genomics Workloads

Massive exploitation of next-generation sequencing technologies requires...

Power Modeling for Effective Datacenter Planning and Compute Management

Datacenter power demand has been continuously growing and is the key dri...

An Experimental and Comparative Benchmark Study Examining Resource Utilization in Managed Hadoop Context

Transitioning cloud-based Hadoop from IaaS to PaaS, which are commercial...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.