CILP: Co-simulation based Imitation Learner for Dynamic Resource Provisioning in Cloud Computing Environments

02/11/2023
by Shreshth Tuli, et al.

Intelligent Virtual Machine (VM) provisioning is central to cost- and resource-efficient computation in cloud computing environments. As bootstrapping VMs is time-consuming, a key challenge for latency-critical tasks is to predict future workload demands so that VMs can be provisioned proactively. However, existing AI-based solutions tend not to consider holistically all crucial aspects, such as provisioning overheads, heterogeneous VM costs and Quality of Service (QoS) of the cloud system. To address this, we propose a novel method, called CILP, that formulates the VM provisioning problem as two sub-problems of prediction and optimization, where the provisioning plan is optimized based on predicted workload demands. CILP leverages a neural network as a surrogate model to predict future workload demands, together with a co-simulated digital twin of the infrastructure to compute QoS scores. We extend the neural network to also act as an imitation learner that dynamically decides the optimal VM provisioning plan. A transformer-based neural model reduces training and inference overheads, while our novel two-phase decision-making loop facilitates informed provisioning decisions. Crucially, we address limitations of prior work by including resource utilization, deployment costs and provisioning overheads to inform the provisioning decisions in our imitation learning framework. Experiments with three public benchmarks demonstrate that CILP gives up to 22 higher resource utilization, 14 costs compared to the current online and offline optimization-based state-of-the-art methods.
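
The split into a prediction sub-problem and an optimization sub-problem described in the abstract can be illustrated with a toy sketch. This is not the CILP implementation: the moving-average predictor stands in for the neural surrogate, the exhaustive search stands in for the imitation learner, and the VM catalogue, `BOOT_PENALTY`, and all function names are hypothetical values chosen only for illustration. No co-simulation or QoS scoring is modelled.

```python
from itertools import product

# Hypothetical VM catalogue: name -> (capacity units, cost per interval).
VM_TYPES = {"small": (2, 1.0), "large": (8, 3.5)}
BOOT_PENALTY = 0.5  # assumed one-off overhead charged per newly started VM


def predict_demand(history, window=3):
    """Placeholder predictor: mean of the last `window` observations.

    The paper uses a neural surrogate model here; this is only a stand-in.
    """
    recent = history[-window:]
    return sum(recent) / len(recent)


def plan_cost(plan, current):
    """Running cost of a plan plus boot overhead for VMs not yet running."""
    running = sum(n * VM_TYPES[t][1] for t, n in plan.items())
    boots = sum(max(0, n - current.get(t, 0)) for t, n in plan.items())
    return running + BOOT_PENALTY * boots


def provision(history, current, max_per_type=4):
    """Return (plan, cost) for the cheapest plan covering predicted demand.

    Returns None when no plan within `max_per_type` has enough capacity.
    """
    demand = predict_demand(history)
    names = list(VM_TYPES)
    best = None
    # Exhaustive search over VM counts; feasible only for tiny catalogues.
    for counts in product(range(max_per_type + 1), repeat=len(names)):
        plan = dict(zip(names, counts))
        capacity = sum(n * VM_TYPES[t][0] for t, n in plan.items())
        if capacity < demand:
            continue
        cost = plan_cost(plan, current)
        if best is None or cost < best[1]:
            best = (plan, cost)
    return best
```

Calling `provision([6, 9, 12], {"small": 1, "large": 0})` predicts demand from the recent history and then selects VM counts that cover it at the lowest combined running and boot cost, which mirrors the abstract's point that provisioning overheads and heterogeneous VM costs should both inform the plan.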

research
05/21/2022

Learning to Dynamically Select Cost Optimal Schedulers in Cloud Computing Environments

The operational cost of a cloud computing platform is one of the most si...
research
05/21/2022

MetaNet: Automated Dynamic Selection of Scheduling Policies in Cloud Environments

Task scheduling is a well-studied problem in the context of optimizing t...
research
01/19/2022

PROMPT: Learning Dynamic Resource Allocation Policies for Edge-Network Applications

A growing number of service providers are exploring methods to improve s...
research
04/14/2022

RobustScaler: QoS-Aware Autoscaling for Complex Workloads

Autoscaling is a critical component for efficient resource utilization w...
research
05/12/2018

Incremental Learning Framework Using Cloud Computing

High volume of data, perceived as either challenge or opportunity. Deep ...
research
12/04/2022

A proactive autoscaling and energy-efficient VM allocation framework using online multi-resource neural network for cloud data center

This work proposes an energy-efficient resource provisioning and allocat...
research
12/02/2022

Predict-and-Critic: Accelerated End-to-End Predictive Control for Cloud Computing through Reinforcement Learning

Cloud computing holds the promise of reduced costs through economies of ...
