Learning to Dynamically Select Cost Optimal Schedulers in Cloud Computing Environments

05/21/2022
by   Shreshth Tuli, et al.
0

The operational cost of a cloud computing platform is one of the most significant Quality of Service (QoS) criteria for schedulers, crucial to keep up with the growing computational demands. Several data-driven deep neural network (DNN)-based schedulers have been proposed in recent years that outperform alternative approaches by providing scalable and effective resource management for dynamic workloads. However, state-of-the-art schedulers rely on advanced DNNs with high computational requirements, implying high scheduling costs. In non-stationary contexts, the most sophisticated schedulers may not always be required, and it may be sufficient to rely on low-cost schedulers to temporarily save operational costs. In this work, we propose MetaNet, a surrogate model that predicts the operational costs and scheduling overheads of a large number of DNN-based schedulers and chooses one on-the-fly to jointly optimize job scheduling and execution costs. This facilitates improvements in execution costs, energy usage and service level agreement violations of up to 11

READ FULL TEXT

page 1

page 2

page 3

research
05/21/2022

MetaNet: Automated Dynamic Selection of Scheduling Policies in Cloud Environments

Task scheduling is a well-studied problem in the context of optimizing t...
research
02/11/2023

CILP: Co-simulation based Imitation Learner for Dynamic Resource Provisioning in Cloud Computing Environments

Intelligent Virtual Machine (VM) provisioning is central to cost and res...
research
04/12/2020

Service Level Driven Job Scheduling in Multi-Tier Cloud Computing: A Biologically Inspired Approach

Cloud computing environments often have to deal with random-arrival comp...
research
12/20/2019

H2O-Cloud: A Resource and Quality of Service-Aware Task Scheduling Framework for Warehouse-Scale Data Centers

Cloud computing has attracted both end-users and Cloud Service Providers...
research
08/18/2019

Workload-Aware Opportunistic Energy Efficiency in Multi-FPGA Platforms

The continuous growth of big data applications with high computational a...
research
12/11/2020

Analyzing the Performance of Smart Industry 4.0 Applications on Cloud Computing Systems

Cloud-based Deep Neural Network (DNN) applications that make latency-sen...

Please sign up or login with your details

Forgot password? Click here to reset