H2O-Cloud: A Resource and Quality of Service-Aware Task Scheduling Framework for Warehouse-Scale Data Centers – A Hierarchical Hybrid DRL (Deep Reinforcement Learning) based

12/20/2019
by   Mingxi Cheng, et al.
0

Cloud computing has attracted both end-users and Cloud Service Providers (CSPs) in recent years. Improving resource utilization rate (RUtR), such as CPU and memory usages on servers, while maintaining Quality-of-Service (QoS) is one key challenge faced by CSPs with warehouse-scale data centers. Prior works proposed various algorithms to reduce energy cost or to improve RUtR, which either lack the fine-grained task scheduling capabilities, or fail to take a comprehensive system model into consideration. This article presents H2O-Cloud, a Hierarchical and Hybrid Online task scheduling framework for warehouse-scale CSPs, to improve resource usage effectiveness while maintaining QoS. H2O-Cloud is highly scalable and considers comprehensive information such as various workload scenarios, cloud platform configurations, user request information and dynamic pricing model. The hierarchy and hybridity of the framework, combined with its deep reinforcement learning (DRL) engines, enable H2O-Cloud to efficiently start on-the-go scheduling and learning in an unpredictable environment without pre-training. Our experiments confirm the high efficiency of the proposed H2O-Cloud when compared to baseline approaches, in terms of energy and cost while maintaining QoS. Compared with a state-of-the-art DRL-based algorithm, H2O-Cloud achieves up to 201.17 improvement, 47.88 improvement.

READ FULL TEXT
research
12/20/2019

H2O-Cloud: A Resource and Quality of Service-Aware Task Scheduling Framework for Warehouse-Scale Data Centers

Cloud computing has attracted both end-users and Cloud Service Providers...
research
05/10/2021

Deep Reinforcement Learning-based Methods for Resource Scheduling in Cloud Computing: A Review and Future Directions

As the quantity and complexity of information processed by software syst...
research
05/21/2022

MetaNet: Automated Dynamic Selection of Scheduling Policies in Cloud Environments

Task scheduling is a well-studied problem in the context of optimizing t...
research
09/16/2018

Energy Efficient Cloud Control and Pricing in Geographically Distributed Data Centers

It is estimated that data centers constitute 1.5 usage. At the same time...
research
07/14/2021

QoS-Aware Scheduling in New Radio Using Deep Reinforcement Learning

Fifth-generation (5G) New Radio (NR) cellular networks support a wide ra...
research
08/22/2023

A Deep Reinforcement Learning based Algorithm for Time and Cost Optimized Scaling of Serverless Applications

Serverless computing has gained a strong traction in the cloud computing...
research
05/21/2022

Learning to Dynamically Select Cost Optimal Schedulers in Cloud Computing Environments

The operational cost of a cloud computing platform is one of the most si...

Please sign up or login with your details

Forgot password? Click here to reset