HUNTER: AI based Holistic Resource Management for Sustainable Cloud Computing

10/11/2021
by   Shreshth Tuli, et al.
6

The worldwide adoption of cloud data centers (CDCs) has given rise to the ubiquitous demand for hosting application services on the cloud. Further, contemporary data-intensive industries have seen a sharp upsurge in the resource requirements of modern applications. This has led to the provisioning of an increased number of cloud servers, giving rise to higher energy consumption and, consequently, sustainability concerns. Traditional heuristics and reinforcement learning based algorithms for energy-efficient cloud resource management address the scalability and adaptability related challenges to a limited extent. Existing work often fails to capture dependencies across thermal characteristics of hosts, resource consumption of tasks and the corresponding scheduling decisions. This leads to poor scalability and an increase in the compute resource requirements, particularly in environments with non-stationary resource demands. To address these limitations, we propose an artificial intelligence (AI) based holistic resource management technique for sustainable cloud computing called HUNTER. The proposed model formulates the goal of optimizing energy efficiency in data centers as a multi-objective scheduling problem, considering three important models: energy, thermal and cooling. HUNTER utilizes a Gated Graph Convolution Network as a surrogate model for approximating the Quality of Service (QoS) for a system state and generating optimal scheduling decisions. Experiments on simulated and physical cloud environments using the CloudSim toolkit and the COSCO framework show that HUNTER outperforms state-of-the-art baselines in terms of energy consumption, SLA violation, scheduling time, cost and temperature by up to 12, 35, 43, 54 and 3 percent respectively.

READ FULL TEXT

page 4

page 6

page 9

page 10

page 13

page 15

page 16

research
04/25/2021

Performance and Energy-Aware Bi-objective Tasks Scheduling for Cloud Data Centers

Cloud computing enables remote execution of users tasks. The pervasive a...
research
12/14/2021

MCDS: AI Augmented Workflow Scheduling in Mobile Edge Cloud Computing Systems

Workflow scheduling is a long-studied problem in parallel and distribute...
research
05/21/2022

MetaNet: Automated Dynamic Selection of Scheduling Policies in Cloud Environments

Task scheduling is a well-studied problem in the context of optimizing t...
research
09/30/2019

HolDCSim: A Holistic Simulator for Data Centers

Cloud computing based systems, that span data centers, are commonly depl...
research
11/04/2021

MUVINE: Multi-stage Virtual Network Embedding in Cloud Data Centers using Reinforcement Learning based Predictions

The recent advances in virtualization technology have enabled the sharin...
research
02/12/2020

Energy Efficient Algorithms based on VM Consolidation for Cloud Computing: Comparisons and Evaluations

Cloud Computing paradigm has revolutionized IT industry and be able to o...

Please sign up or login with your details

Forgot password? Click here to reset