A Data-Driven Approach to Dynamically Adjust Resource Allocation for Compute Clusters

07/01/2018
by   Francesco Pace, et al.
0

Nowadays, data-centers are largely under-utilized because resource allocation is based on reservation mechanisms which ignore actual resource utilization. Indeed, it is common to reserve resources for peak demand, which may occur only for a small portion of the application life time. As a consequence, cluster resources often go under-utilized. In this work, we propose a mechanism that improves cluster utilization, thus decreasing the average turnaround time, while preventing application failures due to contention in accessing finite resources such as RAM. Our approach monitors resource utilization and employs a data-driven approach to resource demand forecasting, featuring quantification of uncertainty in the predictions. Using demand forecast and its confidence, our mechanism modulates cluster resources assigned to running applications, and reduces the turnaround time by more than one order of magnitude while keeping application failures under control. Thus, tenants enjoy a responsive system and providers benefit from an efficient cluster utilization.

READ FULL TEXT
research
06/02/2020

Flex: Closing the Gaps between Usage and Allocation

Data centers are giant factories of Internet data and services. Worldwid...
research
03/21/2019

Prevention and Mitigation of Catastrophic Failures in Demand-Supply Interdependent Networks

We propose a generic system model for a special category of interdepende...
research
08/22/2023

Demand-driven provisioning of Kubernetes-like resources in OSG

The OSG-operated Open Science Pool is an HTCondor-based virtual cluster ...
research
12/05/2018

ADARES: Adaptive Resource Management for Virtual Machines

Virtual execution environments allow for consolidation of multiple appli...
research
05/22/2019

Online Collection and Forecasting of Resource Utilization in Large-Scale Distributed Systems

Large-scale distributed computing systems often contain thousands of dis...
research
05/23/2022

MetaSlicing: A Novel Resource Allocation Framework for Metaverse

Creating and maintaining the Metaverse requires enormous resources that ...
research
01/03/2014

A Framework for Creating a Distributed Rendering Environment on the Compute Clusters

This paper discusses the deployment of existing render farm manager in a...

Please sign up or login with your details

Forgot password? Click here to reset