CASH: A Credit Aware Scheduling for Public Cloud Platforms

09/09/2020
by Aakash Sharma, et al.

The public cloud offers many services that allow its tenants to process large-scale data in a flexible, easy, and cost-effective manner. Tenants generally use large-scale data processing frameworks such as MapReduce, Tez, and Spark to process their data. They can configure a framework to schedule individual tasks itself, or use a middleware cluster manager such as YARN or Mesos to arbitrate resource scheduling in their public-cloud cluster. Cluster managers need to be aware of the workload requirements as well as the state of individual resources in the cluster, such as CPU and disk. Cloud providers apply a token bucket mechanism to individual hardware resources, and the state of the bucket indicates the quality of service that the resource can provide. In this paper, through our changes to YARN, Hadoop, and Tez, we show how middleware cluster managers can be made aware of the expected quality of service of individual hardware resources in the cluster. Our optimized cluster manager, with coarse-grained knowledge of task requirements and fine-grained knowledge of the expected quality of service of hardware resources in the cluster, performs highly optimized task placement. Our experiments with these optimizations show that CPU-credit-based instances such as the Amazon T3 instances are a viable, cost-effective option for running big data workloads. We also show that streaming SQL queries on a Hive warehouse can be accelerated by up to 31 of up to 22
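As a rough illustration of the idea described in the abstract, the following Python sketch models a per-node token bucket of credits and a simple credit-aware placement rule. It is not the paper's implementation: the `CreditBucket` class, the `pick_node` function, all parameter names, and the "longest sustainable burst" policy are assumptions made up for this example, and the numbers do not correspond to any real instance type.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class CreditBucket:
    """Hypothetical token bucket for one burstable resource on one node."""
    balance: float    # credits currently available
    capacity: float   # maximum credits the bucket can hold
    earn_rate: float  # credits earned per second

    def advance(self, seconds: float, spend_rate: float = 0.0) -> None:
        """Earn and spend credits over an interval, clamped to [0, capacity]."""
        delta = (self.earn_rate - spend_rate) * seconds
        self.balance = min(self.capacity, max(0.0, self.balance + delta))

    def burst_seconds(self, spend_rate: float) -> float:
        """Seconds the resource can sustain spend_rate before credits run out."""
        net_drain = spend_rate - self.earn_rate
        if net_drain <= 0:
            return float("inf")  # credits never run out at this rate
        return self.balance / net_drain


def pick_node(buckets: Dict[str, CreditBucket],
              spend_rate: float,
              duration: float) -> Optional[str]:
    """Assumed policy: choose the node that can sustain the task longest.

    Returns None when no node has enough credits to cover the whole task.
    """
    best = max(buckets,
               key=lambda name: buckets[name].burst_seconds(spend_rate),
               default=None)
    if best is None or buckets[best].burst_seconds(spend_rate) < duration:
        return None
    return best


if __name__ == "__main__":
    nodes = {
        "node-a": CreditBucket(balance=10.0, capacity=100.0, earn_rate=1.0),
        "node-b": CreditBucket(balance=50.0, capacity=100.0, earn_rate=1.0),
    }
    # A task that spends 2 credits/s for 30 s fits only on node-b.
    print(pick_node(nodes, spend_rate=2.0, duration=30.0))
```

A scheduler built along these lines would combine a coarse estimate of a task's demand (`spend_rate`, `duration`) with the fine-grained credit balance of each node, which is the kind of pairing the abstract describes.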


research
12/02/2018

Containers Orchestration with Cost-Efficient Autoscaling in Cloud Computing Environments

Containers are standalone, self-contained units that package software an...
research
12/22/2018

Bioinformatics Computational Cluster Batch Task Profiling with Machine Learning for Failure Prediction

Motivation: Traditional computational cluster schedulers are based on us...
research
08/29/2023

Practice of Alibaba Cloud on Elastic Resource Provisioning for Large-scale Microservices Cluster

Cloud-native architecture is becoming increasingly crucial for today's c...
research
07/06/2020

Characterizing BigBench queries, Hive, and Spark in multi-cloud environments

BigBench is the new standard (TPCx-BB) for benchmarking and testing Big ...
research
01/17/2022

GROMACS in the cloud: A global supercomputer to speed up alchemical drug design

We assess costs and efficiency of state-of-the-art high performance clou...
research
01/28/2020

A Scheduling Algorithm to Maximize Storm Throughput in Heterogeneous Cluster

In the most popular distributed stream processing frameworks (DSPFs), pr...
research
08/15/2023

Prism: Revealing Hidden Functional Clusters from Massive Instances in Cloud Systems

Ensuring the reliability of cloud systems is critical for both cloud ven...
