Exploring the Fairness and Resource Distribution in an Apache Mesos Environment

05/21/2019
by   Pankaj Saha, et al.
0

Apache Mesos, a cluster-wide resource manager, is widely deployed in massive scale at several Clouds and Data Centers. Mesos aims to provide high cluster utilization via fine grained resource co-scheduling and resource fairness among multiple users through Dominant Resource Fairness (DRF) based allocation. DRF takes into account different resource types (CPU, Memory, Disk I/O) requested by each application and determines the share of each cluster resource that could be allocated to the applications. Mesos has adopted a two-level scheduling policy: (1) DRF to allocate resources to competing frameworks and (2) task level scheduling by each framework for the resources allocated during the previous step. We have conducted experiments in a local Mesos cluster when used with frameworks such as Apache Aurora, Marathon, and our own framework Scylla, to study resource fairness and cluster utilization. Experimental results show how informed decision regarding second level scheduling policy of frameworks and attributes like offer holding period, offer refusal cycle and task arrival rate can reduce unfair resource distribution. Bin-Packing scheduling policy on Scylla with Marathon can reduce unfair allocation from 38% to 3%. By reducing unused free resources in offers we bring down the unfairness from to 90% to 28%. We also show the effect of task arrival rate to reduce the unfairness from 23% to 7%.

READ FULL TEXT
research
05/21/2019

Tromino: Demand and DRF Aware Multi-Tenant Queue Manager for Apache Mesos Cluster

Apache Mesos, a two-level resource scheduler, provides resource sharing ...
research
08/28/2020

SAF: Simulated Annealing Fair Scheduling for Hadoop Yarn Clusters

Apache introduced YARN as the next generation of the Hadoop framework, p...
research
01/28/2020

A Scheduling Algorithm to Maximize Storm Throughput in Heterogeneous Cluster

In the most popular distributed stream processing frameworks (DSPFs), pr...
research
02/25/2021

Optimized Memoryless Fair-Share HPC Resources Scheduling using Transparent Checkpoint-Restart Preemption

Common resource management methods in supercomputing systems usually inc...
research
05/02/2018

Avalon: Building an Operating System for Robotcenter

This paper envisions a scenario that hundreds of heterogeneous robots fo...
research
12/22/2018

Bioinformatics Computational Cluster Batch Task Profiling with Machine Learning for Failure Prediction

Motivation: Traditional computational cluster schedulers are based on us...
research
09/18/2020

C-Balancer: A System for Container Profiling and Scheduling

Linux containers have gained high popularity in recent times. This popul...

Please sign up or login with your details

Forgot password? Click here to reset