Analysis of Workflow Schedulers in Simulated Distributed Environments

04/14/2022
by   Jakub Beránek, et al.
0

Task graphs provide a simple way to describe scientific workflows (sets of tasks with dependencies) that can be executed on both HPC clusters and in the cloud. An important aspect of executing such graphs is the used scheduling algorithm. Many scheduling heuristics have been proposed in existing works; nevertheless, they are often tested in oversimplified environments. We provide an extensible simulation environment designed for prototyping and benchmarking task schedulers, which contains implementations of various scheduling algorithms and is open-sourced, in order to be fully reproducible. We use this environment to perform a comprehensive analysis of workflow scheduling algorithms with a focus on quantifying the effect of scheduling challenges that have so far been mostly neglected, such as delays between scheduler invocations or partially unknown task durations. Our results indicate that network models used by many previous works might produce results that are off by an order of magnitude in comparison to a more realistic model. Additionally, we show that certain implementation details of scheduling algorithms which are often neglected can have a large effect on the scheduler's performance, and they should thus be described in great detail to enable proper evaluation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/15/2018

An Efficient Fault Tolerant Workflow Scheduling Approach using Replication Heuristics and Checkpointing in the Cloud

Scientific workflows have been predominantly used for complex and large ...
research
09/14/2018

Multiple Workflows Scheduling in Multi-tenant Distributed Systems: A Taxonomy and Future Directions

Scientific workflows are commonly used to automate scientific experiment...
research
01/14/2022

Energy-efficient workflow scheduling based on workflow structures under deadline and budget constraints in the cloud

The utilization of cloud environments to deploy scientific workflow appl...
research
05/26/2018

Data-Aware Approximate Workflow Scheduling

Optimization of data placement in complex scientific workflows has becom...
research
02/15/2023

How Workflow Engines Should Talk to Resource Managers: A Proposal for a Common Workflow Scheduling Interface

Scientific workflow management systems (SWMSs) and resource managers tog...
research
03/24/2017

Calendar.help: Designing a Workflow-Based Scheduling Agent with Humans in the Loop

Although information workers may complain about meetings, they are an es...
research
05/04/2020

MARS: Multi-Scalable Actor-Critic Reinforcement Learning Scheduler

In this paper, we introduce a new scheduling algorithm MARS based on a c...

Please sign up or login with your details

Forgot password? Click here to reset