Learning to Optimize DAG Scheduling in Heterogeneous Environment

03/09/2021
by   Jinhong Luo, et al.
0

Directed Acyclic Graph (DAG) scheduling in a heterogeneous environment is aimed at assigning the on-the-fly jobs to a cluster of heterogeneous computing executors in order to minimize the makespan while meeting all requirements of scheduling. The problem gets more attention than ever since the rapid development of heterogeneous cloud computing. A little reduction of makespan of DAG scheduling could both bring huge profits to the service providers and increase the level of service of users. Although DAG scheduling plays an important role in cloud computing industries, existing solutions still have huge room for improvement, especially in making use of topological dependencies between jobs. In this paper, we propose a task-duplication based learning algorithm, called Lachesis, for the distributed DAG scheduling problem. In our approach, it first perceives the topological dependencies between jobs using a specially designed graph convolutional network (GCN) to select the most likely task to be executed. Then the task is assigned to a specific executor with the consideration of duplicating all its precedent tasks according to a sophisticated heuristic method. We have conducted extensive experiments over standard workload data to evaluate our solution. The experimental results suggest that the proposed algorithm can achieve at most 26.7% reduction of makespan and 35.2% improvement of speedup ratio over seven strong baseline algorithms, including state-of-the-art heuristics methods and a variety of deep reinforcement learning based algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/10/2018

Reinforcement-Learning-based Foresighted Task Scheduling in Cloud Computing

With the apperance of cloud computing, users receive computing resources...
research
03/09/2019

Machine Learning Based Prediction and Classification of Computational Jobs in Cloud Computing Centers

With the rapid growth of the data volume and the fast increasing of the ...
research
12/13/2011

Hybrid Heuristic-Based Artificial Immune System for Task Scheduling

Task scheduling problem in heterogeneous systems is the process of alloc...
research
10/14/2019

In Search of a Fast and Efficient Serverless DAG Engine

Python-written data analytics applications can be modeled as and compile...
research
11/29/2022

ReAssigner: A Plug-and-Play Virtual Machine Scheduling Intensifier for Heterogeneous Requests

With the rapid development of cloud computing, virtual machine schedulin...
research
05/15/2020

DeepSoCS: A Neural Scheduler for Heterogeneous System-on-Chip Resource Scheduling

In this paper, we present a novel scheduling solution for a class of Sys...
research
09/02/2023

DeepScaler: Holistic Autoscaling for Microservices Based on Spatiotemporal GNN with Adaptive Graph Learning

Autoscaling functions provide the foundation for achieving elasticity in...

Please sign up or login with your details

Forgot password? Click here to reset