Inductive Bias-driven Reinforcement Learning For Efficient Schedules in Heterogeneous Clusters

09/04/2019
by   Subho S. Banerjee, et al.
1

The problem of scheduling of workloads onto heterogeneous processors (e.g., CPUs, GPUs, FPGAs) is of fundamental importance in modern datacenters. Most current approaches rely on building application/system-specific heuristics that have to be reinvented on a case-by-case basis. This can be prohibitively expensive and is untenable going forward. In this paper, we propose a domain-driven reinforcement learning (RL) model for scheduling that can be broadly applied to a large class of heterogeneous processors. The key novelty of our approach is (i) the RL model; and (ii) the significant reduction of training-data (using domain knowledge) and -time (using sampling based end-to-end gradient propagation). We demonstrate the approach using real world GPU and FPGA accelerated applications to produce scheduling policies that significantly outperform hand-tuned heuristics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/03/2018

Learning Scheduling Algorithms for Data Processing Clusters

Efficiently scheduling data processing jobs on distributed compute clust...
research
09/13/2023

Efficient Reinforcement Learning for Jumping Monopods

In this work, we consider the complex control problem of making a monopo...
research
08/19/2020

Intelligent Replication Management for HDFS Using Reinforcement Learning

Storage systems for cloud computing merge a large number of commodity co...
research
01/14/2022

Reinforcement Learning in Time-Varying Systems: an Empirical Study

Recent research has turned to Reinforcement Learning (RL) to solve chall...
research
12/13/2011

Hybrid Heuristic-Based Artificial Immune System for Task Scheduling

Task scheduling problem in heterogeneous systems is the process of alloc...
research
11/09/2020

Geometric Deep Reinforcement Learning for Dynamic DAG Scheduling

In practice, it is quite common to face combinatorial optimization probl...
research
01/25/2017

Towards Automatic Learning of Heuristics for Mechanical Transformations of Procedural Code

The current trends in next-generation exascale systems go towards integr...

Please sign up or login with your details

Forgot password? Click here to reset