Hierarchical Deep Reinforcement Learning Approach for Multi-Objective Scheduling With Varying Queue Sizes

07/17/2020
by   Yoni Birman, et al.

Multi-objective task scheduling (MOTS) is the scheduling of tasks while optimizing multiple and possibly conflicting constraints. A challenging extension of this problem occurs when every individual task is itself a multi-objective optimization problem. While deep reinforcement learning (DRL) has been successfully applied to complex sequential problems, its application to the MOTS domain has been stymied by two challenges. The first challenge is the inability of the DRL algorithm to ensure that every item is processed identically regardless of its position in the queue. The second challenge is the need to manage large queues, which results in large neural architectures and long training times. In this study we present MERLIN, a robust, modular and near-optimal DRL-based approach for multi-objective task scheduling. MERLIN applies a hierarchical approach to the MOTS problem by creating one neural network for the processing of individual tasks and another for the scheduling of the overall queue. In addition to being smaller and having shorter training times, the resulting architecture ensures that an item is processed in the same manner regardless of its position in the queue. Additionally, we present a novel approach for efficiently applying DRL-based solutions to very large queues, and demonstrate how we effectively scale MERLIN to process queue sizes that are larger by orders of magnitude than those on which it was trained. Extensive evaluation on multiple queue sizes shows that MERLIN outperforms multiple well-known baselines by a large margin (>22
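The two-level structure described in the abstract can be illustrated with a minimal sketch. This is not MERLIN's implementation: the names `item_policy`, `queue_policy` and `schedule`, the two-feature item representation, and the hand-written scoring rules are all hypothetical stand-ins for the two trained networks. The sketch only shows why scoring each item with a function that never sees the item's index makes the result independent of queue position.

```python
from typing import List, Sequence, Tuple

# Hypothetical item representation: (urgency, cost). Not taken from the paper.
Item = Tuple[float, float]


def item_policy(item: Item) -> float:
    """Stand-in for the per-item network.

    It receives only the item's own features, never its index in the
    queue, so the same item always gets the same score.
    """
    urgency, cost = item
    return urgency - 0.5 * cost  # assumed toy trade-off between two objectives


def queue_policy(scores: Sequence[float]) -> int:
    """Stand-in for the queue-level network: pick the next item to run."""
    return max(range(len(scores)), key=lambda i: scores[i])


def schedule(queue: List[Item]) -> List[Item]:
    """Repeatedly score the remaining items and pop the one selected."""
    remaining = list(queue)
    order: List[Item] = []
    while remaining:
        scores = [item_policy(it) for it in remaining]
        order.append(remaining.pop(queue_policy(scores)))
    return order


queue = [(0.2, 0.1), (0.9, 0.4), (0.5, 0.0)]
# Reversing the input queue does not change the resulting order,
# because no score depends on where an item sits in the queue.
same_order = schedule(queue) == schedule(list(reversed(queue)))
```

Because the per-item function takes a fixed-size input, its size in this sketch does not grow with the queue length; only the queue-level selection step sees the whole queue.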

research
06/06/2019

Deep Reinforcement Learning for Multi-objective Optimization

This study proposes an end-to-end framework for solving multi-objective ...
research
05/26/2022

Deep Reinforcement Learning with Adaptive Hierarchical Reward for Multi-Phase Multi-Objective Dexterous Manipulation

Dexterous manipulation tasks usually have multiple objectives, and the p...
research
11/01/2022

Learning Adaptive Evolutionary Computation for Solving Multi-Objective Optimization Problems

Multi-objective evolutionary algorithms (MOEAs) are widely used to solve...
research
07/07/2022

DRL-ISP: Multi-Objective Camera ISP with Deep Reinforcement Learning

In this paper, we propose a multi-objective camera ISP framework that ut...
research
12/14/2020

A Reinforcement Learning Formulation of the Lyapunov Optimization: Application to Edge Computing Systems with Queue Stability

In this paper, a deep reinforcement learning (DRL)-based approach to the...
research
10/03/2022

Learning Minimally-Violating Continuous Control for Infeasible Linear Temporal Logic Specifications

This paper explores continuous-time control synthesis for target-driven ...
research
05/06/2022

Vehicle management in a modular production context using Deep Q-Learning

We investigate the feasibility of deploying Deep-Q based deep reinforcem...
