Towards Standardizing Reinforcement Learning Approaches for Stochastic Production Scheduling

04/16/2021
by   Alexandru Rinciog, et al.
0

Recent years have seen a rise in interest in terms of using machine learning, particularly reinforcement learning (RL), for production scheduling problems of varying degrees of complexity. The general approach is to break down the scheduling problem into a Markov Decision Process (MDP), whereupon a simulation implementing the MDP is used to train an RL agent. Since existing studies rely on (sometimes) complex simulations for which the code is unavailable, the experiments presented are hard, or, in the case of stochastic environments, impossible to reproduce accurately. Furthermore, there is a vast array of RL designs to choose from. To make RL methods widely applicable in production scheduling and work out their strength for the industry, the standardization of model descriptions - both production setup and RL design - and validation scheme are a prerequisite. Our contribution is threefold: First, we standardize the description of production setups used in RL studies based on established nomenclature. Secondly, we classify RL design choices from existing publications. Lastly, we propose recommendations for a validation scheme focusing on reproducibility and sufficient benchmarking.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/10/2021

A Validation Tool for Designing Reinforcement Learning Environments

Reinforcement learning (RL) has gained increasing attraction in the acad...
research
12/06/2019

Observational Overfitting in Reinforcement Learning

A major component of overfitting in model-free reinforcement learning (R...
research
09/08/2020

Reinforcement Learning on Job Shop Scheduling Problems Using Graph Networks

This paper presents a novel approach for job shop scheduling problems us...
research
03/10/2021

A Two-stage Framework and Reinforcement Learning-based Optimization Algorithms for Complex Scheduling Problems

There hardly exists a general solver that is efficient for scheduling pr...
research
06/25/2021

Branch Prediction as a Reinforcement Learning Problem: Why, How and Case Studies

Recent years have seen stagnating improvements to branch predictor (BP) ...
research
03/02/2022

Reliable validation of Reinforcement Learning Benchmarks

Reinforcement Learning (RL) is one of the most dynamic research areas in...
research
03/01/2022

Distributional Reinforcement Learning for Scheduling of (Bio)chemical Production Processes

Reinforcement Learning (RL) has recently received significant attention ...

Please sign up or login with your details

Forgot password? Click here to reset