Math Programming based Reinforcement Learning for Multi-Echelon Inventory Management

12/04/2021
by   Pavithra Harsha, et al.
0

Reinforcement learning has lead to considerable break-throughs in diverse areas such as robotics, games and many others. But the application to RL in complex real-world decision making problems remains limited. Many problems in operations management (inventory and revenue management, for example) are characterized by large action spaces and stochastic system dynamics. These characteristics make the problem considerably harder to solve for existing RL methods that rely on enumeration techniques to solve per step action problems. To resolve these issues, we develop Programmable Actor Reinforcement Learning (PARL), a policy iteration method that uses techniques from integer programming and sample average approximation. Analytically, we show that the for a given critic, the learned policy in each iteration converges to the optimal policy as the underlying samples of the uncertainty go to infinity. Practically, we show that a properly selected discretization of the underlying uncertain distribution can yield near optimal actor policy even with very few samples from the underlying uncertainty. We then apply our algorithm to real-world inventory management problems with complex supply chain structures and show that PARL outperforms state-of-the-art RL and inventory optimization methods in these settings. We find that PARL outperforms commonly used base stock heuristic by 44.7 across different supply chain environments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/07/2020

Reinforcement Learning for Multi-Product Multi-Node Inventory Management in Supply Chains

This paper describes the application of reinforcement learning (RL) to m...
research
04/18/2023

Cooperative Multi-Agent Reinforcement Learning for Inventory Management

With Reinforcement Learning (RL) for inventory management (IM) being a n...
research
04/20/2022

Deep Reinforcement Learning for a Two-Echelon Supply Chain with Seasonal Demand

This paper leverages recent developments in reinforcement learning and d...
research
09/17/2022

Quantum Computing Methods for Supply Chain Management

Quantum computing is expected to have transformative influences on many ...
research
01/12/2022

Multi-echelon Supply Chains with Uncertain Seasonal Demands and Lead Times Using Deep Reinforcement Learning

We address the problem of production planning and distribution in multi-...
research
03/02/2022

A Learning Based Framework for Handling Uncertain Lead Times in Multi-Product Inventory Management

Most existing literature on supply chain and inventory management consid...
research
08/20/2017

A Deep Q-Network for the Beer Game: A Reinforcement Learning algorithm to Solve Inventory Optimization Problems

The beer game is a widely used in-class game that is played in supply ch...

Please sign up or login with your details

Forgot password? Click here to reset