Deep Inventory Management

10/06/2022
by   Dhruv Madeka, et al.
0

We present a Deep Reinforcement Learning approach to solving a periodic review inventory control system with stochastic vendor lead times, lost sales, correlated demand, and price matching. While this dynamic program has historically been considered intractable, we show that several policy learning approaches are competitive with or outperform classical baseline approaches. In order to train these algorithms, we develop novel techniques to convert historical data into a simulator. We also present a model-based reinforcement learning procedure (Direct Backprop) to solve the dynamic periodic review inventory control problem by constructing a differentiable simulator. Under a variety of metrics Direct Backprop outperforms model-free RL and newsvendor baselines, in both simulations and real-world deployments.

READ FULL TEXT
research
07/16/2019

Model-free Control of Chaos with Continuous Deep Q-learning

The OGY method is one of control methods for a chaotic system. In the me...
research
08/12/2020

Overcoming Model Bias for Robust Offline Deep Reinforcement Learning

State-of-the-art reinforcement learning algorithms mostly rely on being ...
research
10/20/2022

Deep reinforcement learning oriented for real world dynamic scenarios

Autonomous navigation in dynamic environments is a complex but essential...
research
12/03/2021

Improving the Robustness of Reinforcement Learning Policies with ℒ_1 Adaptive Control

A reinforcement learning (RL) control policy trained in a nominal enviro...
research
07/12/2018

The Bottleneck Simulator: A Model-based Deep Reinforcement Learning Approach

Deep reinforcement learning has recently shown many impressive successes...
research
08/23/2018

LIFT: Reinforcement Learning in Computer Systems by Learning From Demonstrations

Reinforcement learning approaches have long appealed to the data managem...
research
12/07/2020

Efficient Reservoir Management through Deep Reinforcement Learning

Dams impact downstream river dynamics through flow regulation and disrup...

Please sign up or login with your details

Forgot password? Click here to reset