E-commerce warehousing: learning a storage policy

01/21/2021
by   Adrien Rimélé, et al.
0

E-commerce with major online retailers is changing the way people consume. The goal of increasing delivery speed while remaining cost-effective poses significant new challenges for supply chains as they race to satisfy the growing and fast-changing demand. In this paper, we consider a warehouse with a Robotic Mobile Fulfillment System (RMFS), in which a fleet of robots stores and retrieves shelves of items and brings them to human pickers. To adapt to changing demand, uncertainty, and differentiated service (e.g., prime vs. regular), one can dynamically modify the storage allocation of a shelf. The objective is to define a dynamic storage policy to minimise the average cycle time used by the robots to fulfil requests. We propose formulating this system as a Partially Observable Markov Decision Process, and using a Deep Q-learning agent from Reinforcement Learning, to learn an efficient real-time storage policy that leverages repeated experiences and insightful forecasts using simulations. Additionally, we develop a rollout strategy to enhance our method by leveraging more information available at a given time step. Using simulations to compare our method to traditional storage rules used in the industry showed preliminary results up to 14% better in terms of travelling times.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/31/2021

Supervised learning and tree search for real-time storage allocation in Robotic Mobile Fulfillment Systems

A Robotic Mobile Fulfillment System is a robotised parts-to-picker syste...
research
10/28/2017

Distributed Server Allocation for Content Delivery Networks

We propose a dynamic formulation of file-sharing networks in terms of an...
research
02/26/2019

Optimal and Fast Real-time Resources Slicing with Deep Dueling Neural Networks

Effective network slicing requires an infrastructure/network provider to...
research
05/08/2023

Goal-oriented inference of environment from redundant observations

The agent learns to organize decision behavior to achieve a behavioral g...
research
11/14/2017

A unified decision making framework for supply and demand management in microgrid networks

This paper considers two important problems - on the supply-side and dem...
research
01/12/2018

Active repositioning of storage units in Robotic Mobile Fulfillment Systems

In our work we focus on Robotic Mobile Fulfillment Systems in e-commerce...

Please sign up or login with your details

Forgot password? Click here to reset