Deep reinforcement learning for the dynamic vehicle dispatching problem: An event-based approach

The dynamic vehicle dispatching problem consists of deciding which vehicles to assign to requests that arise stochastically over time and space. It emerges in diverse areas, such as the assignment of trucks to loads to be transported, emergency systems, and ride-hailing services. In this paper, we model the problem as a semi-Markov decision process, which allows us to treat time as continuous. In this setting, decision epochs coincide with discrete events separated by random time intervals. We argue that an event-based approach substantially reduces the combinatorial complexity of the decision space and overcomes other limitations of the discrete-time models often proposed in the literature. To test our approach, we develop a new discrete-event simulator and use double deep Q-learning to train our decision agents. Numerical experiments are carried out in realistic scenarios using data from New York City. We compare the policies obtained through our approach with heuristic policies often used in practice. Results show that our policies achieve better average waiting times, cancellation rates, and total service times, with reductions in average waiting times of up to 50% relative to the other tested heuristic policies.
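As a rough illustration of how double Q-learning can be adapted to event-based decision epochs, the sketch below computes a learning target for a single event-to-event transition. It is not taken from the paper: the exponential discount exp(-beta * tau), the lump-sum reward, the parameter name `beta`, and the function name `double_q_target` are all assumptions made for this example, and plain lists stand in for the outputs of the online and target networks.

```python
import math


def double_q_target(reward, tau, q_online_next, q_target_next, beta=0.1, done=False):
    """Double Q-learning target for one transition between decision epochs.

    reward        : reward collected between the two epochs (assumed lump sum)
    tau           : random time elapsed between the epochs
    q_online_next : action values at the next state from the online network
    q_target_next : action values at the next state from the target network
    beta          : continuous-time discount rate (hypothetical value)
    """
    if done:
        # No future value after a terminal transition.
        return float(reward)
    # Double Q-learning: the online values select the action,
    # and the target values evaluate it.
    best_action = max(range(len(q_online_next)), key=q_online_next.__getitem__)
    # The discount factor depends on the elapsed time, not on a fixed step.
    discount = math.exp(-beta * tau)
    return float(reward + discount * q_target_next[best_action])


# Example: the online values pick action 1; the target values score it as 4.0.
example = double_q_target(1.0, 2.0, [0.5, 2.0, 1.0], [3.0, 4.0, 5.0], beta=0.1)
```

Because the discount depends on the elapsed time `tau`, transitions separated by long intervals weigh future value less than those separated by short ones, which is what distinguishes this target from the fixed-step discount of a discrete-time model.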


