Fast Approximate Solutions using Reinforcement Learning for Dynamic Capacitated Vehicle Routing with Time Windows

02/24/2021
by   Nazneen N Sultana, et al.
0

This paper develops an inherently parallelised, fast, approximate learning-based solution to the generic class of Capacitated Vehicle Routing with Time Windows and Dynamic Routing (CVRP-TWDR). Considering vehicles in a fleet as decentralised agents, we postulate that using reinforcement learning (RL) based adaptation is a key enabler for real-time route formation in a dynamic environment. The methodology allows each agent (vehicle) to independently evaluate the value of serving each customer, and uses a centralised allocation heuristic to finalise the allocations based on the generated values. We show that the solutions produced by this method on standard datasets are significantly faster than exact formulations and state-of-the-art meta-heuristics, while being reasonably close to optimal in terms of solution quality. We describe experiments in both the static case (when all customer demands and time windows are known in advance) as well as the dynamic case (where customers can `pop up' at any time during execution). The results with a single trained model on large, out-of-distribution test data demonstrate the scalability and flexibility of the proposed approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/21/2023

Hybrid Genetic Search for Dynamic Vehicle Routing with Time Windows

The dynamic vehicle routing problem with time windows (DVRPTW) is a gene...
research
05/10/2019

Fast delta evaluation for the Vehicle Routing Problem with Multiple Time Windows

In many applications of vehicle routing, a set of time windows are feasi...
research
08/27/2020

Balanced dynamic multiple travelling salesmen: algorithms and continuous approximations

Dynamic routing occurs when customers are not known in advance, e.g. for...
research
07/20/2022

Learning to Solve Soft-Constrained Vehicle Routing Problems with Lagrangian Relaxation

Vehicle Routing Problems (VRPs) in real-world applications often come wi...
research
01/25/2018

An Improved Tabu Search Heuristics for Static Dial-A-Ride Problem

Multi-vehicle routing has become increasingly important with the rapid d...
research
04/12/2022

A Reinforcement Learning Approach for Electric Vehicle Routing Problem with Vehicle-to-Grid Supply

The use of electric vehicles (EV) in the last mile is appealing from bot...
research
12/06/2019

A case study of Consistent Vehicle Routing Problem with Time Windows

We develop a heuristic solution method for the Consistent Vehicle Routin...

Please sign up or login with your details

Forgot password? Click here to reset