Deep Reinforcement Learning for Solving the Vehicle Routing Problem

02/12/2018
by   MohammadReza Nazari, et al.
0

We present an end-to-end framework for solving Vehicle Routing Problem (VRP) using deep reinforcement learning. In this approach, we train a single model that finds near-optimal solutions for problem instances sampled from a given distribution, only by observing the reward signals and following feasibility rules. Our model represents a parameterized stochastic policy, and by applying a policy gradient algorithm to optimize its parameters, the trained model produces the solution as a sequence of consecutive actions in real time, without the need to re-train for every new problem instance. Our method is faster in both training and inference than a recent method that solves the Traveling Salesman Problem (TSP), with nearly identical solution quality. On the more general VRP, our approach outperforms classical heuristics on medium-sized instances in both solution quality and computation time (after training). Our proposed framework can be applied to variants of the VRP such as the stochastic VRP, and has the potential to be applied more generally to combinatorial optimization problems.

READ FULL TEXT
research
06/05/2021

Reinforcement Learning for Assignment Problem with Time Constraints

We present an end-to-end framework for the Assignment Problem with multi...
research
10/05/2020

Deep Reinforcement Learning for Electric Vehicle Routing Problem with Time Windows

The past decade has seen a rapid penetration of electric vehicles (EV) i...
research
02/10/2023

Reinforcement Learning Aided Sequential Optimization for Unsignalized Intersection Management of Robot Traffic

We consider the problem of optimal unsignalized intersection management ...
research
12/06/2019

Deep Reinforcement Learning for Routing a Heterogeneous Fleet of Vehicles

Motivated by the promising advances of deep-reinforcement learning (DRL)...
research
10/22/2020

Reinforcement Learning with Combinatorial Actions: An Application to Vehicle Routing

Value-function-based methods have long played an important role in reinf...
research
09/13/2022

Learning to Solve Multiple-TSP with Time Window and Rejections via Deep Reinforcement Learning

We propose a manager-worker framework based on deep reinforcement learni...
research
03/23/2023

RLOR: A Flexible Framework of Deep Reinforcement Learning for Operation Research

Reinforcement learning has been applied in operation research and has sh...

Please sign up or login with your details

Forgot password? Click here to reset