Learning Vehicle Routing Problems using Policy Optimisation

12/24/2020
by   Nasrin Sultana, et al.
0

Deep reinforcement learning (DRL) has been used to learn effective heuristics for solving complex combinatorial optimisation problem via policy networks and have demonstrated promising performance. Existing works have focused on solving (vehicle) routing problems as they have a nice balance between non-triviality and difficulty. State-of-the-art approaches learn a policy using reinforcement learning, and the learnt policy acts as a pseudo solver. These approaches have demonstrated good performance in some cases, but given the large search space typical combinatorial/routing problem, they can converge too quickly to poor policy. To prevent this, in this paper, we propose an approach name entropy regularised reinforcement learning (ERRL) that supports exploration by providing more stochastic policies, which tends to improve optimisation. Empirically, the low variance ERRL offers RL training fast and stable. We also introduce a combination of local search operators during test time, which significantly improves solution and complement ERRL. We qualitatively demonstrate that for vehicle routing problems, a policy with higher entropy can make the optimisation landscape smooth which makes it easier to optimise. The quantitative evaluation shows that the performance of the model is comparable with the state-of-the-art variants. In our evaluation, we experimentally illustrate that the model produces state-of-the-art performance on variants of Vehicle Routing problems such as Capacitated Vehicle Routing Problem (CVRP), Multiple Routing with Fixed Fleet Problems (MRPFF) and Travelling Salesman problem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/31/2022

Sample-Efficient, Exploration-Based Policy Optimisation for Routing Problems

Model-free deep-reinforcement-based learning algorithms have been applie...
research
06/14/2022

Solving the capacitated vehicle routing problem with timing windows using rollouts and MAX-SAT

The vehicle routing problem is a well known class of NP-hard combinatori...
research
09/17/2021

Learning Enhanced Optimisation for Routing Problems

Deep learning approaches have shown promising results in solving routing...
research
01/05/2022

Supervised Permutation Invariant Networks for Solving the CVRP with Bounded Fleet Size

Learning to solve combinatorial optimization problems, such as the vehic...
research
07/22/2023

Unlocking Carbon Reduction Potential with Reinforcement Learning for the Three-Dimensional Loading Capacitated Vehicle Routing Problem

Heavy goods vehicles are vital backbones of the supply chain delivery sy...
research
10/26/2021

Learning Collaborative Policies to Solve NP-hard Routing Problems

Recently, deep reinforcement learning (DRL) frameworks have shown potent...
research
12/19/2020

Multi-Decoder Attention Model with Embedding Glimpse for Solving Vehicle Routing Problems

We present a novel deep reinforcement learning method to learn construct...

Please sign up or login with your details

Forgot password? Click here to reset