Deep Policy Dynamic Programming for Vehicle Routing Problems

02/23/2021
by Wouter Kool, et al.

Routing problems are a class of combinatorial problems with many practical applications. Recently, end-to-end deep learning methods have been proposed to learn approximate solution heuristics for such problems. In contrast, classical dynamic programming (DP) algorithms can find optimal solutions, but scale poorly with problem size. We propose Deep Policy Dynamic Programming (DPDP), which aims to combine the strengths of learned neural heuristics with those of DP algorithms. DPDP prioritizes and restricts the DP state space using a policy derived from a deep neural network, which is trained to predict edges from example solutions. We evaluate our framework on the travelling salesman problem (TSP) and the vehicle routing problem (VRP) and show that the neural policy improves the performance of (restricted) DP algorithms, making them competitive with strong alternatives such as LKH, while also outperforming other "neural approaches" for solving TSPs and VRPs with 100 nodes.
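The idea of restricting a DP state space with a policy can be illustrated with a minimal sketch for the TSP. This is not the authors' implementation: the function name `restricted_dp_tsp`, the `beam_size` parameter, and the scoring rule (summing edge scores along the partial tour) are assumptions made for illustration, and the `heat` matrix here is a hand-made stand-in for the edge predictions that a trained neural network would supply.

```python
import math


def restricted_dp_tsp(dist, heat, beam_size):
    """Restricted DP (beam search over DP states) for a TSP tour starting at node 0.

    dist: n x n distance matrix.
    heat: n x n edge scores, higher means more promising (hypothetical stand-in
          for a learned edge predictor).
    A DP state is (set of visited nodes, current node). At each step only the
    `beam_size` highest-scoring states are kept.
    """
    n = len(dist)
    # Map each state to (cost so far, policy score so far, path so far).
    beam = {(frozenset([0]), 0): (0.0, 0.0, [0])}
    for _ in range(n - 1):
        candidates = {}
        for (visited, cur), (cost, score, path) in beam.items():
            for nxt in range(n):
                if nxt in visited:
                    continue
                key = (visited | {nxt}, nxt)
                new = (cost + dist[cur][nxt], score + heat[cur][nxt], path + [nxt])
                # DP dominance: per state, keep only the cheapest partial tour.
                if key not in candidates or new[0] < candidates[key][0]:
                    candidates[key] = new
        # Policy-based restriction: keep the top `beam_size` states by score.
        ranked = sorted(candidates.items(), key=lambda kv: -kv[1][1])
        beam = dict(ranked[:beam_size])
    # Close each surviving tour by returning to node 0 and pick the cheapest.
    best_cost, best_path = min(
        (cost + dist[path[-1]][0], path) for cost, _, path in beam.values()
    )
    return best_cost, best_path + [0]


# Usage on four corners of a unit square; negative distance is used as a
# placeholder edge score (assumption, not a learned heat map).
points = [(0, 0), (0, 1), (1, 1), (1, 0)]
dist = [[math.dist(a, b) for b in points] for a in points]
heat = [[-d for d in row] for row in dist]
cost, tour = restricted_dp_tsp(dist, heat, beam_size=10)
print(cost, tour)
```

When `beam_size` is at least the number of reachable states, nothing is pruned and the search reduces to exact DP; smaller values trade optimality for speed, which is where the quality of the edge scores matters.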

research
11/26/2022

Domain-Independent Dynamic Programming: Generic State Space Search for Combinatorial Optimization

For combinatorial optimization problems, model-based approaches such as ...
research
02/11/2018

Differentiable Dynamic Programming for Structured Prediction and Attention

Dynamic programming (DP) solves a variety of structured combinatorial pr...
research
12/02/2018

End-to-end Learning of Convolutional Neural Net and Dynamic Programming for Left Ventricle Segmentation

Differentiable programming is able to combine different functions or pro...
research
03/21/2020

DP-Net: Dynamic Programming Guided Deep Neural Network Compression

In this work, we propose an effective scheme (called DP-Net) for compres...
research
05/14/2019

An analytical bound on the fleet size in vehicle routing problems: a dynamic programming approach

We present an analytical upper bound on the number of required vehicles ...
research
08/05/2020

Solving Dynamic Programming Problem by Pipeline Implementation on GPU

In this paper, we show the effectiveness of a pipeline implementation of...
research
06/11/2019

Macro-action Multi-timescale Dynamic Programming for Energy Management with Phase Change Materials

This paper focuses on home energy management systems (HEMS) in buildings...
