Learning 2-opt Heuristics for the Traveling Salesman Problem via Deep Reinforcement Learning

04/03/2020
by Paulo R. de O. da Costa, et al.

Recent works using deep learning to solve the Traveling Salesman Problem (TSP) have focused on learning construction heuristics. Such approaches find TSP solutions of good quality but require additional procedures, such as beam search and sampling, to improve solutions and achieve state-of-the-art performance. However, few studies have focused on improvement heuristics, where a given solution is improved until a near-optimal one is reached. In this work, we propose to learn a local search heuristic based on 2-opt operators via deep reinforcement learning. We propose a policy gradient algorithm to learn a stochastic policy that selects 2-opt operations given a current solution. Moreover, we introduce a policy neural network that leverages a pointing attention mechanism, which, unlike previous works, can be easily extended to more general k-opt moves. Our results show that the learned policies can improve even random initial solutions and approach near-optimal solutions at a faster rate than previous state-of-the-art deep learning methods.
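For readers unfamiliar with the 2-opt operator mentioned in the abstract, the following is a minimal sketch of a single 2-opt move on a closed tour. It is not the authors' implementation: the function names `tour_length` and `two_opt_move`, the toy coordinates, and the hard-coded indices are illustrative assumptions. In the paper's setting, a learned policy would choose the pair of positions; here they are fixed by hand.

```python
import math


def tour_length(coords, tour):
    """Total Euclidean length of the closed tour (returns to the start)."""
    n = len(tour)
    return sum(
        math.dist(coords[tour[k]], coords[tour[(k + 1) % n]])
        for k in range(n)
    )


def two_opt_move(tour, i, j):
    """Return a new tour with the segment tour[i..j] reversed (requires i < j).

    Reversing the segment replaces the edges entering position i and
    leaving position j with two new edges, which is the standard 2-opt move.
    """
    return tour[:i] + tour[i:j + 1][::-1] + tour[j + 1:]


# Hypothetical toy instance: the corners of a unit square,
# visited in an order whose edges cross.
coords = [(0.0, 0.0), (1.0, 1.0), (1.0, 0.0), (0.0, 1.0)]
tour = [0, 1, 2, 3]

# Positions (1, 2) are chosen by hand here; a learned policy would select them.
improved = two_opt_move(tour, 1, 2)

print(tour_length(coords, tour))
print(tour_length(coords, improved))
```

In this toy instance the move removes the crossing, so the second printed length is smaller than the first. An improvement heuristic applies such moves repeatedly, and the choice of which positions to reverse at each step is what the policy is trained to make.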


research
12/12/2019

Learning Improvement Heuristics for Solving the Travelling Salesman Problem

Recent studies in using deep learning to solve the Travelling Salesman P...
research
10/06/2021

Improving Generalization of Deep Reinforcement Learning-based TSP Solvers

Recent work applying deep reinforcement learning (DRL) to solve travelin...
research
07/09/2021

Attend2Pack: Bin Packing through Deep Reinforcement Learning with Attention

This paper seeks to tackle the bin packing problem (BPP) through a learn...
research
09/30/2018

Learning to Progressively Plan

For problem solving, making reactive decisions based on problem descript...
research
07/20/2018

Learning Heuristics for Automated Reasoning through Deep Reinforcement Learning

We demonstrate how to learn efficient heuristics for automated reasoning...
research
11/20/2022

Learning to Search for Job Shop Scheduling via Deep Reinforcement Learning

Recent studies in using deep reinforcement learning (DRL) to solve Job-s...
research
12/19/2020

Multi-Decoder Attention Model with Embedding Glimpse for Solving Vehicle Routing Problems

We present a novel deep reinforcement learning method to learn construct...
