DAN: Decentralized Attention-based Neural Network to Solve the MinMax Multiple Traveling Salesman Problem

09/09/2021
by Yuhong Cao, et al.

The multiple traveling salesman problem (mTSP) is a well-known NP-hard problem with numerous real-world applications. In particular, this work addresses MinMax mTSP, where the objective is to minimize the maximum tour length (the sum of Euclidean distances along a tour) among all agents. The mTSP is normally treated as a combinatorial optimization problem, but due to its computational complexity, search-based exact and heuristic algorithms become inefficient as the number of cities increases. Encouraged by recent developments in deep reinforcement learning (dRL), this work considers the mTSP as a cooperative task and introduces a decentralized attention-based neural network method, named DAN, to solve the MinMax mTSP. In DAN, agents learn fully decentralized policies to collaboratively construct a tour by predicting the future decisions of other agents. Our model relies on the Transformer architecture and is trained using multi-agent RL with parameter sharing, which lets it scale naturally with the number of agents and cities. We experimentally demonstrate our model on small- to large-scale mTSP instances, which involve 50 to 1000 cities and 5 to 20 agents, and compare against state-of-the-art baselines. For small-scale problems (fewer than 100 cities), DAN closely matches the performance of the best solver available (OR Tools, a meta-heuristic solver) given the same computation time budget. On larger-scale instances, DAN outperforms both conventional and dRL-based solvers while keeping computation times low, and exhibits enhanced collaboration among agents.
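To make the MinMax objective described in the abstract concrete, the following minimal sketch evaluates a candidate solution. It is not taken from the paper: the function names, the single shared depot, and the toy coordinates are assumptions chosen for illustration only.

```python
import math

def tour_length(depot, cities):
    """Length of a closed tour: depot -> cities in order -> depot.

    Assumption: every agent starts and ends at one shared depot.
    """
    path = [depot, *cities, depot]
    return sum(math.dist(a, b) for a, b in zip(path, path[1:]))

def minmax_objective(depot, tours):
    """MinMax mTSP cost: the longest tour among all agents."""
    return max(tour_length(depot, t) for t in tours)

# Hypothetical toy instance with two agents.
depot = (0.0, 0.0)
tours = [
    [(3.0, 0.0), (3.0, 4.0)],  # agent 0 visits two cities
    [(0.0, 1.0)],              # agent 1 visits one city
]
cost = minmax_objective(depot, tours)
print(cost)
```

In this toy instance agent 0 travels 3 + 4 + 5 = 12 units and agent 1 travels 2, so the MinMax cost is set by agent 0. Moving a city from agent 0 to agent 1 would lower the objective even if the total distance traveled increased, which is why balancing workload across agents matters under this objective.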

research
03/09/2023

Intent-based Deep Reinforcement Learning for Multi-agent Informative Path Planning

In multi-agent informative path planning (MAIPP), agents must collective...
research
06/05/2023

Solving NP-hard Min-max Routing Problems as Sequential Generation with Equity Context

Min-max routing problems aim to minimize the maximum tour length among a...
research
10/08/2022

DIMES: A Differentiable Meta Solver for Combinatorial Optimization Problems

Recently, deep reinforcement learning (DRL) models have shown promising ...
research
07/07/2020

Curriculum learning for multilevel budgeted combinatorial problems

Learning heuristics for combinatorial optimization problems through grap...
research
03/10/2022

Organisations (de-)centralised to a greater or lesser degree for allocating cities in two Multiple Travelling Salesmen Problems

Decisions in organisations may be made either by a Central Authority (CA...
research
06/17/2020

Learning What to Defer for Maximum Independent Sets

Designing efficient algorithms for combinatorial optimization appears ub...
research
06/30/2011

Phase Transitions and Backbones of the Asymmetric Traveling Salesman Problem

In recent years, there has been much interest in phase transitions of co...
