TauRieL: Targeting Traveling Salesman Problem with a deep reinforcement learning inspired architecture

05/14/2019
by   Gorker Alp Malazgirt, et al.
0

In this paper, we propose TauRieL and target Traveling Salesman Problem (TSP) since it has broad applicability in theoretical and applied sciences. TauRieL utilizes an actor-critic inspired architecture that adopts ordinary feedforward nets to obtain a policy update vector v. Then, we use v to improve the state transition matrix from which we generate the policy. Also, the state transition matrix allows the solver to initialize from precomputed solutions such as nearest neighbors. In an online learning setting, TauRieL unifies the training and the search where it can generate near-optimal results in seconds. The input to the neural nets in the actor-critic architecture are raw 2-D inputs, and the design idea behind this decision is to keep neural nets relatively smaller than the architectures with wide embeddings with the tradeoff of omitting any distributed representations of the embeddings. Consequently, TauRieL generates TSP solutions two orders of magnitude faster per TSP instance as compared to state-of-the-art offline techniques with a performance impact of 6.1% in the worst case.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/14/2019

On the Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost

Despite the empirical success of the actor-critic algorithm, its theoret...
research
02/23/2021

Good Actors can come in Smaller Sizes: A Case Study on the Value of Actor-Critic Asymmetry

Actors and critics in actor-critic reinforcement learning algorithms are...
research
10/28/2017

Diff-DAC: Distributed Actor-Critic for Average Multitask Deep Reinforcement Learning

We propose a fully distributed actor-critic algorithm approximated by de...
research
12/18/2022

Neural Coreference Resolution based on Reinforcement Learning

The target of a coreference resolution system is to cluster all mentions...

Please sign up or login with your details

Forgot password? Click here to reset