Population-Based Reinforcement Learning for Combinatorial Optimization

10/07/2022
by   Nathan Grinsztajn, et al.
0

Applying reinforcement learning (RL) to combinatorial optimization problems is attractive as it removes the need for expert knowledge or pre-solved instances. However, it is unrealistic to expect an agent to solve these (often NP-)hard problems in a single shot at inference due to their inherent complexity. Thus, leading approaches often implement additional search strategies, from stochastic sampling and beam-search to explicit fine-tuning. In this paper, we argue for the benefits of learning a population of complementary policies, which can be simultaneously rolled out at inference. To this end, we introduce Poppy, a simple theoretically grounded training procedure for populations. Instead of relying on a predefined or hand-crafted notion of diversity, Poppy induces an unsupervised specialization targeted solely at maximizing the performance of the population. We show that Poppy produces a set of complementary policies, and obtains state-of-the-art RL results on three popular NP-hard problems: the traveling salesman (TSP), the capacitated vehicle routing (CVRP), and 0-1 knapsack (KP) problems. On TSP specifically, Poppy outperforms the previous state-of-the-art, dividing the optimality gap by 5 while reducing the inference time by more than an order of magnitude.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/30/2020

POMO: Policy Optimization with Multiple Optima for Reinforcement Learning

In neural combinatorial optimization (CO), reinforcement learning (RL) c...
research
09/09/2019

Exploratory Combinatorial Optimization with Reinforcement Learning

Many real-world problems can be reduced to combinatorial optimization on...
research
01/14/2022

Reinforcement Learning to Solve NP-hard Problems: an Application to the CVRP

In this paper, we evaluate the use of Reinforcement Learning (RL) to sol...
research
06/09/2021

Efficient Active Search for Combinatorial Optimization Problems

Recently numerous machine learning based methods for combinatorial optim...
research
05/27/2022

Learning to Solve Combinatorial Graph Partitioning Problems via Efficient Exploration

From logistics to the natural sciences, combinatorial optimisation on gr...
research
07/21/2023

JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning

In this paper, we present JoinGym, an efficient and lightweight query op...
research
12/24/2021

An Efficient Combinatorial Optimization Model Using Learning-to-Rank Distillation

Recently, deep reinforcement learning (RL) has proven its feasibility in...

Please sign up or login with your details

Forgot password? Click here to reset