Constrained Combinatorial Optimization with Reinforcement Learning

06/22/2020
by   Ruben Solozabal, et al.
0

This paper presents a framework to tackle constrained combinatorial optimization problems using deep Reinforcement Learning (RL). To this end, we extend the Neural Combinatorial Optimization (NCO) theory in order to deal with constraints in its formulation. Notably, we propose defining constrained combinatorial problems as fully observable Constrained Markov Decision Processes (CMDP). In that context, the solution is iteratively constructed based on interactions with the environment. The model, in addition to the reward signal, relies on penalty signals generated from constraint dissatisfaction to infer a policy that acts as a heuristic algorithm. Moreover, having access to the complete state representation during the optimization process allows us to rely on memory-less architectures, enhancing the results obtained in previous sequence-to-sequence approaches. Conducted experiments on the constrained Job Shop and Resource Allocation problems prove the superiority of the proposal for computing rapid solutions when compared to classical heuristic, metaheuristic, and Constraint Programming (CP) solvers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/02/2020

Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization

Combinatorial optimization has found applications in numerous fields, fr...
research
01/03/2019

Constrained optimization under uncertainty for decision-making problems: Application to Real-Time Strategy games

Decision-making problems can be modeled as combinatorial optimization pr...
research
04/06/2021

Ecole: A Library for Learning Inside MILP Solvers

In this paper we describe Ecole (Extensible Combinatorial Optimization L...
research
11/12/2019

Combinatorial Optimization by Graph Pointer Networks and Hierarchical Reinforcement Learning

In this work, we introduce Graph Pointer Networks (GPNs) trained using r...
research
02/11/2022

Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems

Over the recent years, reinforcement learning (RL) has shown impressive ...
research
01/09/2023

BQ-NCO: Bisimulation Quotienting for Generalizable Neural Combinatorial Optimization

Despite the success of Neural Combinatorial Optimization methods for end...
research
01/03/2022

Neural combinatorial optimization beyond the TSP: Existing architectures under-represent graph structure

Recent years have witnessed the promise that reinforcement learning, cou...

Please sign up or login with your details

Forgot password? Click here to reset