Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization

06/02/2020
by   Quentin Cappart, et al.
0

Combinatorial optimization has found applications in numerous fields, from aerospace to transportation planning and economics. The goal is to find an optimal solution among a finite set of possibilities. The well-known challenge one faces with combinatorial optimization is the state-space explosion problem: the number of possibilities grows exponentially with the problem size, which makes solving intractable for large problems. In the last years, deep reinforcement learning (DRL) has shown its promise for designing good heuristics dedicated to solve NP-hard combinatorial optimization problems. However, current approaches have two shortcomings: (1) they mainly focus on the standard travelling salesman problem and they cannot be easily extended to other problems, and (2) they only provide an approximate solution with no systematic ways to improve it or to prove optimality. In another context, constraint programming (CP) is a generic tool to solve combinatorial optimization problems. Based on a complete search procedure, it will always find the optimal solution if we allow an execution time large enough. A critical design choice, that makes CP non-trivial to use in practice, is the branching decision, directing how the search space is explored. In this work, we propose a general and hybrid approach, based on DRL and CP, for solving combinatorial optimization problems. The core of our approach is based on a dynamic programming formulation, that acts as a bridge between both techniques. We experimentally show that our solver is efficient to solve two challenging problems: the traveling salesman problem with time windows, and the 4-moments portfolio optimization problem. Results obtained show that the framework introduced outperforms the stand-alone RL and CP solutions, while being competitive with industrial solvers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/18/2021

SeaPearl: A Constraint Programming Solver guided by Reinforcement Learning

The design of efficient and generic algorithms for solving combinatorial...
research
11/26/2022

Domain-Independent Dynamic Programming: Generic State Space Search for Combinatorial Optimization

For combinatorial optimization problems, model-based approaches such as ...
research
06/22/2020

Constrained Combinatorial Optimization with Reinforcement Learning

This paper presents a framework to tackle constrained combinatorial opti...
research
05/15/2020

MineReduce: an approach based on data mining for problem size reduction

Hybrid variations of metaheuristics that include data mining strategies ...
research
05/30/2018

Generic CP-Supported CMSA for Binary Integer Linear Programs

Construct, Merge, Solve and Adapt (CMSA) is a general hybrid metaheurist...
research
06/02/2023

Symmetric Exploration in Combinatorial Optimization is Free!

Recently, deep reinforcement learning (DRL) has shown promise in solving...
research
07/12/2019

Learning to Handle Parameter Perturbations in Combinatorial Optimization: an Application to Facility Location

We present an approach to couple the resolution of Combinatorial Optimiz...

Please sign up or login with your details

Forgot password? Click here to reset