Training a Deep Q-Learning Agent Inside a Generic Constraint Programming Solver

01/05/2023
by   Tom Marty, et al.
7

Constraint programming is known for being an efficient approach for solving combinatorial problems. Important design choices in a solver are the branching heuristics, which are designed to lead the search to the best solutions in a minimum amount of time. However, developing these heuristics is a time-consuming process that requires problem-specific expertise. This observation has motivated many efforts to use machine learning to automatically learn efficient heuristics without expert intervention. To the best of our knowledge, it is still an open research question. Although several generic variable-selection heuristics are available in the literature, the options for a generic value-selection heuristic are more scarce. In this paper, we propose to tackle this issue by introducing a generic learning procedure that can be used to obtain a value-selection heuristic inside a constraint programming solver. This has been achieved thanks to the combination of a deep Q-learning algorithm, a tailored reward signal, and a heterogeneous graph neural network architecture. Experiments on graph coloring, maximum independent set, and maximum cut problems show that our framework is able to find better solutions close to optimality without requiring a large amounts of backtracks while being generic.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/18/2014

Counting-Based Search: Branching Heuristics for Constraint Satisfaction Problems

Designing a search heuristic for constraint programming that is reliable...
research
02/18/2021

SeaPearl: A Constraint Programming Solver guided by Reinforcement Learning

The design of efficient and generic algorithms for solving combinatorial...
research
05/10/2018

Learning Robust Search Strategies Using a Bandit-Based Approach

Effective solving of constraint problems often requires choosing good or...
research
03/06/2012

Search Combinators

The ability to model search in a constraint solver can be an essential a...
research
09/02/2010

Experimental Evaluation of Branching Schemes for the CSP

The search strategy of a CP solver is determined by the variable and val...
research
08/22/2022

One Model, Any CSP: Graph Neural Networks as Fast Global Search Heuristics for Constraint Satisfaction

We propose a universal Graph Neural Network architecture which can be tr...
research
10/31/2011

A Constraint Programming Approach for Solving a Queueing Control Problem

In a facility with front room and back room operations, it is useful to ...

Please sign up or login with your details

Forgot password? Click here to reset