Learning Large Neighborhood Search Policy for Integer Programming

11/01/2021
by Yaoxin Wu, et al.

We propose a deep reinforcement learning (RL) method to learn a large neighborhood search (LNS) policy for integer programming (IP). The RL policy is trained as the destroy operator, which selects a subset of variables at each step; an IP solver then reoptimizes this subset as the repair operator. However, the number of possible variable subsets grows combinatorially, which prevents direct application of typical RL algorithms. To tackle this challenge, we represent all subsets by factorizing them into binary decisions on each variable. We then design a neural network to learn policies for each variable in parallel, trained by a customized actor-critic algorithm. We evaluate the proposed method on four representative IP problems. Results show that it can find better solutions than SCIP in much less time, and that it significantly outperforms other LNS baselines with the same runtime. Moreover, these advantages notably persist when the policies generalize to larger problems. Further experiments with Gurobi also reveal that our method can outperform this state-of-the-art commercial solver within the same time limit.
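The destroy/repair loop and the per-variable factorization described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the toy knapsack instance, the fixed Bernoulli probabilities `probs` (standing in for the learned network output), the brute-force `repair` (standing in for an IP solver such as SCIP or Gurobi), and all function names are assumptions made for illustration only.

```python
import itertools
import math
import random


def objective(values, x):
    """Total value of the selected items (to be maximized)."""
    return sum(v * xi for v, xi in zip(values, x))


def subset_log_prob(probs, free):
    """Log-probability of a subset under independent per-variable binary decisions.

    Illustrates the factorization: one Bernoulli decision per variable instead
    of one action per subset. Assumes every probability is strictly in (0, 1).
    """
    chosen = set(free)
    return sum(math.log(p if i in chosen else 1.0 - p) for i, p in enumerate(probs))


def repair(values, weights, capacity, x, free):
    """Re-optimize only the variables in `free`; all others keep their values.

    Brute force is a stand-in for an IP solver (an assumption of this sketch).
    """
    best, best_val = list(x), objective(values, x)
    for bits in itertools.product([0, 1], repeat=len(free)):
        cand = list(x)
        for i, b in zip(free, bits):
            cand[i] = b
        feasible = sum(w * c for w, c in zip(weights, cand)) <= capacity
        if feasible and objective(values, cand) > best_val:
            best, best_val = cand, objective(values, cand)
    return best


def lns(values, weights, capacity, probs, steps=20, seed=0):
    """Large neighborhood search with a factorized (per-variable) destroy policy."""
    rng = random.Random(seed)
    x = [0] * len(values)  # the empty knapsack is always feasible
    for _ in range(steps):
        # Destroy: one independent binary decision per variable.
        free = [i for i, p in enumerate(probs) if rng.random() < p]
        # Repair: re-optimize the selected variables, keep the rest fixed.
        x = repair(values, weights, capacity, x, free)
    return x
```

Because the current assignment is always among the candidates that `repair` considers, the objective never decreases from one step to the next. With `n` variables there are `2**n` possible subsets, but the factorized policy only needs `n` probabilities, which is what makes the action space tractable in this sketch.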


