Learning Heuristic Selection with Dynamic Algorithm Configuration

06/15/2020
by David Speck, et al.

A key challenge in satisficing planning is to use multiple heuristics within one heuristic search. Aggregating multiple heuristic estimates, for example by taking the maximum, has the disadvantage that bad estimates of a single heuristic can negatively affect the whole search. Since the performance of a heuristic varies from instance to instance, approaches such as algorithm selection can be applied successfully. In addition, alternating between multiple heuristics during the search makes it possible to use all heuristics equally and improve performance. However, all these approaches ignore the internal search dynamics of a planning system, which can help to select the most helpful heuristic for the current expansion step. We show that dynamic algorithm configuration can be used for dynamic heuristic selection that takes the internal search dynamics of a planning system into account. Furthermore, we prove that this approach generalizes over existing approaches and that it can exponentially improve the performance of the heuristic search. To learn dynamic heuristic selection, we propose an approach based on reinforcement learning and show empirically that domain-wise learned policies, which take the internal search dynamics of a planning system into account, can exceed existing approaches in terms of coverage.
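To make the contrast between the selection schemes concrete, the following is a minimal sketch, not the authors' implementation. It assumes a greedy best-first search with one open list per heuristic and a `policy` callback that picks which open list to expand from at each step. All names (`multi_heuristic_gbfs`, `alternation`, `static`, `lowest_h`), the feature vector (the minimum h-value of each open list), and the toy grid problem are hypothetical choices made for illustration. The `lowest_h` policy is a hand-written stand-in for a learned policy; the reinforcement learning step described in the abstract is not reproduced here.

```python
import heapq
from itertools import count


def multi_heuristic_gbfs(start, is_goal, successors, heuristics, policy):
    """Greedy best-first search with one open list per heuristic.

    policy(step, features) returns the index of the open list to expand
    from; features[i] is the smallest h-value in open list i (assumed
    feature set, chosen only for this sketch).
    """
    tie = count()  # unique tie-breaker so heap entries never compare states
    open_lists = [[] for _ in heuristics]
    parent = {start: None}

    def push(state):
        # Every generated state is inserted into every open list.
        for i, h in enumerate(heuristics):
            heapq.heappush(open_lists[i], (h(state), next(tie), state))

    push(start)
    closed = set()
    expansions = 0
    while any(open_lists):
        features = [ol[0][0] if ol else float("inf") for ol in open_lists]
        i = policy(expansions, features)
        if not open_lists[i]:  # fall back to any non-empty open list
            i = next(j for j, ol in enumerate(open_lists) if ol)
        _, _, state = heapq.heappop(open_lists[i])
        if state in closed:  # already expanded via another open list
            continue
        closed.add(state)
        expansions += 1
        if is_goal(state):
            path = []
            while state is not None:
                path.append(state)
                state = parent[state]
            return path[::-1], expansions
        for succ in successors(state):
            if succ not in parent:
                parent[succ] = state
                push(succ)
    return None, expansions


def alternation(step, features):
    """Round-robin over heuristics; ignores the search dynamics."""
    return step % len(features)


def static(i):
    """Always use heuristic i, as a per-instance selection would."""
    return lambda step, features: i


def lowest_h(step, features):
    """Hand-written dynamic policy: expand from the most promising list."""
    return min(range(len(features)), key=features.__getitem__)


# Toy problem (hypothetical): move on a 5x5 grid from (0, 0) to (4, 4).
SIZE, START, GOAL = 5, (0, 0), (4, 4)


def grid_successors(state):
    x, y = state
    for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
        nx, ny = x + dx, y + dy
        if 0 <= nx < SIZE and 0 <= ny < SIZE:
            yield (nx, ny)


# Two heuristics, each informative along one axis only.
HEURISTICS = [lambda s: abs(GOAL[0] - s[0]), lambda s: abs(GOAL[1] - s[1])]

for name, pol in [("static", static(0)), ("alternation", alternation),
                  ("dynamic", lowest_h)]:
    path, n = multi_heuristic_gbfs(START, lambda s: s == GOAL,
                                   grid_successors, HEURISTICS, pol)
    print(name, "expansions:", n, "path length:", len(path))
```

In this sketch a static choice and round-robin alternation are both special cases of the policy interface: they are policies that ignore `features`. That mirrors the abstract's claim that dynamic heuristic selection generalizes the existing approaches, although the sketch itself proves nothing about performance.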


