Learning Classical Planning Strategies with Policy Gradient

10/23/2018
by   Pawel Gomoluch, et al.
0

A common paradigm in classical planning is heuristic forward search. Forward search planners often rely on relatively simple best-first search algorithm, which remains fixed throughout the search process. In this paper, we introduce a novel search framework capable of alternating between several forward search approaches while solving a particular planning problem. Selection of the approach is performed using a trainable stochastic policy. This enables tailoring the search strategy to a particular distribution of planning problems and a selected performance metric, such as the IPC score or running time. We construct a strategy space using five search algorithms and a two-dimensional representation of the planner's state. Strategies are then trained on randomly generated planning problems using policy gradient. Experimental results show that the learner is able to discover domain-specific search strategies, thus improving the planner's performance with respect to the chosen metric.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/27/2019

Learning Neural Search Policies for Classical Planning

Heuristic forward search is currently the dominant paradigm in classical...
research
05/27/2011

Adaptive Parallel Iterative Deepening Search

Many of the artificial intelligence techniques developed to date rely on...
research
01/10/2013

A Bayesian Approach to Tackling Hard Computational Problems

We are developing a general framework for using learned Bayesian models ...
research
06/03/2011

The FF Planning System: Fast Plan Generation Through Heuristic Search

We describe and evaluate the algorithmic techniques that are used in the...
research
06/01/2011

Planning Graph as a (Dynamic) CSP: Exploiting EBL, DDB and other CSP Search Techniques in Graphplan

This paper reviews the connections between Graphplan's planning-graph an...
research
02/28/2020

Policy-Aware Model Learning for Policy Gradient Methods

This paper considers the problem of learning a model in model-based rein...
research
02/12/2021

Robust and Efficient Planning using Adaptive Entropy Tree Search

In this paper, we present the Adaptive EntropyTree Search (ANTS) algorit...

Please sign up or login with your details

Forgot password? Click here to reset