Action Schema Networks: Generalised Policies with Deep Learning

09/13/2017
by   Sam Toyer, et al.
0

In this paper, we introduce the Action Schema Network (ASNet): a neural network architecture for learning generalised policies for probabilistic planning problems. By mimicking the relational structure of planning problems, ASNets are able to adopt a weight-sharing scheme which allows the network to be applied to any problem from a given planning domain. This allows the cost of training the network to be amortised over all problems in that domain. Further, we propose a training method which balances exploration and supervised training on small problems to produce a policy which remains robust when evaluated on larger problems. In experiments, we show that ASNet's learning capability allows it to significantly outperform traditional non-learning planners in several challenging domains.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/04/2019

ASNets: Deep Learning for Generalised Planning

In this paper, we discuss the learning of generalised policies for proba...
research
02/18/2020

Generalized Neural Policies for Relational MDPs

A Relational Markov Decision Process (RMDP) is a first-order representat...
research
03/13/2019

Computing the Scope of Applicability for Acquired Task Knowledge in Experience-Based Planning Domains

Experience-based planning domains have been proposed to improve problem ...
research
02/27/2019

Learning Task Knowledge and its Scope of Applicability in Experience-Based Planning Domains

Experience-based planning domains (EBPDs) have been recently proposed to...
research
02/08/2019

Size Independent Neural Transfer for RDDL Planning

Neural planners for RDDL MDPs produce deep reactive policies in an offli...
research
06/08/2022

Planning with Dynamically Estimated Action Costs

Information about action costs is critical for real-world AI planning ap...
research
06/14/2017

Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics

The recent adaptation of deep neural network-based methods to reinforcem...

Please sign up or login with your details

Forgot password? Click here to reset