Learning Generalized Reactive Policies using Deep Neural Networks

08/24/2017
by   Edward Groshev, et al.
0

We consider the problem of learning for planning, where knowledge acquired while planning is reused to plan faster in new problem instances. For robotic tasks, among others, plan execution can be captured as a sequence of visual images. For such domains, we propose to use deep neural networks in learning for planning, based on learning a reactive policy that imitates execution traces produced by a planner. We investigate architectural properties of deep networks that are suitable for learning long-horizon planning behavior, and explore how to learn, in addition to the policy, a heuristic function that can be used with classical planners or search algorithms such as A*. Our results on the challenging Sokoban domain show that, with a suitable network design, complex decision making policies and powerful heuristic functions can be learned through imitation.

READ FULL TEXT
research
02/27/2013

Integrating Planning and Execution in Stochastic Domains

We investigate planning in time-critical domains represented as Markov D...
research
12/03/2021

Heuristic Search Planning with Deep Neural Networks using Imitation, Attention and Curriculum Learning

Learning a well-informed heuristic function for hard task planning domai...
research
08/04/2019

ASNets: Deep Learning for Generalised Planning

In this paper, we discuss the learning of generalised policies for proba...
research
10/26/2018

Transfer of Deep Reactive Policies for MDP Planning

Domain-independent probabilistic planners input an MDP description in a ...
research
04/21/2022

PG3: Policy-Guided Planning for Generalized Policy Generation

A longstanding objective in classical planning is to synthesize policies...
research
04/05/2019

Scalable Nonlinear Planning with Deep Neural Network Learned Transition Models

In many real-world planning problems with factored, mixed discrete and c...
research
03/31/2016

Reactive Policies with Planning for Action Languages

We describe a representation in a high-level transition system for polic...

Please sign up or login with your details

Forgot password? Click here to reset