Learning Neural-Symbolic Descriptive Planning Models via Cube-Space Priors: The Voyage Home (to STRIPS)

04/27/2020
by   Masataro Asai, et al.
0

We achieved a new milestone in the difficult task of enabling agents to learn about their environment autonomously. Our neuro-symbolic architecture is trained end-to-end to produce a succinct and effective discrete state transition model from images alone. Our target representation (the Planning Domain Definition Language) is already in a form that off-the-shelf solvers can consume, and opens the door to the rich array of modern heuristic search capabilities. We demonstrate how the sophisticated innate prior we place on the learning process significantly reduces the complexity of the learned representation, and reveals a connection to the graph-theoretic notion of "cube-like graphs", thus opening the door to a deeper understanding of the ideal properties for learned symbolic representations. We show that the powerful domain-independent heuristics allow our system to solve visual 15-Puzzle instances which are beyond the reach of blind search, without resorting to the Reinforcement Learning approach that requires a huge amount of training on the domain-dependent reward information.

READ FULL TEXT
research
05/28/2021

Learning Neuro-Symbolic Relational Transition Models for Bilevel Planning

Despite recent, independent progress in model-based reinforcement learni...
research
09/12/2019

Learning First-Order Symbolic Planning Representations from Plain Graphs

One of the main obstacles for developing flexible AI system is the split...
research
04/29/2017

Classical Planning in Deep Latent Space: Bridging the Subsymbolic-Symbolic Boundary

Current domain-independent, classical planners require symbolic models o...
research
11/25/2022

Learning Visual Planning Models from Partially Observed Images

There has been increasing attention on planning model learning in classi...
research
04/01/2022

Symbolic Search for Optimal Planning with Expressive Extensions

In classical planning, the goal is to derive a course of actions that al...
research
09/20/2022

Graph Value Iteration

In recent years, deep Reinforcement Learning (RL) has been successful in...
research
07/10/2020

Learning Generalized Relational Heuristic Networks for Model-Agnostic Planning

Computing goal-directed behavior (sequential decision-making, or plannin...

Please sign up or login with your details

Forgot password? Click here to reset