Goal Agnostic Planning using Maximum Likelihood Paths in Hypergraph World Models

10/18/2021
by   Christopher Robinson, et al.
0

In this paper, we present a hypergraph–based machine learning algorithm, a datastructure–driven maintenance method, and a planning algorithm based on a probabilistic application of Dijkstra's algorithm. Together, these form a goal agnostic automated planning engine for an autonomous learning agent which incorporates beneficial properties of both classical Machine Learning and traditional Artificial Intelligence. We prove that the algorithm determines optimal solutions within the problem space, mathematically bound learning performance, and supply a mathematical model analyzing system state progression through time yielding explicit predictions for learning curves, goal achievement rates, and response to abstractions and uncertainty. To validate performance, we exhibit results from applying the agent to three archetypal planning problems, including composite hierarchical domains, and highlight empirical findings which illustrate properties elucidated in the analysis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/29/2019

Learning Domain-Independent Planning Heuristics with Hypergraph Networks

We present the first approach capable of learning domain-independent pla...
research
01/29/2021

Counterfactual Planning in AGI Systems

We present counterfactual planning as a design approach for creating a r...
research
01/25/2023

Leveraging Planning Landmarks for Hybrid Online Goal Recognition

Goal recognition is an important problem in many application domains (e....
research
06/21/2019

Hybrid Planning for Dynamic Multimodal Stochastic Shortest Paths

Sequential decision problems in applications such as manipulation in war...
research
03/19/2019

Why Couldn't You do that? Explaining Unsolvability of Classical Planning Problems in the Presence of Plan Advice

Explainable planning is widely accepted as a prerequisite for autonomous...
research
04/03/2017

Multi-Advisor Reinforcement Learning

We consider tackling a single-agent RL problem by distributing it to n l...

Please sign up or login with your details

Forgot password? Click here to reset