A Direct Approximation of AIXI Using Logical State Abstractions

10/13/2022
by   Samuel Yang-Zhao, et al.
0

We propose a practical integration of logical state abstraction with AIXI, a Bayesian optimality notion for reinforcement learning agents, to significantly expand the model class that AIXI agents can be approximated over to complex history-dependent and structured environments. The state representation and reasoning framework is based on higher-order logic, which can be used to define and enumerate complex features on non-Markovian and structured environments. We address the problem of selecting the right subset of features to form state abstractions by adapting the Φ-MDP optimisation criterion from state abstraction theory. Exact Bayesian model learning is then achieved using a suitable generalisation of Context Tree Weighting over abstract state sequences. The resultant architecture can be integrated with different planning algorithms. Experimental results on controlling epidemics on large-scale contact networks validates the agent's performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/18/2021

MDP Abstraction with Successor Features

Abstraction plays an important role for generalisation of knowledge and ...
research
12/26/2021

Reducing Planning Complexity of General Reinforcement Learning with Non-Markovian Abstractions

The field of General Reinforcement Learning (GRL) formulates the problem...
research
01/15/2017

Near Optimal Behavior via Approximate State Abstraction

The combinatorial explosion that plagues planning and reinforcement lear...
research
10/15/2021

Dynamic probabilistic logic models for effective abstractions in RL

State abstraction enables sample-efficient learning and better task tran...
research
06/09/2009

Feature Reinforcement Learning: Part I: Unstructured MDPs

General-purpose, intelligent, learning agents cycle through sequences of...
research
03/01/2022

A Theory of Abstraction in Reinforcement Learning

Reinforcement learning defines the problem facing agents that learn to m...
research
11/10/2022

Switching Attention in Time-Varying Environments via Bayesian Inference of Abstractions

Motivated by the goal of endowing robots with a means for focusing atten...

Please sign up or login with your details

Forgot password? Click here to reset