Learning discrete state abstractions with deep variational inference

03/09/2020
by   Ondrej Biza, et al.
48

Abstraction is crucial for effective sequential decision making in domains with large state spaces. In this work, we propose a variational information bottleneck method for learning approximate bisimulations, a type of state abstraction. We use a deep neural net encoder to map states onto continuous embeddings. The continuous latent space is then compressed into a discrete representation using an action-conditioned hidden Markov model, which is trained end-to-end with the neural network. Our method is suited for environments with high-dimensional states and learns from a stream of experience collected by an agent acting in a Markov decision process. Through a learned discrete abstract model, we can efficiently plan for unseen goals in a multi-goal Reinforcement Learning setting. We test our method in simplified robotic manipulation domains with image states. We also compare it against previous model-based approaches to finding bisimulations in discrete grid-world-like environments.

READ FULL TEXT

page 4

page 6

page 7

page 8

page 9

page 11

page 12

page 14

research
06/07/2022

Discrete State-Action Abstraction via the Successor Representation

When reinforcement learning is applied with sparse rewards, agents must ...
research
05/23/2018

Variational Inference for Data-Efficient Model Learning in POMDPs

Partially observable Markov decision processes (POMDPs) are a powerful a...
research
09/11/2019

Correlation Priors for Reinforcement Learning

Many decision-making problems naturally exhibit pronounced structures in...
research
08/30/2022

An Analysis of Abstracted Model-Based Reinforcement Learning

Many methods for Model-based Reinforcement learning (MBRL) provide guara...
research
03/29/2023

Ideal Abstractions for Decision-Focused Learning

We present a methodology for formulating simplifying abstractions in mac...
research
07/21/2023

Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors

Large language models (LLMs) are being applied as actors for sequential ...
research
04/26/2022

Learning Value Functions from Undirected State-only Experience

This paper tackles the problem of learning value functions from undirect...

Please sign up or login with your details

Forgot password? Click here to reset