Reinforcement Learning in POMDPs with Memoryless Options and Option-Observation Initiation Sets

08/22/2017
by   Denis Steckelmacher, et al.
0

Many real-world reinforcement learning problems have a hierarchical nature, and often exhibit some degree of partial observability. While hierarchy and partial observability are usually tackled separately (for instance by combining recurrent neural networks and options), we show that addressing both problems simultaneously is simpler and more efficient in many cases. More specifically, we make the initiation set of options conditional on the previously-executed option, and show that options with such Option-Observation Initiation Sets (OOIs) are at least as expressive as Finite State Controllers (FSCs), a state-of-the-art approach for learning in POMDPs. OOIs are easy to design based on an intuitive description of the task, lead to explainable policies and keep the top-level and option policies memoryless. Our experiments show that OOIs allow agents to learn optimal policies in challenging POMDPs, while being much more sample-efficient than a recurrent neural network over options.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/21/2016

Options Discovery with Budgeted Reinforcement Learning

We consider the problem of learning hierarchical policies for Reinforcem...
research
01/06/2020

Optimal Options for Multi-Task Reinforcement Learning Under Time Constraints

Reinforcement learning can greatly benefit from the use of options as a ...
research
06/12/2022

Matching options to tasks using Option-Indexed Hierarchical Reinforcement Learning

The options framework in Hierarchical Reinforcement Learning breaks down...
research
04/18/2023

Option-Driven Design: Context, Tradeoffs, and Considerations for Accessibility

In Option-Driven Design, users must interact with options and settings f...
research
06/24/2019

DynoPlan: Combining Motion Planning and Deep Neural Network based Controllers for Safe HRL

Many realistic robotics tasks are best solved compositionally, through c...
research
04/27/2016

Classifying Options for Deep Reinforcement Learning

In this paper we combine one method for hierarchical reinforcement learn...
research
09/30/2022

Multi-Task Option Learning and Discovery for Stochastic Path Planning

This paper addresses the problem of reliably and efficiently solving bro...

Please sign up or login with your details

Forgot password? Click here to reset