Successor Options: An Option Discovery Framework for Reinforcement Learning

05/14/2019
by   Rahul Ramesh, et al.
6

The options framework in reinforcement learning models the notion of a skill or a temporally extended sequence of actions. The discovery of a reusable set of skills has typically entailed building options, that navigate to bottleneck states. This work adopts a complementary approach, where we attempt to discover options that navigate to landmark states. These states are prototypical representatives of well-connected regions and can hence access the associated region with relative ease. In this work, we propose Successor Options, which leverages Successor Representations to build a model of the state space. The intra-option policies are learnt using a novel pseudo-reward and the model scales to high-dimensional spaces easily. Additionally, we also propose an Incremental Successor Options model that iterates between constructing Successor Representations and building options, which is useful when robust Successor Representations cannot be built solely from primitive actions. We demonstrate the efficacy of our approach on a collection of grid-worlds, and on the high-dimensional robotic control environment of Fetch.

READ FULL TEXT

page 1

page 2

page 4

page 5

page 6

research
09/09/2019

Option Encoder: A Framework for Discovering a Policy Basis in Reinforcement Learning

Option discovery and skill acquisition frameworks are integral to the fu...
research
05/17/2016

Option Discovery in Hierarchical Reinforcement Learning using Spatio-Temporal Clustering

This paper introduces an automated skill acquisition framework in reinfo...
research
09/05/2022

MO2: Model-Based Offline Options

The ability to discover useful behaviours from past experience and trans...
research
10/30/2017

Eigenoption Discovery through the Deep Successor Representation

Options in reinforcement learning allow agents to hierarchically decompo...
research
01/10/2013

Decision-Theoretic Planning with Concurrent Temporally Extended Actions

We investigate a model for planning under uncertainty with temporallyext...
research
08/06/2021

Temporally Abstract Partial Models

Humans and animals have the ability to reason and make predictions about...
research
02/09/2018

Learning Robust Options

Robust reinforcement learning aims to produce policies that have strong ...

Please sign up or login with your details

Forgot password? Click here to reset