Options of Interest: Temporal Abstraction with Interest Functions

01/01/2020
by   Khimya Khetarpal, et al.
14

Temporal abstraction refers to the ability of an agent to use behaviours of controllers which act for a limited, variable amount of time. The options framework describes such behaviours as consisting of a subset of states in which they can initiate, an internal policy and a stochastic termination condition. However, much of the subsequent work on option discovery has ignored the initiation set, because of difficulty in learning it from data. We provide a generalization of initiation sets suitable for general function approximation, by defining an interest function associated with an option. We derive a gradient-based learning algorithm for interest functions, leading to a new interest-option-critic architecture. We investigate how interest functions can be leveraged to learn interpretable and reusable temporal abstractions. We demonstrate the efficacy of the proposed approach through quantitative and qualitative results, in both discrete and continuous environments.

READ FULL TEXT

page 4

page 5

page 6

page 7

research
09/16/2016

The Option-Critic Architecture

Temporal abstraction is key to scaling up learning and planning in reinf...
research
07/30/2020

Data-efficient Hindsight Off-policy Option Learning

Solutions to most complex tasks can be decomposed into simpler, intermed...
research
11/04/2020

Diversity-Enriched Option-Critic

Temporal abstraction allows reinforcement learning agents to represent k...
research
02/26/2019

The Termination Critic

In this work, we consider the problem of autonomously discovering behavi...
research
12/06/2022

Variable-Decision Frequency Option Critic

In classic reinforcement learning algorithms, agents make decisions at d...
research
12/04/2018

Natural Option Critic

The recently proposed option-critic architecture Bacon et al. provide a ...
research
11/22/2016

Variational Intrinsic Control

In this paper we introduce a new unsupervised reinforcement learning met...

Please sign up or login with your details

Forgot password? Click here to reset