Finding Options that Minimize Planning Time

10/16/2018
by   Yuu Jinnai, et al.
4

While adding temporally abstract actions, or options, to an agent's action repertoire can often accelerate learning and planning, existing approaches for determining which specific options to add are largely heuristic. We aim to formalize the problem of selecting the optimal set of options for planning, in two contexts: 1) finding the set of k options that minimize the number of value-iteration passes until convergence, and 2) computing the smallest set of options so that planning converges in less than a given maximum of ℓ value-iteration passes. We first show that both problems are NP-hard. We then provide a polynomial-time approximation algorithm for computing the optimal options for tasks with bounded return and goal states. We prove that the algorithm has bounded suboptimality for deterministic tasks. Finally, we empirically evaluate its performance against both the optimal options and a representative collection of heuristic approaches in simple grid-based domains including the classic four rooms problem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/18/2016

Principled Option Learning in Markov Decision Processes

It is well known that options can make planning more efficient, among th...
research
02/17/2020

Pandora's Box Problem with Order Constraints

The Pandora's Box Problem, originally formalized by Weitzman in 1979, mo...
research
01/10/2013

Decision-Theoretic Planning with Concurrent Temporally Extended Actions

We investigate a model for planning under uncertainty with temporallyext...
research
05/25/2022

Toward Discovering Options that Achieve Faster Planning

We propose a new objective for option discovery that emphasizes the comp...
research
02/09/2018

Learning Robust Options

Robust reinforcement learning aims to produce policies that have strong ...
research
12/03/2016

A Matrix Splitting Perspective on Planning with Options

We show that the Bellman operator underlying the options framework leads...
research
02/24/2021

The Logical Options Framework

Learning composable policies for environments with complex rules and tas...

Please sign up or login with your details

Forgot password? Click here to reset