Action Grammars: A Cognitive Model for Learning Temporal Abstractions

07/29/2019
by   Robert Tjarko Lange, et al.
3

Hierarchical Reinforcement Learning algorithms have successfully been applied to temporal credit assignment problems with sparse reward signals. However, state-of-the-art algorithms require manual specification of subtask structures, a sample inefficient exploration phase and lack semantic interpretability. Human infants, on the other hand, efficiently detect hierarchical sub-structures induced by their surroundings. In this work we propose a cognitive-inspired Reinforcement Learning architecture which uses grammar induction to identify sub-goal policies. More specifically, by treating an on-policy trajectory as a sentence sampled from the policy-conditioned language of the environment, we identify hierarchical constituents with the help of unsupervised grammatical inference. The resulting set of temporal abstractions is called action grammars (Pastra & Aloimonos, 2012) and can be used to enable efficient imitation, transfer and online learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/23/2018

Hierarchical Approaches for Reinforcement Learning in Parameterized Action Space

We explore Deep Reinforcement Learning in a parameterized action space. ...
research
12/14/2020

Active Hierarchical Imitation and Reinforcement Learning

Humans can leverage hierarchical structures to split a task into sub-tas...
research
03/25/2022

Unsupervised Learning of Temporal Abstractions with Slot-based Transformers

The discovery of reusable sub-routines simplifies decision-making and pl...
research
07/18/2018

Representational efficiency outweighs action efficiency in human program induction

The importance of hierarchically structured representations for tractabl...
research
10/05/2021

Attaining Interpretability in Reinforcement Learning via Hierarchical Primitive Composition

Deep reinforcement learning has shown its effectiveness in various appli...
research
01/18/2020

Tree-Structured Policy based Progressive Reinforcement Learning for Temporally Language Grounding in Video

Temporally language grounding in untrimmed videos is a newly-raised task...
research
04/19/2023

Evolving Constrained Reinforcement Learning Policy

Evolutionary algorithms have been used to evolve a population of actors ...

Please sign up or login with your details

Forgot password? Click here to reset