Skill Machines: Temporal Logic Composition in Reinforcement Learning

05/25/2022
by   Geraud Nangue Tasse, et al.
0

A major challenge in reinforcement learning is specifying tasks in a manner that is both interpretable and verifiable. One common approach is to specify tasks through reward machines – finite state machines that encode the task to be solved. We introduce skill machines, a representation that can be learned directly from these reward machines that encode the solution to such tasks. We propose a framework where an agent first learns a set of base skills in a reward-free setting, and then combines these skills with the learned skill machine to produce composite behaviours specified by any regular language, such as linear temporal logics. This provides the agent with the ability to map from complex logical task specifications to near-optimal behaviours zero-shot. We demonstrate our approach in both a tabular and high-dimensional video game environment, where an agent is faced with several of these complex, long-horizon tasks. Our results indicate that the agent is capable of satisfying extremely complex task specifications, producing near optimal performance with no further learning. Finally, we demonstrate that the performance of skill machines can be improved with regular offline reinforcement learning algorithms when optimal behaviours are desired.

READ FULL TEXT
research
12/20/2017

Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning

Learning policies for complex tasks that require multiple different skil...
research
06/23/2023

Offline Skill Graph (OSG): A Framework for Learning and Planning using Offline Reinforcement Learning Skills

Reinforcement Learning has received wide interest due to its success in ...
research
10/14/2022

Skill-Based Reinforcement Learning with Intrinsic Reward Matching

While unsupervised skill discovery has shown promise in autonomously acq...
research
11/30/2017

Learning to Compose Skills

We present a differentiable framework capable of learning a wide variety...
research
10/31/2017

Automata Guided Hierarchical Reinforcement Learning for Zero-shot Skill Composition

An obstacle that prevents the wide adoption of (deep) reinforcement lear...
research
01/06/2020

A Boolean Task Algebra for Reinforcement Learning

We propose a framework for defining a Boolean algebra over the space of ...
research
11/10/2020

Continual Learning of Control Primitives: Skill Discovery via Reset-Games

Reinforcement learning has the potential to automate the acquisition of ...

Please sign up or login with your details

Forgot password? Click here to reset