Categorical semantics of compositional reinforcement learning

08/29/2022
by Georgios Bakirtzis, et al.

Reinforcement learning (RL) often requires decomposing a problem into subtasks and composing learned behaviors on these tasks. Compositionality in RL has the potential to create modular subtask units that interface with other system capabilities. However, generating compositional models requires characterizing the minimal assumptions under which the compositional feature is robust. We develop a framework for a compositional theory of RL using a categorical point of view. Given the categorical representation of compositionality, we investigate sufficient conditions under which learning-by-parts results in the same optimal policy as learning on the whole. In particular, our approach introduces a category 𝖬𝖣𝖯, whose objects are Markov decision processes (MDPs) acting as models of tasks. We show that 𝖬𝖣𝖯 admits natural compositional operations, such as certain fiber products and pushouts. These operations make explicit compositional phenomena in RL and unify existing constructions, such as puncturing hazardous states in composite MDPs and incorporating state-action symmetry. We also model sequential task completion by introducing the language of zig-zag diagrams, an immediate application of the pushout operation in 𝖬𝖣𝖯.
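To make the idea of puncturing hazardous states concrete, the following is a minimal sketch in Python, not the construction from the paper. It assumes a finite MDP stored as plain dictionaries and reads "puncturing" as deleting the hazardous states together with every action that can reach one with positive probability. The class `MDP`, the function `puncture`, and the toy task are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass
class MDP:
    """A finite MDP (hypothetical encoding, assumed for this sketch)."""
    states: frozenset
    actions: dict      # state -> set of actions available in that state
    transition: dict   # (state, action) -> {next_state: probability}
    reward: dict       # (state, action) -> float


def puncture(mdp, hazardous):
    """Return the sub-MDP without hazardous states.

    An action is kept in a safe state only if every outcome with positive
    probability is itself a safe state, so the kept transition
    distributions still sum to one.
    """
    safe = mdp.states - frozenset(hazardous)
    actions = {}
    for s in safe:
        actions[s] = {
            a for a in mdp.actions[s]
            if all(t in safe
                   for t, p in mdp.transition[(s, a)].items() if p > 0)
        }
    transition = {(s, a): dict(mdp.transition[(s, a)])
                  for s in safe for a in actions[s]}
    reward = {(s, a): mdp.reward[(s, a)]
              for s in safe for a in actions[s]}
    return MDP(safe, actions, transition, reward)


# Toy task: from "start", one action is safe, one is certainly hazardous,
# and one is hazardous with probability 0.1.
task = MDP(
    states=frozenset({"start", "pit", "goal"}),
    actions={"start": {"left", "right", "risky"},
             "pit": {"stay"},
             "goal": {"stay"}},
    transition={
        ("start", "left"): {"pit": 1.0},
        ("start", "right"): {"goal": 1.0},
        ("start", "risky"): {"goal": 0.9, "pit": 0.1},
        ("pit", "stay"): {"pit": 1.0},
        ("goal", "stay"): {"goal": 1.0},
    },
    reward={
        ("start", "left"): -1.0,
        ("start", "right"): 1.0,
        ("start", "risky"): 0.5,
        ("pit", "stay"): -1.0,
        ("goal", "stay"): 0.0,
    },
)

punctured = puncture(task, {"pit"})
```

In this toy task only the action `"right"` survives in `"start"`, because both `"left"` and `"risky"` can land in the removed state. Dropping such actions entirely, rather than renormalizing their distributions, is a design choice of this sketch and may differ from the categorical construction in the full text.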
