Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives

06/25/2019
by   Anirudh Goyal, et al.
4

Reinforcement learning agents that operate in diverse and complex environments can benefit from the structured decomposition of their behavior. Often, this is addressed in the context of hierarchical reinforcement learning, where the aim is to decompose a policy into lower-level primitives or options, and a higher-level meta-policy that triggers the appropriate behaviors for a given situation. However, the meta-policy must still produce appropriate decisions in all states. In this work, we propose a policy design that decomposes into primitives, similarly to hierarchical reinforcement learning, but without a high-level meta-policy. Instead, each primitive can decide for themselves whether they wish to act in the current state. We use an information-theoretic mechanism for enabling this decentralized decision: each primitive chooses how much information it needs about the current state to make a decision and the primitive that requests the most information about the current state acts in the world. The primitives are regularized to use as little information as possible, which leads to natural competition and specialization. We experimentally demonstrate that this policy architecture improves over both flat and hierarchical policies in terms of generalization.

READ FULL TEXT

page 16

page 20

research
04/07/2023

CRISP: Curriculum inducing Primitive Informed Subgoal Prediction for Hierarchical Reinforcement Learning

Hierarchical reinforcement learning is a promising approach that uses te...
research
10/07/2021

Augmenting Reinforcement Learning with Behavior Primitives for Diverse Manipulation Tasks

Realistic manipulation tasks require a robot to interact with an environ...
research
09/30/2019

Efficient meta reinforcement learning via meta goal generation

Meta reinforcement learning (meta-RL) is able to accelerate the acquisit...
research
02/06/2020

Temporal-adaptive Hierarchical Reinforcement Learning

Hierarchical reinforcement learning (HRL) helps address large-scale and ...
research
03/04/2019

Model Primitive Hierarchical Lifelong Reinforcement Learning

Learning interpretable and transferable subpolicies and performing task ...
research
03/04/2021

Toward Robust Long Range Policy Transfer

Humans can master a new task within a few trials by drawing upon skills ...
research
01/14/2023

Desbordante: from benchmarking suite to high-performance science-intensive data profiler (preprint)

Pioneering data profiling systems such as Metanome and OpenClean brought...

Please sign up or login with your details

Forgot password? Click here to reset