State Space Decomposition and Subgoal Creation for Transfer in Deep Reinforcement Learning

05/24/2017
by   Himanshu Sahni, et al.
0

Typical reinforcement learning (RL) agents learn to complete tasks specified by reward functions tailored to their domain. As such, the policies they learn do not generalize even to similar domains. To address this issue, we develop a framework through which a deep RL agent learns to generalize policies from smaller, simpler domains to more complex ones using a recurrent attention mechanism. The task is presented to the agent as an image and an instruction specifying the goal. This meta-controller guides the agent towards its goal by designing a sequence of smaller subtasks on the part of the state space within the attention, effectively decomposing it. As a baseline, we consider a setup without attention as well. Our experiments show that the meta-controller learns to create subgoals within the attention.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/22/2019

State2vec: Off-Policy Successor Features Approximators

A major challenge in reinforcement learning (RL) is the design of agents...
research
12/15/2021

Feature-Attending Recurrent Modules for Generalization in Reinforcement Learning

Deep reinforcement learning (Deep RL) has recently seen significant prog...
research
06/02/2023

Tackling Unbounded State Spaces in Continuing Task Reinforcement Learning

While deep reinforcement learning (RL) algorithms have been successfully...
research
12/24/2021

On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks

In this effort we consider a reinforcement learning (RL) technique for s...
research
02/08/2022

Local Explanations for Reinforcement Learning

Many works in explainable AI have focused on explaining black-box classi...
research
06/06/2019

Towards Interpretable Reinforcement Learning Using Attention Augmented Agents

Inspired by recent work in attention models for image captioning and que...
research
11/19/2019

Attention Privileged Reinforcement Learning For Domain Transfer

Applying reinforcement learning (RL) to physical systems presents notabl...

Please sign up or login with your details

Forgot password? Click here to reset