A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

06/03/2021
by Mingde Zhao, et al.

We present an end-to-end, model-based deep reinforcement learning agent which dynamically attends to relevant parts of its state, in order to plan and to generalize better out-of-distribution. The agent's architecture uses a set representation and a bottleneck mechanism, forcing the number of entities to which the agent attends at each planning step to be small. In experiments with customized MiniGrid environments with different dynamics, we observe that the design allows agents to learn to plan effectively, by attending to the relevant objects, leading to better out-of-distribution generalization.
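
The bottleneck idea in the abstract can be illustrated with a small sketch. This is not the authors' architecture: it assumes, purely for illustration, that each entity in the set is a plain feature vector, that relevance is a dot product with a query vector, and that the bottleneck is a hard top-k selection. The names `select_bottleneck`, `entities`, `query`, and `k` are hypothetical.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def select_bottleneck(entities, query, k):
    """Keep only the k entities most relevant to `query`.

    Assumption (not from the paper): relevance is a dot product and the
    bottleneck is a hard top-k cut. Returns the kept indices in their
    original order and softmax weights over the kept entities only.
    """
    if k < 1 or not entities:
        raise ValueError("need at least one entity and k >= 1")
    scores = [dot(e, query) for e in entities]
    # Rank all entities by score and keep the k best.
    kept = sorted(range(len(entities)), key=lambda i: scores[i], reverse=True)[:k]
    kept.sort()
    # Numerically stable softmax restricted to the kept entities.
    peak = max(scores[i] for i in kept)
    exps = [math.exp(scores[i] - peak) for i in kept]
    total = sum(exps)
    weights = [x / total for x in exps]
    return kept, weights


# Four toy entities; only the two best aligned with the query survive.
entities = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1], [-1.0, 0.0]]
indices, weights = select_bottleneck(entities, query=[1.0, 0.0], k=2)
# indices == [0, 2]
```

Keeping `k` small means any downstream planning step sees only a handful of entities, which is the intuition behind attending to a small number of relevant objects.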
