Information is Power: Intrinsic Control via Information Capture

12/07/2021
by Nicholas Rhinehart, et al.

Humans and animals explore their environment and acquire useful skills even in the absence of clear goals, exhibiting intrinsic motivation. The study of intrinsic motivation in artificial agents is concerned with the following question: what is a good general-purpose objective for an agent? We study this question in dynamic, partially-observed environments, and argue that a compact and general learning objective is to minimize the entropy of the agent's state visitation, estimated using a latent state-space model. This objective induces an agent both to gather information about its environment, which corresponds to reducing uncertainty, and to gain control over its environment, which corresponds to reducing the unpredictability of future world states. We instantiate this approach as a deep reinforcement learning agent equipped with a deep variational Bayes filter. We find that, without extrinsic reward, our agent learns to discover, represent, and exercise control of dynamic objects in a variety of partially-observed environments sensed through visual observations.
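
To make the objective concrete, here is a toy sketch of why low state-visitation entropy requires both information and control. It is not the paper's implementation: it assumes a small discrete latent space with hand-written belief vectors in place of a deep variational Bayes filter, and the function names and the three example agents are hypothetical.

```python
import math


def entropy(p):
    """Shannon entropy (in nats) of a discrete distribution."""
    return sum(-x * math.log(x) for x in p if x > 0.0)


def visitation_entropy(beliefs):
    """Entropy of the time-averaged belief over discrete latent states.

    `beliefs` is a list of per-step belief vectors, each summing to 1.
    Averaging them over time is this sketch's stand-in for the agent's
    estimated state visitation distribution (an assumption of the toy).
    """
    n_steps = len(beliefs)
    n_states = len(beliefs[0])
    visitation = [sum(b[s] for b in beliefs) / n_steps for s in range(n_states)]
    return entropy(visitation)


# Hypothetical agent A: never resolves its uncertainty (uniform beliefs).
uninformed = [[0.25, 0.25, 0.25, 0.25]] * 4

# Hypothetical agent B: certain at every step, but the world keeps changing.
informed_no_control = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
]

# Hypothetical agent C: certain, and holds the world in a single state.
informed_with_control = [[1.0, 0.0, 0.0, 0.0]] * 4

h_uninformed = visitation_entropy(uninformed)
h_no_control = visitation_entropy(informed_no_control)
h_control = visitation_entropy(informed_with_control)

# An intrinsic reward in this toy setting could be the negative entropy.
print(h_uninformed, h_no_control, h_control)
```

In this toy setting, agents A and B both reach the maximum entropy for four states, while only agent C, which is both informed and in control, drives the entropy to zero.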

research
06/22/2020

Learning with AMIGo: Adversarially Motivated Intrinsic Goals

A key challenge for reinforcement learning (RL) consists of learning in ...
research
07/12/2021

Explore and Control with Adversarial Surprise

Reinforcement learning (RL) provides a framework for learning goal-direc...
research
08/05/2020

Learning Power Control from a Fixed Batch of Data

We address how to exploit power control data, gathered from a monitored ...
research
06/19/2019

Control What You Can: Intrinsically Motivated Task-Planning Agent

We present a novel intrinsically motivated agent that learns how to cont...
research
04/10/2020

Learning to Visually Navigate in Photorealistic Environments Without any Supervision

Learning to navigate in a realistic setting where an agent must rely sol...
research
12/29/2022

Intrinsic Motivation in Dynamical Control Systems

Biological systems often choose actions without an explicit reward signa...
research
04/25/2018

Generative Temporal Models with Spatial Memory for Partially Observed Environments

In model-based reinforcement learning, generative and temporal models of...