Using State Predictions for Value Regularization in Curiosity Driven Deep Reinforcement Learning

09/30/2018
by   Gino Brunner, et al.
0

Learning in sparse reward settings remains a challenge in Reinforcement Learning, which is often addressed by using intrinsic rewards. One promising strategy is inspired by human curiosity, requiring the agent to learn to predict the future. In this paper a curiosity-driven agent is extended to use these predictions directly for training. To achieve this, the agent predicts the value function of the next state at any point in time. Subsequently, the consistency of this prediction with the current value function is measured, which is then used as a regularization term in the loss function of the algorithm. Experiments were made on grid-world environments as well as on a 3D navigation task, both with sparse rewards. In the first case the extended agent is able to learn significantly faster than the baselines.

READ FULL TEXT
research
03/06/2017

Neural Episodic Control

Deep reinforcement learning methods attain super-human performance in a ...
research
12/19/2013

Avoiding Confusion between Predictors and Inhibitors in Value Function Approximation

In reinforcement learning, the goal is to seek rewards and avoid punishm...
research
06/08/2016

Deep Successor Reinforcement Learning

Learning robust value functions given raw observations and rewards is no...
research
05/18/2022

World Value Functions: Knowledge Representation for Multitask Reinforcement Learning

An open problem in artificial intelligence is how to learn and represent...
research
02/14/2021

Sparse Attention Guided Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning

Training deep reinforcement learning agents on environments with multipl...
research
02/13/2023

Improving robot navigation in crowded environments using intrinsic rewards

Autonomous navigation in crowded environments is an open problem with ma...
research
05/25/2020

Dynamic Value Estimation for Single-Task Multi-Scene Reinforcement Learning

Training deep reinforcement learning agents on environments with multipl...

Please sign up or login with your details

Forgot password? Click here to reset