Go Beyond Imagination: Maximizing Episodic Reachability with World Models

08/25/2023
by   Yao Fu, et al.
0

Efficient exploration is a challenging topic in reinforcement learning, especially for sparse reward tasks. To deal with the reward sparsity, people commonly apply intrinsic rewards to motivate agents to explore the state space efficiently. In this paper, we introduce a new intrinsic reward design called GoBI - Go Beyond Imagination, which combines the traditional lifelong novelty motivation with an episodic intrinsic reward that is designed to maximize the stepwise reachability expansion. More specifically, we apply learned world models to generate predicted future states with random actions. States with more unique predictions that are not in episodic memory are assigned high intrinsic rewards. Our method greatly outperforms previous state-of-the-art methods on 12 of the most challenging Minigrid navigation tasks and improves the sample efficiency on locomotion tasks from DeepMind Control Suite.

READ FULL TEXT

page 2

page 4

page 5

page 15

research
02/21/2023

Curiosity-driven Exploration in Sparse-reward Multi-agent Reinforcement Learning

Sparsity of rewards while applying a deep reinforcement learning method ...
research
05/20/2021

Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness

Sparse rewards are double-edged training signals in reinforcement learni...
research
11/28/2022

CIM: Constrained Intrinsic Motivation for Sparse-Reward Continuous Control

Intrinsic motivation is a promising exploration technique for solving re...
research
08/24/2022

Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration

The sparsity of extrinsic rewards poses a serious challenge for reinforc...
research
10/02/2018

EMI: Exploration with Mutual Information Maximizing State and Action Embeddings

Policy optimization struggles when the reward feedback signal is very sp...
research
12/21/2022

Reward Bonuses with Gain Scheduling Inspired by Iterative Deepening Search

This paper introduces a novel method of adding intrinsic bonuses to task...
research
11/16/2016

Reinforcement Learning with Unsupervised Auxiliary Tasks

Deep reinforcement learning agents have achieved state-of-the-art result...

Please sign up or login with your details

Forgot password? Click here to reset