Dealing with Sparse Rewards in Reinforcement Learning

10/21/2019
by   Joshua Hare, et al.
60

Successfully navigating a complex environment to obtain a desired outcome is a difficult task, that up to recently was believed to be capable only by humans. This perception has been broken down over time, especially with the introduction of deep reinforcement learning, which has greatly increased the difficulty of tasks that can be automated. However, for traditional reinforcement learning agents this requires an environment to be able to provide frequent extrinsic rewards, which are not known or accessible for many real-world environments. This project aims to explore and contrast existing reinforcement learning solutions that circumnavigate the difficulties of an environment that provide sparse rewards. Different reinforcement solutions will be implemented over a several video game environments with varying difficulty and varying frequency of rewards, as to properly investigate the applicability of these solutions. This project introduces a novel reinforcement learning solution, by combining aspects of two existing state of the art sparse reward solutions.

READ FULL TEXT

page 12

page 25

page 26

page 28

page 29

research
03/25/2022

Dealing with Sparse Rewards Using Graph Neural Networks

Deep reinforcement learning in partially observable environments is a di...
research
01/01/2020

Reinforcement Learning with Goal-Distance Gradient

Reinforcement learning usually uses the feedback rewards of environmenta...
research
11/23/2022

Actively Learning Costly Reward Functions for Reinforcement Learning

Transfer of recent advances in deep reinforcement learning to real-world...
research
02/10/2019

A Bandit Framework for Optimal Selection of Reinforcement Learning Agents

Deep Reinforcement Learning has been shown to be very successful in comp...
research
02/15/2020

The Archimedean trap: Why traditional reinforcement learning will probably not yield AGI

After generalizing the Archimedean property of real numbers in such a wa...
research
05/25/2017

Cross-Domain Perceptual Reward Functions

In reinforcement learning, we often define goals by specifying rewards w...
research
02/11/2019

Stochastic Reinforcement Learning

In reinforcement learning episodes, the rewards and punishments are ofte...

Please sign up or login with your details

Forgot password? Click here to reset