Reward Bonuses with Gain Scheduling Inspired by Iterative Deepening Search

12/21/2022
by   Taisuke Kobayashi, et al.
0

This paper introduces a novel method of adding intrinsic bonuses to task-oriented reward function in order to efficiently facilitate reinforcement learning search. While various bonuses have been designed to date, they are analogous to the depth-first and breadth-first search algorithms in graph theory. This paper, therefore, first designs two bonuses for each of them. Then, a heuristic gain scheduling is applied to the designed bonuses, inspired by the iterative deepening search, which is known to inherit the advantages of the two search algorithms. The proposed method is expected to allow agent to efficiently reach the best solution in deeper states by gradually exploring unknown states. In three locomotion tasks with dense rewards and three simple tasks with sparse rewards, it is shown that the two types of bonuses contribute to the performance improvement of the different tasks complementarily. In addition, by combining them with the proposed gain scheduling, all tasks can be accomplished with high performance.

READ FULL TEXT
research
05/01/2021

Markov Rewards Processes with Impulse Rewards and Absorbing States

We study the expected accumulated reward for a discrete-time Markov rewa...
research
10/07/2022

Generative Augmented Flow Networks

The Generative Flow Network is a probabilistic framework where an agent ...
research
08/25/2023

Go Beyond Imagination: Maximizing Episodic Reachability with World Models

Efficient exploration is a challenging topic in reinforcement learning, ...
research
11/17/2020

Curiosity Based Reinforcement Learning on Robot Manufacturing Cell

This paper introduces a novel combination of scheduling control on a fle...
research
12/21/2020

Difference Rewards Policy Gradients

Policy gradient methods have become one of the most popular classes of a...
research
10/22/2022

Knowledge Retrieval With Functional Object-Oriented Networks

In this experiment, three different search algorithms are implemented fo...
research
08/13/2018

Relaxed Schedulers Can Efficiently Parallelize Iterative Algorithms

There has been significant progress in understanding the parallelism inh...

Please sign up or login with your details

Forgot password? Click here to reset