Learning user-defined sub-goals using memory editing in reinforcement learning

05/01/2022
by GyeongTaek Lee, et al.

The aim of reinforcement learning (RL) is to allow an agent to achieve a final goal. Most RL studies have focused on improving learning efficiency so that the final goal is reached faster. However, it is very difficult to modify the intermediate route that an RL agent takes on the way to the final goal. That is, existing studies do not allow the agent to be controlled so that it achieves other sub-goals. If the agent could pass through sub-goals on the way to its destination, RL could be applied and studied in a wider range of fields. In this study, I propose a methodology that uses memory editing to achieve user-defined sub-goals as well as the final goal. Memory editing is performed to generate various sub-goals and to give an additional reward to the agent. In addition, the sub-goals are learned separately from the final goal. I set up two simple test environments with various scenarios. As a result, the agent, under control, largely succeeded in passing through the sub-goals as well as reaching the final goal. Moreover, the agent could be indirectly induced to visit novel states in the environments. I expect that this methodology can be used in fields that need to control the agent in a variety of scenarios.
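The abstract does not spell out how memory editing works, so the following is only a minimal sketch of one possible reading: a stored trajectory is copied, cut at a state the agent actually visited, relabelled so that this state becomes the sub-goal, and given an extra reward on the step that reaches it. The names `Transition`, `edit_memory`, and `bonus`, the grid-coordinate state type, and the index-based choice of sub-goal are all assumptions made for illustration, not the paper's implementation.

```python
from dataclasses import dataclass, replace
from typing import List, Tuple

State = Tuple[int, int]  # assumed: a grid position (row, column)


@dataclass(frozen=True)
class Transition:
    state: State
    action: int
    reward: float
    next_state: State
    goal: State   # the goal this transition is labelled with
    done: bool


def edit_memory(trajectory: List[Transition], subgoal_index: int,
                bonus: float = 1.0) -> List[Transition]:
    """Return an edited copy of the trajectory up to `subgoal_index`.

    The state reached at `subgoal_index` is treated as the sub-goal.
    Every kept transition is relabelled with that sub-goal, and the
    transition at `subgoal_index` receives `bonus` and is marked done.
    The original trajectory is left unchanged.
    """
    subgoal = trajectory[subgoal_index].next_state
    edited = []
    for i, t in enumerate(trajectory[:subgoal_index + 1]):
        reached = i == subgoal_index
        edited.append(replace(
            t,
            goal=subgoal,
            reward=t.reward + (bonus if reached else 0.0),
            done=reached,
        ))
    return edited


# Hypothetical usage: a three-step path toward the final goal (0, 3).
final_goal = (0, 3)
trajectory = [
    Transition((0, 0), 1, 0.0, (0, 1), final_goal, False),
    Transition((0, 1), 1, 0.0, (0, 2), final_goal, False),
    Transition((0, 2), 1, 1.0, (0, 3), final_goal, True),
]
# Treat the state reached after the second step as a sub-goal.
subgoal_memory = edit_memory(trajectory, subgoal_index=1)
```

In this sketch the edited transitions would be kept in a memory separate from the original ones, in keeping with the statement that sub-goals are learned separately from the final goal; how the two are actually combined during training is not specified in the abstract.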


