Learning, Fast and Slow: A Goal-Directed Memory-Based Approach for Dynamic Environments

01/31/2023
by   John Chong Min Tan, et al.
0

Model-based next state prediction and state value prediction are slow to converge. To address these challenges, we do the following: i) Instead of a neural network, we do model-based planning using a parallel memory retrieval system (which we term the slow mechanism); ii) Instead of learning state values, we guide the agent's actions using goal-directed exploration, by using a neural network to choose the next action given the current state and the goal state (which we term the fast mechanism). The goal-directed exploration is trained online using hippocampal replay of visited states and future imagined states every single time step, leading to fast and efficient training. Empirical studies show that our proposed method has a 92 episodes in a dynamically changing grid world, significantly outperforming state-of-the-art actor critic mechanisms such as PPO (54 (24 that the future of Reinforcement Learning (RL) will be to model goals and sub-goals for various tasks, and plan it out in a goal-directed memory-based approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/25/2018

Floyd-Warshall Reinforcement Learning Learning from Past Experiences to Reach New Goals

Consider mutli-goal tasks that involve static environments and dynamic g...
research
02/21/2022

Goal-directed Planning and Goal Understanding by Active Inference: Evaluation Through Simulated and Physical Robot Experiments

We show that goal-directed action planning and generation in a teleologi...
research
05/01/2022

Learning user-defined sub-goals using memory editing in reinforcement learning

The aim of reinforcement learning (RL) is to allow the agent to achieve ...
research
04/11/2018

DORA The Explorer: Directed Outreaching Reinforcement Action-Selection

Exploration is a fundamental aspect of Reinforcement Learning, typically...
research
03/12/2019

Goal-Directed Behavior under Variational Predictive Coding: Dynamic Organization of Visual Attention and Working Memory

Mental simulation is a critical cognitive function for goal-directed beh...
research
01/04/2019

Accelerating Goal-Directed Reinforcement Learning by Model Characterization

We propose a hybrid approach aimed at improving the sample efficiency in...
research
02/09/2023

Scaling Goal-based Exploration via Pruning Proto-goals

One of the gnarliest challenges in reinforcement learning (RL) is explor...

Please sign up or login with your details

Forgot password? Click here to reset