Combined Model for Partially-Observable and Non-Observable Task Switching: Solving Hierarchical Reinforcement Learning Problems

11/23/2019
by Nibraas Khan, et al.

An integral function of fully autonomous robots and humans is the ability to focus attention on a few relevant percepts to reach a certain goal while disregarding irrelevant percepts. Humans and animals rely on the interactions between the prefrontal cortex and the basal ganglia to achieve this focus, which is known as working memory. The working memory toolkit (WMtk) was developed for autonomous systems based on a computational neuroscience model of this phenomenon, using temporal difference learning. Recent adaptations of the toolkit either use abstract task representations to solve non-observable tasks or store past input features to solve partially-observable tasks, but not both. We propose a new model, PONOWMtk, which combines both approaches to solve complex tasks with both Partially-Observable (PO) and Non-Observable (NO) components. The model learns when to store relevant cues in working memory as well as when to switch from one task representation to another based on external feedback. Our experiments show that PONOWMtk performs effectively on tasks that exhibit PO properties, NO properties, or both.
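
To make the idea of learning when to store a cue more concrete, the following is a minimal sketch of temporal difference learning applied to a memory gate. It is not the WMtk or PONOWMtk implementation: the two-step cue task, the tabular value store, and the names `td_update`, `train`, `ANSWERS`, and `GATES` are all assumptions made for illustration. It covers only the partially-observable part (storing a cue that later disappears) and does not attempt the task-switching part.

```python
import random
from collections import defaultdict

ANSWERS = ["A", "B"]
GATES = [True, False]  # True = store the cue in working memory (hypothetical gate)


def td_update(q, key, target, alpha):
    """Move q[key] a fraction alpha toward the TD target; return the TD error."""
    td_error = target - q[key]
    q[key] += alpha * td_error
    return td_error


def train(episodes=20000, alpha=0.05, gamma=0.9, seed=0):
    """Tabular Q-learning on a toy two-step task with a disappearing cue."""
    rng = random.Random(seed)
    q = defaultdict(float)
    for _ in range(episodes):
        cue = rng.choice(ANSWERS)
        # Step 0: the cue is visible. Actions are chosen uniformly at random,
        # which is fine here because Q-learning is off-policy.
        gate = rng.choice(GATES)
        memory = cue if gate else None
        # Step 1: the cue is gone; the agent sees only its memory contents.
        answer = rng.choice(ANSWERS)
        reward = 1.0 if answer == cue else 0.0
        # Terminal step: the target is just the reward.
        td_update(q, ("answer", memory, answer), reward, alpha)
        # Gate step: bootstrap from the best answer value given the memory.
        best_next = max(q[("answer", memory, a)] for a in ANSWERS)
        td_update(q, ("gate", cue, gate), gamma * best_next, alpha)
    return q


if __name__ == "__main__":
    q = train()
    for cue in ANSWERS:
        print(cue, "store:", round(q[("gate", cue, True)], 2),
              "ignore:", round(q[("gate", cue, False)], 2))
```

In this toy setting the learned value of storing the cue ends up higher than the value of ignoring it, because an empty memory leaves the agent guessing at the answer step. A greedy policy over these values would therefore keep the cue in memory.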
