Uncertainty Maximization in Partially Observable Domains: A Cognitive Perspective

02/22/2021
by Mirza Ramicic, et al.

Faced with the ever-increasing complexity of their application domains, artificial learning agents can now scale up their ability to process the overwhelming amount of information that arises from their interaction with an environment. This scaling comes at a cost, however: agents must encode and process a growing amount of redundant information that does not necessarily benefit the learning process itself. This work exploits the properties of learning systems defined over partially observable domains by selectively focusing on the type of information most likely to express the causal interaction among the transitioning states of the environment. Adaptive masking of the observation space, based on a temporal difference displacement criterion, enabled a significant improvement in the convergence of temporal difference algorithms defined over a partially observable Markov process.
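
As a rough illustration of the idea of masking observations with a temporal-difference signal, the sketch below pairs a TD(0) error over masked features with a simple per-feature relevance score. The abstract does not define the displacement criterion, so the scoring rule here (the TD error magnitude times how much each feature changed across the transition) is an assumption made purely for illustration, as are the names `td_error`, `update_scores`, and `top_k_mask`, the linear value function, and the `rate` and `k` parameters. It should not be read as the authors' method.

```python
def td_error(w, obs, next_obs, reward, gamma, mask):
    """TD(0) error for a linear value function restricted to masked features."""
    v = sum(wi * oi * mi for wi, oi, mi in zip(w, obs, mask))
    v_next = sum(wi * oi * mi for wi, oi, mi in zip(w, next_obs, mask))
    return reward + gamma * v_next - v


def update_scores(scores, obs, next_obs, delta, rate=0.1):
    """Hypothetical relevance score (an assumption, not the paper's criterion):
    a running average of |delta| weighted by each feature's change."""
    return [
        (1.0 - rate) * s + rate * abs(delta) * abs(n - o)
        for s, o, n in zip(scores, obs, next_obs)
    ]


def top_k_mask(scores, k):
    """Binary mask that keeps the k highest-scoring observation features."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    keep = set(order[:k])
    return [1.0 if i in keep else 0.0 for i in range(len(scores))]


# One transition: only the first feature changes, so only it gains relevance.
weights = [0.5, 0.0, 0.0]
obs, next_obs = [1.0, 0.0, 5.0], [0.0, 0.0, 5.0]
delta = td_error(weights, obs, next_obs, reward=1.0, gamma=0.9, mask=[1.0, 1.0, 1.0])
scores = update_scores([0.0, 0.0, 0.0], obs, next_obs, delta)
mask = top_k_mask(scores, k=1)
```

In this toy transition the third feature is large but constant, so it receives no score and is masked out, which mirrors the abstract's point that redundant information need not help learning.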


