Learning to Act and Observe in Partially Observable Domains

09/13/2021
by Thomas Bolander, et al.

We consider a learning agent in a partially observable environment with which it has never interacted before. The agent must learn both what it can observe and how its actions affect the environment, and it does so from experience gathered by taking actions in the domain and observing their results. We present learning algorithms capable of learning as much as possible (in a well-defined sense) both about what is directly observable and about what actions do in the domain, given the learner's observational constraints. We distinguish the level of domain knowledge attained by each algorithm, and characterize the type of observations required to reach it. The algorithms use dynamic epistemic logic (DEL) to represent the learned domain information symbolically. Our work continues that of Bolander and Gierasimczuk (2015), which developed DEL-based algorithms for learning domain information in fully observable domains.


