Reinforcement Learning with Partially Known World Dynamics

12/12/2012
by Christian R. Shelton, et al.

Reinforcement learning would enjoy better success on real-world problems if modelers could impart domain knowledge to the algorithm. Most problems have both hidden state and unknown dynamics. Partially observable Markov decision processes (POMDPs) allow for the modeling of both. Unfortunately, they do not provide a natural framework in which to specify knowledge about the domain dynamics. The designer must either admit to knowing nothing about the dynamics or completely specify the dynamics (thereby turning it into a planning problem). We propose a new framework called a partially known Markov decision process (PKMDP), which allows the designer to specify known dynamics while still leaving portions of the environment's dynamics unknown. The model represents not only the environment dynamics but also the agent's knowledge of the dynamics. We present a reinforcement learning algorithm for this model based on importance sampling. The algorithm incorporates planning based on the known dynamics and learning about the unknown dynamics. Our results demonstrate the ability to add domain knowledge and the resulting benefits for learning.
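
The abstract does not spell out the PKMDP algorithm, so the following is only a minimal sketch of the general importance-sampling idea it builds on, not the authors' method. It estimates the expected return of a target policy from trajectories gathered under a different behavior policy. Because both policies act in the same environment, the transition probabilities cancel in the likelihood ratio, so the estimate needs no model of the unknown dynamics. The function name, the trajectory layout, and the policy signature are all assumptions made for illustration.

```python
def importance_sampled_return(trajectories, target_policy, behavior_policy):
    """Estimate the target policy's expected return from off-policy data.

    Assumed (hypothetical) data layout:
      trajectories: list of trajectories, each a list of
                    (observation, action, reward) tuples
      target_policy, behavior_policy: callables mapping
                    (observation, action) -> probability of that action
    """
    total = 0.0
    for trajectory in trajectories:
        weight = 1.0       # likelihood ratio of the whole trajectory
        trajectory_return = 0.0
        for observation, action, reward in trajectory:
            # Only the policy terms appear: the environment's transition
            # probabilities are identical under both policies and cancel.
            weight *= (target_policy(observation, action)
                       / behavior_policy(observation, action))
            trajectory_return += reward
        total += weight * trajectory_return
    # Ordinary (unnormalized) importance-sampling average.
    return total / len(trajectories)


# Toy one-step problem: action 1 pays reward 1, action 0 pays nothing.
trajectories = [[("s", 0, 0.0)], [("s", 1, 1.0)]]
behavior = lambda observation, action: 0.5                      # uniform
target = lambda observation, action: 1.0 if action == 1 else 0.0
estimate = importance_sampled_return(trajectories, target, behavior)
```

In this toy case the target policy always takes the rewarding action, and the reweighted data recovers its expected return even though the data came from a uniform policy. The behavior policy must assign nonzero probability to every action that appears in the data, otherwise the ratio is undefined.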
