Bayesian Transfer Reinforcement Learning with Prior Knowledge Rules

09/30/2018
by   Michalis K. Titsias, et al.
0

We propose a probabilistic framework to directly insert prior knowledge in reinforcement learning (RL) algorithms by defining the behaviour policy as a Bayesian posterior distribution. Such a posterior combines task specific information with prior knowledge, thus allowing to achieve transfer learning across tasks. The resulting method is flexible and it can be easily incorporated to any standard off-policy and on-policy algorithms, such as those based on temporal differences and policy gradients. We develop a specific instance of this Bayesian transfer RL framework by expressing prior knowledge as general deterministic rules that can be useful in a large variety of tasks, such as navigation tasks. Also, we elaborate more on recent probabilistic and entropy-regularised RL by developing a novel temporal learning algorithm and show how to combine it with Bayesian transfer RL. Finally, we demonstrate our method for solving mazes and show that significant speed ups can be obtained.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/19/2020

Efficient Deep Reinforcement Learning through Policy Transfer

Transfer Learning (TL) has shown great potential to accelerate Reinforce...
research
02/18/2020

KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge

Reinforcement learning agents usually learn from scratch, which requires...
research
09/20/2022

Soft Action Priors: Towards Robust Policy Transfer

Despite success in many challenging problems, reinforcement learning (RL...
research
07/04/2012

Learning Bayesian Network Parameters with Prior Knowledge about Context-Specific Qualitative Influences

We present a method for learning the parameters of a Bayesian network wi...
research
06/15/2019

Injecting Prior Knowledge for Transfer Learning into Reinforcement Learning Algorithms using Logic Tensor Networks

Human ability at solving complex tasks is helped by priors on object and...
research
12/24/2018

Moment Matching Training for Neural Machine Translation: A Preliminary Study

In previous works, neural sequence models have been shown to improve sig...
research
07/10/2020

Pre-trained Word Embeddings for Goal-conditional Transfer Learning in Reinforcement Learning

Reinforcement learning (RL) algorithms typically start tabula rasa, with...

Please sign up or login with your details

Forgot password? Click here to reset