Bayesian Q-learning with Assumed Density Filtering

12/09/2017
by   Heejin Jeong, et al.
1

While off-policy temporal difference methods have been broadly used in reinforcement learning due to their efficiency and simple implementation, their Bayesian counterparts have been relatively understudied. This is mainly because the max operator in the Bellman optimality equation brings non-linearity and inconsistent distributions over value function. In this paper, we introduce a new Bayesian approach to off-policy TD methods using Assumed Density Filtering, called ADFQ, which updates beliefs on action-values (Q) through an online Bayesian inference method. Uncertainty measures in the beliefs not only are used in exploration but they provide a natural regularization in the belief updates. We also present a connection between ADFQ and Q-learning. Our empirical results show the proposed ADFQ algorithms outperform comparing algorithms in several task domains. Moreover, our algorithms improve general drawbacks in BRL such as computational complexity, usage of uncertainty, and nonlinearity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/27/2020

Assumed Density Filtering Q-learning

While off-policy temporal difference (TD) methods have widely been used ...
research
09/14/2016

Bayesian Reinforcement Learning: A Survey

Bayesian methods for machine learning have been widely investigated, yie...
research
04/06/2019

Randomised Bayesian Least-Squares Policy Iteration

We introduce Bayesian least-squares policy iteration (BLSPI), an off-pol...
research
08/01/2020

Bayesian-Assisted Inference from Visualized Data

A Bayesian view of data interpretation suggests that a visualization use...
research
11/29/2017

Efficient exploration with Double Uncertain Value Networks

This paper studies directed exploration for reinforcement learning agent...
research
10/28/2021

Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates

Temporal-Difference (TD) learning methods, such as Q-Learning, have prov...

Please sign up or login with your details

Forgot password? Click here to reset