Variational Deep Q Network

11/30/2017
by   Yunhao Tang, et al.
1

We propose a framework that directly tackles the probability distribution of the value function parameters in Deep Q Network (DQN), with powerful variational inference subroutines to approximate the posterior of the parameters. We will establish the equivalence between our proposed surrogate objective and variational inference loss. Our new algorithm achieves efficient exploration and performs well on large scale chain Markov Decision Process (MDP).

READ FULL TEXT

page 7

page 11

research
11/03/2020

Amortized Variational Deep Q Network

Efficient exploration is one of the most important issues in deep reinfo...
research
10/02/2020

MCMC-Interactive Variational Inference

Leveraging well-established MCMC strategies, we propose MCMC-interactive...
research
02/27/2019

Training Variational Autoencoders with Buffered Stochastic Variational Inference

The recognition network in deep latent variable models such as variation...
research
03/11/2021

Variational inference with a quantum computer

Inference is the task of drawing conclusions about unobserved variables ...
research
08/12/2016

Scaling Factorial Hidden Markov Models: Stochastic Variational Inference without Messages

Factorial Hidden Markov Models (FHMMs) are powerful models for sequentia...
research
04/21/2018

Variational Inference In Pachinko Allocation Machines

The Pachinko Allocation Machine (PAM) is a deep topic model that allows ...
research
08/04/2020

Exploring Variational Deep Q Networks

This study provides both analysis and a refined, research-ready implemen...

Please sign up or login with your details

Forgot password? Click here to reset