Successor Uncertainties: exploration and uncertainty in temporal difference learning

10/15/2018
by   David Janz, et al.
0

We consider the problem of balancing exploration and exploitation in sequential decision making problems. To explore efficiently, it is vital to consider the uncertainty over all consequences of a decision, and not just those that follow immediately; the uncertainties involved need to be propagated according to the dynamics of the problem. To this end, we develop Successor Uncertainties, a probabilistic model for the state-action value function of a Markov Decision Process that propagates uncertainties in a coherent and scalable way. We relate our approach to other classical and contemporary methods for exploration and present an empirical analysis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/06/2013

Direct Uncertainty Estimation in Reinforcement Learning

Optimal probabilistic approach in reinforcement learning is computationa...
research
07/10/2020

Improved Analysis of UCRL2 with Empirical Bernstein Inequality

We consider the problem of exploration-exploitation in communicating Mar...
research
04/04/2021

Active Trajectory Estimation for Partially Observed Markov Decision Processes via Conditional Entropy

In this paper, we consider the problem of controlling a partially observ...
research
11/01/2019

Frequentist Regret Bounds for Randomized Least-Squares Value Iteration

We consider the exploration-exploitation dilemma in finite-horizon reinf...
research
08/19/2020

On velocity and migration structural uncertainties: A new approach using non-linear slope tomography

Evaluating structural uncertainties associated with seismic imaging and ...
research
08/21/2021

Sequential Stochastic Optimization in Separable Learning Environments

We consider a class of sequential decision-making problems under uncerta...
research
08/04/2022

AG2U – Autonomous Grading Under Uncertainties

Surface grading, the process of leveling an uneven area containing pre-d...

Please sign up or login with your details

Forgot password? Click here to reset