Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning

08/08/2021
by Pratik Ramprasad, et al.

The recent emergence of reinforcement learning (RL) has created a demand for robust statistical inference methods for the parameter estimates computed by RL algorithms. Existing methods for statistical inference in online learning are restricted to settings involving independently sampled observations, while existing statistical inference methods in RL are limited to the batch setting. The online bootstrap is a flexible and efficient approach for statistical inference in linear stochastic approximation algorithms, but its efficacy in settings involving Markov noise, such as RL, has yet to be explored. In this paper, we study the use of the online bootstrap method for statistical inference in RL. In particular, we focus on the temporal difference (TD) learning and Gradient TD (GTD) learning algorithms, which are themselves special instances of linear stochastic approximation under Markov noise. The method is shown to be distributionally consistent for statistical inference in policy evaluation, and numerical experiments demonstrate its effectiveness at statistical inference tasks across a range of real RL environments.
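To make the idea concrete, the following is a minimal sketch of an online multiplier bootstrap wrapped around TD(0), not the authors' implementation. Everything specific in it is an assumption made for illustration: the function name `online_bootstrap_td`, the five-state ring random walk with reward 1 in state 0, one-hot features, exponential multiplier weights with mean 1 and variance 1, the step size `0.5 / t**0.6`, and the percentile interval built from the averaged bootstrap iterates.

```python
import numpy as np


def online_bootstrap_td(n_steps=20000, n_boot=100, gamma=0.9, seed=0):
    """Sketch: TD(0) with iterate averaging plus randomly perturbed copies.

    Returns the averaged value estimate and a 95% percentile interval
    per state, computed from the bootstrap replicates.
    """
    rng = np.random.default_rng(seed)
    n_states = 5
    phi = np.eye(n_states)  # one-hot features (assumption)

    theta = np.zeros(n_states)                # main TD iterate
    theta_bar = np.zeros(n_states)            # running average of theta
    theta_b = np.zeros((n_boot, n_states))    # perturbed iterates
    theta_b_bar = np.zeros((n_boot, n_states))

    s = 0
    for t in range(1, n_steps + 1):
        # Hypothetical Markov reward process: random walk on a ring.
        s_next = (s + rng.choice([-1, 1])) % n_states
        r = 1.0 if s == 0 else 0.0
        alpha = 0.5 / t ** 0.6  # decaying step size (assumption)
        x, x_next = phi[s], phi[s_next]

        # Standard TD(0) update on the main iterate.
        delta = r + gamma * theta @ x_next - theta @ x
        theta += alpha * delta * x

        # Each replicate sees the same transition, but its update is
        # scaled by an independent random weight with mean 1, variance 1.
        w = rng.exponential(1.0, size=n_boot)
        delta_b = r + gamma * theta_b @ x_next - theta_b @ x
        theta_b += alpha * (w * delta_b)[:, None] * x

        # Running averages, updated online without storing history.
        theta_bar += (theta - theta_bar) / t
        theta_b_bar += (theta_b - theta_b_bar) / t
        s = s_next

    lower, upper = np.quantile(theta_b_bar, [0.025, 0.975], axis=0)
    return theta_bar, lower, upper


if __name__ == "__main__":
    est, lower, upper = online_bootstrap_td()
    for i in range(len(est)):
        print(f"state {i}: {est[i]:.3f}  [{lower[i]:.3f}, {upper[i]:.3f}]")
```

The point of the design is that all replicates are updated from the same single stream of transitions, so no data needs to be stored or resampled; the spread of the perturbed, averaged iterates serves as a proxy for the sampling variability of the main estimate.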


