Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning

11/15/2019
by Cameron Voloshin, et al.

Off-policy policy evaluation (OPE) is the problem of estimating the online performance of a policy using only pre-collected historical data generated by another policy. Given the increasing interest in deploying learning-based methods for safety-critical applications, many OPE methods have recently been proposed. Because experimental conditions differ across the recent literature, the relative performance of current OPE methods is not well understood. In this work, we present the first comprehensive empirical analysis of a broad suite of OPE methods. Based on thousands of experiments and detailed empirical analyses, we offer a summarized set of guidelines for effectively using OPE in practice, and suggest directions for future research.
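
To make the problem statement concrete, the sketch below shows one classical way to estimate a policy's value from data logged by a different policy: trajectory-wise importance sampling. It is an illustration only and is not taken from the abstract, which does not name the methods studied. The function name, the `(state, action, reward)` trajectory layout, and the `pi_e(action, state)` / `pi_b(action, state)` probability callables are all assumptions made for this example.

```python
from typing import Callable, List, Tuple

# Hypothetical data layout: each logged step is (state, action, reward).
Step = Tuple[int, int, float]
Trajectory = List[Step]
# Hypothetical policy interface: probability of taking `action` in `state`.
Policy = Callable[[int, int], float]


def importance_sampling_estimate(
    trajectories: List[Trajectory],
    pi_e: Policy,   # evaluation policy whose value we want
    pi_b: Policy,   # behavior policy that generated the data
    gamma: float = 1.0,
) -> float:
    """Average of importance-weighted discounted returns over logged trajectories."""
    total = 0.0
    for trajectory in trajectories:
        weight = 1.0      # product of per-step probability ratios
        ret = 0.0         # discounted return of this trajectory
        discount = 1.0
        for state, action, reward in trajectory:
            # Assumes pi_b gives nonzero probability to every logged action.
            weight *= pi_e(action, state) / pi_b(action, state)
            ret += discount * reward
            discount *= gamma
        total += weight * ret
    return total / len(trajectories)


# Toy one-step problem: a single state, two actions, reward equals the action.
def behavior(action: int, state: int) -> float:
    return 0.5                       # uniform over the two actions


def evaluation(action: int, state: int) -> float:
    return 1.0 if action == 1 else 0.0   # always picks action 1


logged = [[(0, 0, 0.0)], [(0, 1, 1.0)]]
estimate = importance_sampling_estimate(logged, evaluation, behavior)
```

In this toy setting the evaluation policy always earns reward 1, and the estimate computed from the two logged trajectories matches that value even though the data came from a uniform behavior policy. The weights in such an estimator can grow very large on long trajectories, which is one reason the relative behavior of different OPE estimators is an empirical question.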
