On Lower Bounds for Regret in Reinforcement Learning

08/09/2016
by Ian Osband, et al.

This is a brief technical note to clarify the state of lower bounds on regret for reinforcement learning. In particular, this paper:
- Reproduces a lower bound on regret for reinforcement learning, similar to the result of Theorem 5 in the journal UCRL2 paper (Jaksch et al. 2010).
- Clarifies that the proposed proof of Theorem 6 in the REGAL paper (Bartlett and Tewari 2009) does not hold using the standard techniques without further work. We suggest that this result should instead be considered a conjecture, as it has no rigorous proof.
- Suggests that the conjectured lower bound given by (Bartlett and Tewari 2009) is incorrect and that, in fact, it is possible to improve the scaling of the upper bound to match the weaker lower bounds presented in this paper.
We hope that this note serves to clarify existing results in the field of reinforcement learning and provides interesting motivation for future work.


