Episodic Reinforcement Learning in Finite MDPs: Minimax Lower Bounds Revisited

10/07/2020
by   Omar Darwiche Domingues, et al.
0

In this paper, we propose new problem-independent lower bounds on the sample complexity and regret in episodic MDPs, with a particular focus on the non-stationary case in which the transition kernel is allowed to change in each stage of the episode. Our main contribution is a novel lower bound of Ω((H^3SA/ϵ^2)log(1/δ)) on the sample complexity of an (ε,δ)-PAC algorithm for best policy identification in a non-stationary MDP. This lower bound relies on a construction of "hard MDPs" which is different from the ones previously used in the literature. Using this same class of MDPs, we also provide a rigorous proof of the Ω(√(H^3SAT)) regret bound for non-stationary MDPs. Finally, we discuss connections to PAC-MDP lower bounds.

READ FULL TEXT
research
01/11/2023

Adversarial Online Multi-Task Reinforcement Learning

We consider the adversarial online multi-task reinforcement learning set...
research
03/25/2019

Sample Complexity Lower Bounds for Linear System Identification

This paper establishes problem-specific sample complexity lower bounds f...
research
11/14/2022

Offline Estimation of Controlled Markov Chains: Minimax Nonparametric Estimators and Sample Efficiency

Controlled Markov chains (CMCs) form the bedrock for model-based reinfor...
research
06/13/2021

Towards Tight Bounds on the Sample Complexity of Average-reward MDPs

We prove new upper and lower bounds for sample complexity of finding an ...
research
06/03/2018

Exploration in Structured Reinforcement Learning

We address reinforcement learning problems with finite state and action ...
research
10/11/2022

Multi-User Reinforcement Learning with Low Rank Rewards

In this work, we consider the problem of collaborative multi-user reinfo...
research
12/15/2019

On the I/O complexity of hybrid algorithms for Integer Multiplication

Almost asymptotically tight lower bounds are derived for the I/O complex...

Please sign up or login with your details

Forgot password? Click here to reset