A Tractable Algorithm For Finite-Horizon Continuous Reinforcement Learning

06/26/2019
by Phanideep Gampa, et al.

We consider the finite-horizon continuous reinforcement learning problem. Our contribution is three-fold. First, we give a tractable algorithm for the problem based on optimistic value iteration. Second, we give a lower bound on regret of order Ω(T^{2/3}) for any algorithm that discretizes the state space, improving the previous regret bound of Ω(T^{1/2}) of Ortner and Ryabko for the same problem. Third, under the assumption that the rewards and transitions are Hölder continuous, we show that the discretization error is bounded above by const · L n^{-α} T. Finally, we give some simple experiments to validate our propositions.
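
To make the idea of optimistic value iteration over a discretized state space concrete, here is a minimal Python sketch. It is not the authors' algorithm: it assumes a one-dimensional state space [0, 1] split into n equal-width intervals, rewards in [0, 1], and a generic count-based bonus of the form bonus_scale / sqrt(count). The names `discretize`, `optimistic_value_iteration`, and `bonus_scale` are hypothetical and chosen only for illustration.

```python
import numpy as np


def discretize(state, n):
    """Map a state in [0, 1] to the index of one of n equal-width intervals."""
    return min(int(state * n), n - 1)


def optimistic_value_iteration(r_hat, p_hat, counts, horizon, bonus_scale=1.0):
    """Backward induction with a count-based exploration bonus (illustrative).

    r_hat:  (n, A) empirical mean rewards, assumed to lie in [0, 1]
    p_hat:  (n, A, n) empirical transition probabilities between intervals
    counts: (n, A) visit counts for each (interval, action) pair
    Returns Q of shape (horizon, n, A) and the greedy policy of shape (horizon, n).
    """
    n, num_actions = r_hat.shape
    # Assumed bonus form: shrinks as a pair is visited more often.
    bonus = bonus_scale / np.sqrt(np.maximum(counts, 1))
    q = np.zeros((horizon, n, num_actions))
    v_next = np.zeros(n)  # value after the final step is zero
    for h in range(horizon - 1, -1, -1):
        # Optimistic Bellman backup, clipped at the largest achievable return
        # from step h onward (horizon - h, since rewards are at most 1).
        q[h] = np.minimum(r_hat + bonus + p_hat @ v_next, horizon - h)
        v_next = q[h].max(axis=1)
    policy = q.argmax(axis=2)
    return q, policy


# Tiny usage example: 2 intervals, 2 actions, every action keeps the state fixed.
r_hat = np.array([[0.0, 1.0], [0.5, 0.0]])
p_hat = np.zeros((2, 2, 2))
p_hat[0, :, 0] = 1.0
p_hat[1, :, 1] = 1.0
counts = np.full((2, 2), 100)

q_plain, policy_plain = optimistic_value_iteration(
    r_hat, p_hat, counts, horizon=3, bonus_scale=0.0
)
q_opt, _ = optimistic_value_iteration(
    r_hat, p_hat, counts, horizon=3, bonus_scale=1.0
)
```

With the bonus switched off, the routine reduces to ordinary finite-horizon value iteration on the discretized model; with a positive bonus, every Q-value is at least as large as its bonus-free counterpart, which is the sense in which the estimate is optimistic.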
