Finite Horizon Q-learning: Stability, Convergence and Simulations

10/27/2021
by Vivek VP, et al.

Q-learning is a popular reinforcement learning algorithm. However, it has been studied and analysed mainly in the infinite horizon setting, while several important applications can be modeled as finite horizon Markov decision processes (MDPs). We develop a version of the Q-learning algorithm for finite horizon MDPs and provide a full proof of its stability and convergence. Our analysis of the stability and convergence of finite horizon Q-learning is based entirely on the ordinary differential equation (O.D.E.) method. We also demonstrate the performance of our algorithm in a random MDP setting.
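To make the finite horizon setting concrete, the following is a minimal illustrative sketch and not the paper's algorithm. It assumes a tabular Q-function indexed by stage, a zero terminal reward, deterministic rewards, epsilon-greedy exploration and 1/n step sizes. The function names (make_random_mdp, backward_induction, finite_horizon_q_learning) and all parameter values are hypothetical choices made for illustration.

```python
import numpy as np


def make_random_mdp(n_states, n_actions, rng):
    """Hypothetical random MDP: random transition rows and rewards in [0, 1)."""
    P = rng.random((n_states, n_actions, n_states))
    P /= P.sum(axis=2, keepdims=True)  # each P[s, a] is a probability vector
    R = rng.random((n_states, n_actions))
    return P, R


def backward_induction(P, R, horizon):
    """Exact stage-wise optimal Q-values via dynamic programming (reference)."""
    n_states, n_actions = R.shape
    Q = np.zeros((horizon, n_states, n_actions))
    V_next = np.zeros(n_states)  # assumption: zero terminal reward
    for h in range(horizon - 1, -1, -1):
        Q[h] = R + P @ V_next
        V_next = Q[h].max(axis=1)
    return Q


def finite_horizon_q_learning(P, R, horizon, n_episodes, rng, epsilon=0.5):
    """Sample-based Q-learning with one Q-table per stage h = 0..horizon-1."""
    n_states, n_actions = R.shape
    Q = np.zeros((horizon, n_states, n_actions))
    visits = np.zeros((horizon, n_states, n_actions))
    for _ in range(n_episodes):
        s = int(rng.integers(n_states))  # assumption: uniform start state
        for h in range(horizon):
            # epsilon-greedy action selection at stage h
            if rng.random() < epsilon:
                a = int(rng.integers(n_actions))
            else:
                a = int(Q[h, s].argmax())
            s_next = int(rng.choice(n_states, p=P[s, a]))
            # the target bootstraps from the NEXT stage's table, or 0 at the end
            future = Q[h + 1, s_next].max() if h + 1 < horizon else 0.0
            visits[h, s, a] += 1
            alpha = 1.0 / visits[h, s, a]  # assumption: 1/n step size
            Q[h, s, a] += alpha * (R[s, a] + future - Q[h, s, a])
            s = s_next
    return Q


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    P, R = make_random_mdp(n_states=4, n_actions=2, rng=rng)
    Q_star = backward_induction(P, R, horizon=3)
    Q_hat = finite_horizon_q_learning(P, R, horizon=3, n_episodes=5000, rng=rng)
    print("max abs error:", np.abs(Q_hat - Q_star).max())
```

The stage index is the key difference from the infinite horizon case: the value of a state-action pair depends on how many steps remain, so each stage keeps its own table and no discount factor is needed for the values to stay bounded.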

research
07/11/2012

Metrics for Finite Markov Decision Processes

We present metrics for measuring the similarity of states in a finite Ma...
research
06/17/2019

Of Cores: A Partial-Exploration Framework for Markov Decision Processes

We introduce a framework for approximate analysis of Markov decision pro...
research
06/07/2023

Convergence of SARSA with linear function approximation: The random horizon case

The reinforcement learning algorithm SARSA combined with linear function...
research
05/27/2019

Temporal-difference learning for nonlinear value function approximation in the lazy training regime

We discuss the approximation of the value function for infinite-horizon ...
research
08/31/2022

Partial Counterfactual Identification for Infinite Horizon Partially Observable Markov Decision Process

This paper investigates the problem of bounding possible output from a c...
research
09/26/2013

Approximation of Lorenz-Optimal Solutions in Multiobjective Markov Decision Processes

This paper is devoted to fair optimization in Multiobjective Markov Deci...
research
02/23/2020

Periodic Q-Learning

The use of target networks is a common practice in deep reinforcement le...
