Finite-Time Error Analysis of Asynchronous Q-Learning with Discrete-Time Switching System Models

02/17/2021
by Donghwan Lee, et al.

This paper develops a novel framework for analyzing the convergence of the Q-learning algorithm through its connections to dynamical systems. We prove that asynchronous Q-learning with a constant step-size can be naturally formulated as a discrete-time stochastic switched linear system. Moreover, the evolution of the Q-learning estimation error is over- and underestimated by the trajectories of two dynamical systems. Based on these two systems, we give a new finite-time analysis of Q-learning together with a finite-time error bound. The analysis offers intuitive insights into Q-learning that rest mainly on control-theoretic frameworks. By bridging the gap between the two domains in a synergistic way, this approach can potentially facilitate further progress in each field.
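
For readers unfamiliar with the algorithm the abstract refers to, the following is a minimal sketch of tabular asynchronous Q-learning with a constant step-size. It is not taken from the paper: the function names `async_q_learning` and `optimal_q`, the toy MDP, the parameter values, and the choice to sample state-action pairs i.i.d. uniformly are all assumptions made for illustration. "Asynchronous" here means that each iteration updates only the single table entry that was sampled.

```python
import numpy as np

def async_q_learning(P, R, gamma=0.9, alpha=0.1, steps=20000, seed=0):
    """Tabular asynchronous Q-learning with a constant step-size alpha.

    P: transition probabilities, shape (S, A, S).
    R: deterministic rewards, shape (S, A).
    Assumption (illustrative): (s, a) pairs are drawn i.i.d. uniformly.
    """
    rng = np.random.default_rng(seed)
    S, A, _ = P.shape
    Q = np.zeros((S, A))
    for _ in range(steps):
        s = rng.integers(S)
        a = rng.integers(A)
        s_next = rng.choice(S, p=P[s, a])
        # Temporal-difference error for the sampled transition.
        td_error = R[s, a] + gamma * Q[s_next].max() - Q[s, a]
        # Only the sampled entry changes: this is the asynchronous update.
        Q[s, a] += alpha * td_error
    return Q

def optimal_q(P, R, gamma=0.9, iters=1000):
    """Reference Q* computed by value iteration on the known model."""
    Q = np.zeros(R.shape)
    for _ in range(iters):
        Q = R + gamma * (P @ Q.max(axis=1))
    return Q

# Hypothetical 2-state, 2-action MDP used only for this demonstration.
P_demo = np.array([[[0.9, 0.1], [0.2, 0.8]],
                   [[0.5, 0.5], [0.1, 0.9]]])
R_demo = np.array([[1.0, 0.0],
                   [0.0, 2.0]])

Q_hat = async_q_learning(P_demo, R_demo)
Q_star = optimal_q(P_demo, R_demo)
print("max estimation error:", np.abs(Q_hat - Q_star).max())
```

The printed quantity is the max-norm estimation error, the kind of error the abstract's finite-time bound concerns. With a constant step-size and random transitions, this error generally does not vanish but keeps fluctuating around a small value.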

research
07/25/2022

Finite-Time Analysis of Asynchronous Q-learning under Diminishing Step-Size from Control-Theoretic View

Q-learning has long been one of the most popular reinforcement learning ...
research
11/15/2022

A Theory for Discrete-time Boolean Finite Dynamical Systems with Uncertainty

Dynamical Systems is a field that studies the collective behavior of obj...
research
02/27/2023

Polynomial-delay generation of functional digraphs up to isomorphism

We describe a procedure for the generation of functional digraphs up to ...
research
10/01/2020

A Finite Memory Interacting Pólya Contagion Network and its Approximating Dynamical Systems

We introduce a new model for contagion spread using a network of interac...
research
01/26/2012

Discrete and fuzzy dynamical genetic programming in the XCSF learning classifier system

A number of representation schemes have been presented for use within le...
research
10/20/2022

Factorisation in the semiring of finite dynamical systems

Finite dynamical systems (FDSs) are commonly used to model systems with ...
research
11/02/2017

Learning Linear Dynamical Systems via Spectral Filtering

We present an efficient and practical algorithm for the online predictio...
