Two Timescale Stochastic Approximation with Controlled Markov noise and Off-policy temporal difference learning

03/31/2015
by   Prasenjit Karmakar, et al.
0

We present for the first time an asymptotic convergence analysis of two time-scale stochastic approximation driven by `controlled' Markov noise. In particular, both the faster and slower recursions have non-additive controlled Markov noise components in addition to martingale difference noise. We analyze the asymptotic behavior of our framework by relating it to limiting differential inclusions in both time-scales that are defined in terms of the ergodic occupation measures associated with the controlled Markov processes. Finally, we present a solution to the off-policy convergence problem for temporal difference learning with linear function approximation, using our results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/23/2015

Stability of Stochastic Approximations with `Controlled Markov' Noise and Temporal Difference Learning

In this paper we present a `stability theorem' for stochastic approximat...
research
10/24/2020

An Adiabatic Theorem for Policy Tracking with TD-learning

We evaluate the ability of temporal difference learning to track the rew...
research
09/01/2023

Controlled Martingale Problems And Their Markov Mimics

In this article we prove under suitable assumptions that the marginals o...
research
05/11/2017

A First Empirical Study of Emphatic Temporal Difference Learning

In this paper we present the first empirical study of the emphatic tempo...
research
07/01/2022

A Stochastic Contraction Mapping Theorem

In this paper we define contractive and nonexpansive properties for adap...
research
11/23/2021

Schedule Based Temporal Difference Algorithms

Learning the value function of a given policy from data samples is an im...
research
10/03/2022

Bias and Extrapolation in Markovian Linear Stochastic Approximation with Constant Stepsizes

We consider Linear Stochastic Approximation (LSA) with a constant stepsi...

Please sign up or login with your details

Forgot password? Click here to reset