Stability of Stochastic Approximations with 'Controlled Markov' Noise and Temporal Difference Learning

04/23/2015
by Arunselvan Ramaswamy, et al.
In this paper we present a 'stability theorem' for stochastic approximation (SA) algorithms with 'controlled Markov' noise. Such algorithms were first studied by Borkar in 2006. Specifically, we present sufficient conditions that guarantee the stability of the iterates. Further, under these conditions the iterates are shown to track a solution to the differential inclusion defined in terms of the ergodic occupation measures associated with the 'controlled Markov' process. As an application of our main result, we present an improvement to a general form of temporal difference learning algorithms: sufficient conditions for their stability and convergence, derived using our framework. This paper builds on the works of Borkar as well as Benveniste, Metivier and Priouret.
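To make the setting concrete, the following is a minimal sketch of a stochastic approximation iterate driven by Markov noise: TD(0) with linear function approximation on a two-state chain. It is an illustration only, not the paper's algorithm or its conditions. The chain, rewards, features, discount factor, step-size exponent, and the function name `td0_linear` are all assumptions chosen for the example, and the chain here is uncontrolled (there is no control process influencing the transitions).

```python
import random


def td0_linear(num_steps=50_000, gamma=0.5, seed=0):
    """Run TD(0) with linear features on a toy two-state Markov chain.

    Every quantity below is a hypothetical choice for illustration.
    """
    rng = random.Random(seed)
    features = [[1.0, 0.0], [0.0, 1.0]]  # one-hot features (tabular case)
    rewards = [1.0, 0.0]                 # reward received in each state
    theta = [0.0, 0.0]                   # the SA iterate
    state = 0
    for n in range(num_steps):
        # Markov noise: the next state is uniform over {0, 1}.
        next_state = rng.randrange(2)
        phi = features[state]
        phi_next = features[next_state]
        value = sum(t * p for t, p in zip(theta, phi))
        value_next = sum(t * p for t, p in zip(theta, phi_next))
        td_error = rewards[state] + gamma * value_next - value
        # Step sizes with divergent sum and convergent sum of squares.
        step = 1.0 / (n + 1) ** 0.6
        theta = [t + step * td_error * p for t, p in zip(theta, phi)]
        state = next_state
    return theta


if __name__ == "__main__":
    # The fixed point solves V = r + gamma * P * V, which for this toy
    # chain is V = (1.5, 0.5); the iterate should settle near it.
    print(td0_linear())
```

In this toy case the iterates stay bounded because the discount factor is below one and the step sizes never exceed one. Results of the kind described in the abstract aim to guarantee such boundedness under more general conditions.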
