Convergence of Stochastic Approximation via Martingale and Converse Lyapunov Methods

05/03/2022
by   M. Vidyasagar, et al.
0

This paper is dedicated to Prof. Eduardo Sontag on the occasion of his seventieth birthday. In this paper, we build upon the ideas first proposed in Gladyshev (1965) to develop a very general framework for proving the almost sure boundedness and the convergence of stochastic approximation algorithms. These ideas are based on martingale methods and are in some ways simpler than convergence proofs based on the ODE method, e.g., Borkar-Meyn (2000). First we study the original version of the SA algorithm introduced in Robbins-Monro (1951), where the objective is to determine a zero of a function, when only noisy measurements of the function are available. The proof makes use of the general framework developed here, together with a new theorem on converse Lyapunov stability, which might be of independent interest. Next we study an alternate version of SA, first introduced in Kiefer-Wolfowitz (1952). The objective here is to find a stationary point of a scalar-valued function, using first-order differences to approximate its gradient. This problem is analyzed in Blum (1954), but with a very opaque proof. We reproduce Blum's conclusions using the proposed framework.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/12/2022

Formalization of a Stochastic Approximation Theorem

Stochastic approximation algorithms are iterative procedures which are u...
research
02/02/2023

Convergent First-Order Methods for Bi-level Optimization and Stackelberg Games

We propose an algorithm to solve a class of bi-level optimization proble...
research
02/23/2023

Unified Convergence Theory of Stochastic and Variance-Reduced Cubic Newton Methods

We study the widely known Cubic-Newton method in the stochastic setting ...
research
12/07/2020

Stochastic optimization with momentum: convergence, fluctuations, and traps avoidance

In this paper, a general stochastic optimization procedure is studied, u...
research
01/30/2019

A New Proof of Nonsignalling Multiprover Parallel Repetition Theorem

We present an information theoretic proof of the nonsignalling multiprov...
research
06/26/2018

A General Framework for Anytime Approximation in Probabilistic Databases

Anytime approximation algorithms for computing query probabilities over ...
research
06/28/2018

Successive Convex Approximation Algorithms for Sparse Signal Estimation with Nonconvex Regularizations

In this paper, we propose a successive convex approximation framework fo...

Please sign up or login with your details

Forgot password? Click here to reset