A Concentration Bound for LSPE(λ)

11/04/2021
by Vivek S. Borkar, et al.

The popular LSPE(λ) algorithm for policy evaluation is revisited to derive a concentration bound that gives high-probability performance guarantees from some time on.

research
10/09/2022

A Concentration Bound for Distributed Stochastic Approximation

We revisit the classical model of Tsitsiklis, Bertsekas and Athans for d...
research
11/20/2019

Sparse random tensors: concentration, regularization and applications

We prove a non-asymptotic concentration inequality of sparse inhomogeneo...
research
07/03/2023

Fitting an ellipsoid to a quadratic number of random points

We consider the problem (P) of fitting n standard Gaussian random vector...
research
04/20/2020

Estimating Ising Models from One Sample

Given one sample X ∈ {±1}^n from an Ising model Pr[X=x] ∝ exp(x^⊤ J x/2), whose ...
research
01/03/2020

On the definition of a concentration function relevant to the ROC curve

This is a reader's reaction to a recent paper by E. Schechtman and G. Sc...
research
05/27/2020

An Ambient-Physical System to Infer Concentration in Open-plan Workplace

One of the core challenges in open-plan workspaces is to ensure a good l...
research
06/12/2020

Concentration Bounds for the Collision Estimator

We prove a strong concentration result about the natural collision estim...
