A Small Gain Analysis of Single Timescale Actor Critic

03/04/2022
by   Alex Olshevsky, et al.
0

We consider a version of actor-critic which uses proportional step-sizes and only one critic update with a single sample from the stationary distribution per actor step. We provide an analysis of this method using the small-gain theorem. Specifically, we prove that this method can be used to find a stationary point, and that the resulting sample complexity improves the state of the art for actor-critic methods to O (μ^-2ϵ^-2) to find an ϵ-approximate stationary point where μ is the condition number associated with the critic.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/18/2022

Finite-time analysis of single-timescale actor-critic

Despite the great empirical success of actor-critic methods, its finite-...
research
07/23/2019

Variance Reduction in Actor Critic Methods (ACM)

After presenting Actor Critic Methods (ACM), we show ACM are control var...
research
05/24/2021

GMAC: A Distributional Perspective on Actor-Critic Framework

In this paper, we devise a distributional framework on actor-critic as a...
research
10/02/2020

A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms

We investigate the discounting mismatch in actor-critic algorithm implem...
research
06/12/2022

Finite-Time Analysis of Fully Decentralized Single-Timescale Actor-Critic

Decentralized Actor-Critic (AC) algorithms have been widely utilized for...
research
07/05/2022

Ask-AC: An Initiative Advisor-in-the-Loop Actor-Critic Framework

Despite the promising results achieved, state-of-the-art interactive rei...
research
08/17/2017

Robust Contextual Bandit via the Capped-ℓ_2 norm

This paper considers the actor-critic contextual bandit for the mobile h...

Please sign up or login with your details

Forgot password? Click here to reset