An L^2 Analysis of Reinforcement Learning in High Dimensions with Kernel and Neural Network Approximation

04/15/2021
by Jihao Long, et al.

Reinforcement learning (RL) algorithms based on high-dimensional function approximation have achieved tremendous empirical success in large-scale problems with an enormous number of states. However, most analyses of such algorithms give rise to error bounds that involve either the number of states or the number of features. This paper considers the situation where the function approximation is made using either the kernel method or the two-layer neural network model, in the context of a fitted Q-iteration algorithm with explicit regularization. We establish an Õ(H^3 |𝒜|^{1/4} n^{-1/4}) bound for the optimal policy with Hn samples, where H is the length of each episode and |𝒜| is the size of the action space. Our analysis hinges on analyzing the L^2 error of the approximated Q-function using n data points. Even though this result still requires a finite-sized action space, the error bound is independent of the dimensionality of the state space.
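
To make the setting concrete, here is a minimal sketch of fitted Q-iteration with kernel ridge regression as the explicitly regularized function approximator. It is an illustration under stated assumptions, not the paper's algorithm: the Gaussian kernel, the ridge parameter `lam`, the uniform-action sampling scheme, and the toy environment `toy_sample` are all hypothetical choices made for this example. It draws n transitions per step (Hn in total) and fits one regressor per action, working backward from the last step.

```python
import math
import random

def rbf(x, y, gamma=1.0):
    # Gaussian kernel between two state tuples (kernel choice is an assumption).
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, y)))

def solve(A, b):
    # Solve A x = b by Gaussian elimination with partial pivoting.
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (M[r][n] - sum(M[r][k] * x[k] for k in range(r + 1, n))) / M[r][r]
    return x

def krr_fit(X, y, lam):
    # Kernel ridge regression: solve (K + n * lam * I) alpha = y.
    n = len(X)
    K = [[rbf(X[i], X[j]) + (n * lam if i == j else 0.0) for j in range(n)]
         for i in range(n)]
    alpha = solve(K, y)
    return lambda x: sum(a * rbf(xi, x) for a, xi in zip(alpha, X))

def fitted_q_iteration(sample, actions, H, n, lam=1e-3):
    # Q[h][a] is a function of the state; Q[H] is identically zero.
    Q = [None] * H + [{a: (lambda s: 0.0) for a in actions}]
    for h in range(H - 1, -1, -1):
        batch = [sample(h) for _ in range(n)]  # n transitions at step h
        Q[h] = {}
        for a in actions:
            rows = [(s, r, s2) for s, b, r, s2 in batch if b == a]
            if not rows:
                Q[h][a] = lambda s: 0.0  # no data for this action
                continue
            X = [s for s, _, _ in rows]
            # Regression target: reward plus greedy value at the next step.
            y = [r + max(Q[h + 1][b](s2) for b in actions) for _, r, s2 in rows]
            Q[h][a] = krr_fit(X, y, lam)
    return Q

def greedy(Q, h, s):
    # Greedy action at step h with respect to the fitted Q-function.
    return max(Q[h], key=lambda a: Q[h][a](s))

def toy_sample(h):
    # Hypothetical environment: state in [0, 1], two actions chosen uniformly;
    # action 1 pays s, action 0 pays 1 - s, and the next state is uniform.
    s = (random.random(),)
    a = random.choice([0, 1])
    r = s[0] if a == 1 else 1.0 - s[0]
    return s, a, r, (random.random(),)
```

Calling `fitted_q_iteration(toy_sample, [0, 1], H=2, n=40)` returns the fitted Q-functions, and `greedy(Q, 0, (0.9,))` then picks an action for the state 0.9 at the first step. The per-action regression is one simple way to handle a finite action space; the sketch makes no attempt to reproduce the constants or rates in the bound.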

research
11/09/2020

On Function Approximation in Reinforcement Learning: Optimism in the Face of Large State Spaces

The classical theory of reinforcement learning (RL) has focused on tabul...
research
12/10/2019

A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation

Q-learning with neural network function approximation (neural Q-learning...
research
10/28/2021

Equivariant Q Learning in Spatial Action Spaces

Recently, a variety of new equivariant neural network model architecture...
research
11/05/2021

Perturbational Complexity by Distribution Mismatch: A Systematic Analysis of Reinforcement Learning in Reproducing Kernel Hilbert Space

Most existing theoretical analysis of reinforcement learning (RL) is lim...
research
03/23/2017

Unsupervised Basis Function Adaptation for Reinforcement Learning

When using reinforcement learning (RL) algorithms to evaluate a policy i...
research
02/20/2023

Reinforcement Learning with Function Approximation: From Linear to Nonlinear

Function approximation has been an indispensable component in modern rei...
research
07/21/2014

Practical Kernel-Based Reinforcement Learning

Kernel-based reinforcement learning (KBRL) stands out among reinforcemen...