Asymptotics of Reinforcement Learning with Neural Networks

11/13/2019
by Justin Sirignano, et al.

We prove that a single-layer neural network trained with the Q-learning algorithm converges in distribution to a random ordinary differential equation as the size of the model and the number of training steps become large. Analysis of the limit differential equation shows that it has a unique stationary solution which is the solution of the Bellman equation, thus giving the optimal control for the problem. In addition, we study the convergence of the limit differential equation to the stationary solution. As a by-product of our analysis, we obtain the limiting behavior of single-layer neural networks when trained on i.i.d. data with stochastic gradient descent under the widely used Xavier initialization.
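To make the setting concrete, here is a minimal sketch of Q-learning with a single-hidden-layer network. It is an illustration under stated assumptions, not the paper's construction: the 1/sqrt(N) output scaling (a Xavier-style normalization), the alpha/N learning rate, the tanh activation, the initial weight distributions, and the names `init_params`, `q_value`, and `q_learning_step` are all chosen here for illustration, since the abstract does not specify them.

```python
import math
import random


def init_params(n_hidden, dim, rng):
    """Draw i.i.d. mean-zero parameters (assumed distributions, for illustration)."""
    c = [rng.uniform(-1.0, 1.0) for _ in range(n_hidden)]
    w = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_hidden)]
    return c, w


def q_value(c, w, z):
    """Q(z) = (1/sqrt(N)) * sum_i c_i * tanh(w_i . z), z encoding a state-action pair."""
    n = len(c)
    total = 0.0
    for ci, wi in zip(c, w):
        pre = sum(wij * zj for wij, zj in zip(wi, z))
        total += ci * math.tanh(pre)
    return total / math.sqrt(n)


def q_learning_step(c, w, z, reward, next_zs, gamma, alpha):
    """One semi-gradient Q-learning update, in place; returns the TD error.

    next_zs holds the encodings of (next state, action) for every action;
    an empty list is treated as a terminal transition.
    """
    n = len(c)
    next_q = max(q_value(c, w, nz) for nz in next_zs) if next_zs else 0.0
    td_error = reward + gamma * next_q - q_value(c, w, z)
    # Assumed learning rate alpha / N; the extra 1/sqrt(N) comes from grad Q.
    scale = (alpha / n) * td_error / math.sqrt(n)
    for i in range(n):
        pre = sum(wij * zj for wij, zj in zip(w[i], z))
        h = math.tanh(pre)
        grad_w_factor = c[i] * (1.0 - h * h)  # uses c_i before it is updated
        c[i] += scale * h
        for j in range(len(z)):
            w[i][j] += scale * grad_w_factor * z[j]
    return td_error


if __name__ == "__main__":
    rng = random.Random(0)
    c, w = init_params(n_hidden=50, dim=3, rng=rng)
    z = [1.0, 0.0, 1.0]  # hypothetical encoding of one state-action pair
    for _ in range(500):
        err = q_learning_step(c, w, z, reward=1.0, next_zs=[], gamma=0.9, alpha=1.0)
    print("final TD error:", err)
```

With the alpha/N step size, each update moves the network output by an amount of order 1/N, which is why on the order of N training steps are needed for a visible change. This matches the regime in which the abstract lets the model size and the number of training steps grow together.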


