Convergence of Deep Convolutional Neural Networks

09/28/2021
by Yuesheng Xu, et al.

Convergence of deep neural networks as the depth of the networks tends to infinity is fundamental in building the mathematical foundation for deep learning. In a previous study, we investigated this question for deep ReLU networks with a fixed width. That study does not cover the important class of convolutional neural networks, whose widths increase from layer to layer. For this reason, we first study convergence of general ReLU networks with increasing widths and then apply the results to deep convolutional neural networks. It turns out that convergence of such networks reduces to convergence of infinite products of matrices with increasing sizes, a problem that has not been considered in the literature. We establish sufficient conditions for convergence of such infinite products of matrices. Based on these conditions, we present sufficient conditions for piecewise convergence of general deep ReLU networks with increasing widths, as well as pointwise convergence of deep ReLU convolutional neural networks.
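To make the link between ReLU networks and matrix products concrete, here is a minimal sketch, not the paper's construction. It relies only on the standard fact that, at a fixed input, each ReLU acts as a 0/1 diagonal matrix, so a forward pass can be written with products of the form D_n W_n ... D_1 W_1. The widths, the random weights, and the function names are hypothetical choices made for illustration.

```python
import numpy as np

def relu_forward(weights, biases, x):
    """Plain forward pass: ReLU after every affine layer."""
    for W, b in zip(weights, biases):
        x = np.maximum(W @ x + b, 0.0)
    return x

def relu_forward_as_products(weights, biases, x):
    """Same pass, with each ReLU written as a 0/1 diagonal matrix D_i.

    Returns the output and the accumulated product D_n W_n ... D_1 W_1,
    whose row count grows with the layer widths.
    """
    product = np.eye(x.shape[0])
    for W, b in zip(weights, biases):
        pre = W @ x + b
        # Activation pattern at this particular input (assumption: the
        # pattern is recomputed per input, so the product is input-dependent).
        D = np.diag((pre > 0).astype(float))
        x = D @ pre
        product = D @ W @ product
    return x, product

rng = np.random.default_rng(0)
widths = [2, 3, 5, 8]  # hypothetical widths, increasing layer by layer
weights = [rng.standard_normal((m, n)) for n, m in zip(widths[:-1], widths[1:])]
biases = [rng.standard_normal(m) for m in widths[1:]]
x0 = rng.standard_normal(widths[0])

y_plain = relu_forward(weights, biases, x0)
y_prod, P = relu_forward_as_products(weights, biases, x0)
```

With zero biases the output equals `P @ x0` exactly, which is why the behavior of such products as the number of factors grows governs the behavior of the network at that input.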

research
05/13/2022

Convergence of Deep Neural Networks with General Activation Functions and Pooling

Deep neural networks, as a powerful system to represent high dimensional...
research
07/27/2021

Convergence of Deep ReLU Networks

We explore convergence of deep neural networks with the popular ReLU act...
research
11/20/2020

A global universality of two-layer neural networks with ReLU activations

In the present study, we investigate a universality of neural networks, ...
research
05/13/2022

Convergence Analysis of Deep Residual Networks

Various powerful deep neural network architectures have made great contr...
research
06/15/2020

Globally Injective ReLU Networks

We study injective ReLU neural networks. Injectivity plays an important ...
research
02/07/2022

Neural Tangent Kernel Analysis of Deep Narrow Neural Networks

The tremendous recent progress in analyzing the training dynamics of ove...
research
12/14/2018

Products of Many Large Random Matrices and Gradients in Deep Neural Networks

We study products of random matrices in the regime where the number of t...
