Eigenvalue Distribution of Large Random Matrices Arising in Deep Neural Networks: Orthogonal Case

by Leonid Pastur, et al.

This paper deals with the distribution of singular values of the input-output Jacobian of deep untrained neural networks in the limit of infinite width. The Jacobian is a product of random matrices in which independent rectangular weight matrices alternate with diagonal matrices whose entries depend on the corresponding column of the nearest-neighbor weight matrix. The problem was considered in <cit.> for Gaussian weights and biases, and also for weights that are Haar-distributed orthogonal matrices with Gaussian biases. Based on a free probability argument, it was claimed that in these cases the singular value distribution of the Jacobian in the limit of infinite width (matrix size) coincides with that of an analog of the Jacobian with special random but weight-independent diagonal matrices, a case well known in random matrix theory. The claim was rigorously proved in <cit.> for a quite general class of weights and biases with i.i.d. (including Gaussian) entries by using a version of the techniques of random matrix theory. In this paper we use another version of these techniques to justify the claim for random Haar-distributed weight matrices and Gaussian biases.
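The structure of the Jacobian described above can be illustrated numerically. The following is a minimal sketch, not the paper's method: it assumes square n × n weight matrices, a tanh activation, a bias scale `sigma_b`, and Haar sampling via QR decomposition of a Gaussian matrix (a standard technique). The names `haar_orthogonal` and `jacobian_singular_values` are hypothetical helpers introduced here for illustration.

```python
import numpy as np

def haar_orthogonal(n, rng):
    """Sample an n x n Haar-distributed orthogonal matrix (QR with sign fix)."""
    z = rng.standard_normal((n, n))
    q, r = np.linalg.qr(z)
    # Multiply each column by the sign of the matching diagonal entry of R,
    # which removes the sign ambiguity of QR and yields the Haar measure.
    return q * np.sign(np.diag(r))

def jacobian_singular_values(n, depth, phi, dphi, sigma_b, rng):
    """Singular values of the input-output Jacobian of a random network.

    Assumed layer rule (illustration only): x_l = phi(W_l x_{l-1} + b_l),
    so the Jacobian is the product of the factors D_l W_l, where
    D_l = diag(phi'(W_l x_{l-1} + b_l)) depends on the weights.
    """
    x = rng.standard_normal(n)      # random input vector (assumption)
    jac = np.eye(n)
    for _ in range(depth):
        w = haar_orthogonal(n, rng)
        b = sigma_b * rng.standard_normal(n)  # Gaussian biases
        pre = w @ x + b
        d = dphi(pre)                         # diagonal of D_l
        jac = (d[:, None] * w) @ jac          # left-multiply by D_l W_l
        x = phi(pre)
    return np.linalg.svd(jac, compute_uv=False)

rng = np.random.default_rng(0)
sv_tanh = jacobian_singular_values(
    200, 3, np.tanh, lambda y: 1.0 - np.tanh(y) ** 2, 0.5, rng
)
# Sanity case: with a linear activation every D_l is the identity, so the
# Jacobian is a product of orthogonal matrices.
sv_linear = jacobian_singular_values(200, 3, lambda y: y, np.ones_like, 0.5, rng)
```

In the linear case all singular values equal 1 up to rounding, which gives a quick check on the sampler. A histogram of `sv_tanh ** 2` at increasing `n` gives an empirical view of the distribution whose infinite-width limit the abstract discusses.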




On Random Matrices Arising in Deep Neural Networks: General I.I.D. Case

We study the distribution of singular values of product of random matric...

On the validity of kernel approximations for orthogonally-initialized neural networks

In this note we extend kernel function approximation results for neural ...

Random Toeplitz Matrices: The Condition Number under High Stochastic Dependence

In this paper, we study the condition number of a random Toeplitz matrix...

Random matrix analysis of deep neural network weight matrices

Neural networks have been used successfully in a variety of fields, whic...

Eigenvalue distribution of nonlinear models of random matrices

This paper is concerned with the asymptotic empirical eigenvalue distrib...

Correlated Weights in Infinite Limits of Deep Convolutional Neural Networks

Infinite width limits of deep neural networks often have tractable forms...
