Characterizing the Spectrum of the NTK via a Power Series Expansion

11/15/2022
by   Michael Murray, et al.
0

Under mild conditions on the network initialization we derive a power series expansion for the Neural Tangent Kernel (NTK) of arbitrarily deep feedforward networks in the infinite width limit. We provide expressions for the coefficients of this power series which depend on both the Hermite coefficients of the activation function as well as the depth of the network. We observe faster decay of the Hermite coefficients leads to faster decay in the NTK coefficients. Using this series, first we relate the effective rank of the NTK to the effective rank of the input-data Gram. Second, for data drawn uniformly on the sphere we derive an explicit formula for the eigenvalues of the NTK, which shows faster decay in the NTK coefficients implies a faster decay in its spectrum. From this we recover existing results on eigenvalue asymptotics for ReLU networks and comment on how the activation function influences the RKHS. Finally, for generic data and activation functions with sufficiently fast Hermite coefficient decay, we derive an asymptotic upper bound on the spectrum of the NTK.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/28/2020

Trainable Activation Function in Image Classification

In the current research of neural networks, the activation function is m...
research
04/06/2023

Wide neural networks: From non-gaussian random fields at initialization to the NTK geometry of training

Recent developments in applications of artificial neural networks with o...
research
08/16/2019

Effect of Activation Functions on the Training of Overparametrized Neural Nets

It is well-known that overparametrized neural networks trained using gra...
research
10/07/2019

Neural network integral representations with the ReLU activation function

We derive a formula for neural network integral representations on the s...
research
10/19/2021

Generalised Wendland functions for the sphere

In this paper we compute the spherical Fourier expansions coefficients f...
research
05/02/2022

Decay estimate of bivariate Chebyshev coefficients for functions with limited smoothness

We obtain the decay bounds for Chebyshev series coefficients of function...
research
09/10/2021

Adjoint Differentiation for generic matrix functions

We derive a formula for the adjoint A of a square-matrix operation of th...

Please sign up or login with your details

Forgot password? Click here to reset