Tight Bounds on the Smallest Eigenvalue of the Neural Tangent Kernel for Deep ReLU Networks

12/21/2020
by   Quynh Nguyen, et al.
0

A recent line of work has analyzed the theoretical properties of deep neural networks via the Neural Tangent Kernel (NTK). In particular, the smallest eigenvalue of the NTK has been related to memorization capacity, convergence of gradient descent algorithms and generalization of deep nets. However, existing results either provide bounds in the two-layer setting or assume that the spectrum of the NTK is bounded away from 0 for multi-layer networks. In this paper, we provide tight bounds on the smallest eigenvalue of NTK matrices for deep ReLU networks, both in the limiting case of infinite widths and for finite widths. In the finite-width setting, the network architectures we consider are quite general: we require the existence of a wide layer with roughly order of N neurons, N being the number of data samples; and the scaling of the remaining widths is arbitrary (up to logarithmic factors). To obtain our results, we analyze various quantities of independent interest: we give lower bounds on the smallest singular value of feature matrices, and upper bounds on the Lipschitz constant of input-output feature maps.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/09/2019

Some observations on the smallest adjacency eigenvalue of a graph

In this paper, we discuss various connections between the smallest eigen...
research
01/01/2023

Sharper analysis of sparsely activated wide neural networks with trainable biases

This work studies training one-hidden-layer overparameterized ReLU netwo...
research
05/20/2022

Memorization and Optimization in Deep Neural Networks with Minimum Over-parameterization

The Neural Tangent Kernel (NTK) has emerged as a powerful tool to provid...
research
02/01/2017

On the optimality and sharpness of Laguerre's lower bound on the smallest eigenvalue of a symmetric positive definite matrix

Lower bounds on the smallest eigenvalue of a symmetric positive definite...
research
07/20/2021

Single-exponential bounds for the smallest singular value of Vandermonde matrices in the sub-Rayleigh regime

Following recent interest by the community, the scaling of the minimal s...
research
06/26/2022

Bounding the Width of Neural Networks via Coupled Initialization – A Worst Case Analysis

A common method in training neural networks is to initialize all the wei...
research
11/02/2019

Improved bounds on the size of the smallest representation of relation algebra 32_65 with the aid of a SAT solver

In this paper, we shed new light on the spectrum of relation algebra 32_...

Please sign up or login with your details

Forgot password? Click here to reset