Boundary between noise and information applied to filtering neural network weight matrices

06/08/2022
by Max Staats, et al.

Deep neural networks have been successfully applied to a broad range of problems where overparametrization yields weight matrices which are partially random. A comparison of weight matrix singular vectors to the Porter-Thomas distribution suggests that there is a boundary between randomness and learned information in the singular value spectrum. Inspired by this finding, we introduce an algorithm for noise filtering, which both removes small singular values and reduces the magnitude of large singular values to counteract the effect of level repulsion between the noise and the information part of the spectrum. For networks trained in the presence of label noise, we indeed find that the generalization performance improves significantly due to noise filtering.
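
The filtering idea described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract gives neither the rule for locating the boundary between noise and information nor the formula used to reduce the large singular values. The function name `filter_weight_matrix`, the `cutoff` parameter, and the default shrinkage rule are therefore hypothetical placeholders, chosen only to show the two steps (dropping small singular values, then reducing the remaining ones).

```python
import numpy as np


def filter_weight_matrix(W, cutoff, shrink=None):
    """Hypothetical sketch of singular-value noise filtering.

    W      : 2-D weight matrix.
    cutoff : assumed boundary between the noise part and the information
             part of the singular value spectrum (user-supplied here).
    shrink : optional function applied to the retained singular values.
    """
    # Thin SVD: W = U @ diag(s) @ Vt, with s sorted in descending order.
    U, s, Vt = np.linalg.svd(W, full_matrices=False)

    if shrink is None:
        # Placeholder shrinkage (an assumption, not the paper's formula):
        # pull each retained singular value down relative to the cutoff.
        def shrink(x):
            return np.sqrt(np.maximum(x**2 - cutoff**2, 0.0))

    # Step 1: singular values at or below the cutoff are treated as noise
    # and set to zero. Step 2: the remaining ones are reduced in magnitude.
    s_filtered = np.where(s > cutoff, shrink(s), 0.0)

    # Rebuild the matrix from the modified spectrum.
    return (U * s_filtered) @ Vt


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy matrix: a rank-2 "signal" plus small random "noise".
    signal = rng.normal(size=(20, 2)) @ rng.normal(size=(2, 10))
    noisy = signal + 0.05 * rng.normal(size=(20, 10))
    filtered = filter_weight_matrix(noisy, cutoff=1.0)
    print(np.linalg.matrix_rank(filtered))
```

In practice the cutoff and the shrinkage rule would be taken from the method described in the full text; any callable mapping singular values to reduced values can be passed as `shrink`.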

research
03/28/2022

Random matrix analysis of deep neural network weight matrices

Neural networks have been used successfully in a variety of fields, whic...
research
02/12/2023

Koopman-Based Bound for Generalization: New Aspect of Neural Networks Regarding Nonlinear Noise Filtering

We propose a new bound for generalization of neural networks using Koopm...
research
06/30/2020

Data-driven Regularization via Racecar Training for Generalizing Neural Networks

We propose a novel training approach for improving the generalization in...
research
12/28/2018

On Computation and Generalization of GANs with Spectrum Control

Generative Adversarial Networks (GANs), though powerful, is hard to trai...
research
11/18/2016

Improving training of deep neural networks via Singular Value Bounding

Deep learning methods achieve great success recently on many computer vi...
research
05/15/2019

Orthogonal Deep Neural Networks

In this paper, we introduce the algorithms of Orthogonal Deep Neural Net...
research
08/14/2022

The SVD of Convolutional Weights: A CNN Interpretability Framework

Deep neural networks used for image classification often use convolution...
