Theory of the Frequency Principle for General Deep Neural Networks

06/21/2019
by   Tao Luo, et al.
0

Along with fruitful applications of Deep Neural Networks (DNNs) to realistic problems, recently, some empirical studies of DNNs reported a universal phenomenon of Frequency Principle (F-Principle): a DNN tends to learn a target function from low to high frequencies during the training. The F-Principle has been very useful in providing both qualitative and quantitative understandings of DNNs. In this paper, we rigorously investigate the F-Principle for the training dynamics of a general DNN at three stages: initial stage, intermediate stage, and final stage. For each stage, a theorem is provided in terms of proper quantities characterizing the F-Principle. Our results are general in the sense that they work for multilayer networks with general activation functions, population densities of data, and a large class of loss functions. Our work lays a theoretical foundation of the F-Principle for a better understanding of the training process of DNNs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/03/2018

Training behavior of deep neural network in frequency domain

Why deep neural networks (DNNs) capable of overfitting often generalize ...
research
01/19/2019

Frequency Principle: Fourier Analysis Sheds Light on Deep Neural Networks

We study the training process of Deep Neural Networks (DNNs) from the Fo...
research
05/24/2019

Explicitizing an Implicit Bias of the Frequency Principle in Two-layer Neural Networks

It remains a puzzle that why deep neural networks (DNNs), with more para...
research
11/26/2018

Frequency Principle in Deep Learning with General Loss Functions and Its Potential Application

Previous studies have shown that deep neural networks (DNNs) with common...
research
10/15/2020

On the exact computation of linear frequency principle dynamics and its generalization

Recent works show an intriguing phenomenon of Frequency Principle (F-Pri...
research
05/25/2023

Neural (Tangent Kernel) Collapse

This work bridges two important concepts: the Neural Tangent Kernel (NTK...
research
02/27/2018

How (Not) To Train Your Neural Network Using the Information Bottleneck Principle

In this theory paper, we investigate training deep neural networks (DNNs...

Please sign up or login with your details

Forgot password? Click here to reset