Training behavior of deep neural network in frequency domain

07/03/2018
by   Zhi-Qin J. Xu, et al.
12

Why deep neural networks (DNNs) capable of overfitting often generalize well in practice is a mystery in deep learning. Existing works indicate that this observation holds for both complicated real datasets and simple datasets of one-dimensional (1-d) functions. In this work, for general low-frequency dominant 1-d functions, we find that a DNN with common settings first quickly captures the dominant low-frequency components, and then relatively slowly captures high-frequency ones. We call this phenomenon Frequency Principle (F-Principle). F-Principle can be observed over various DNN setups of different activation functions, layer structures and training algorithms in our experiments. F-Principle can be used to understand (i) the behavior of DNN training in the information plane and (ii) why DNNs often generalize well albeit its ability of overfitting. This F-Principle potentially can provide insights into understanding the general principle underlying DNN optimization and generalization for real datasets.

READ FULL TEXT

page 10

page 12

research
01/19/2022

Overview frequency principle/spectral bias in deep learning

Understanding deep learning is increasingly emergent as it penetrates mo...
research
06/21/2019

Theory of the Frequency Principle for General Deep Neural Networks

Along with fruitful applications of Deep Neural Networks (DNNs) to reali...
research
11/26/2018

Frequency Principle in Deep Learning with General Loss Functions and Its Potential Application

Previous studies have shown that deep neural networks (DNNs) with common...
research
05/20/2023

Loss Spike in Training Neural Networks

In this work, we study the mechanism underlying loss spikes observed dur...
research
11/13/2020

Neural Network Training Techniques Regularize Optimization Trajectory: An Empirical Study

Modern deep neural network (DNN) trainings utilize various training tech...
research
10/15/2020

On the exact computation of linear frequency principle dynamics and its generalization

Recent works show an intriguing phenomenon of Frequency Principle (F-Pri...
research
01/30/2021

Linear Frequency Principle Model to Understand the Absence of Overfitting in Neural Networks

Why heavily parameterized neural networks (NNs) do not overfit the data ...

Please sign up or login with your details

Forgot password? Click here to reset