Smaller generalization error derived for deep compared to shallow residual neural networks

10/05/2020
by   Aku Kammonen, et al.
0

Estimates of the generalization error are proved for a residual neural network with L random Fourier features layers z̅_ℓ+1=z̅_ℓ + Re∑_k=1^Kb̅_ℓ ke^ iω_ℓ kz̅_ℓ+ Re∑_k=1^Kc̅_ℓ ke^ iω'_ℓ k· x. An optimal distribution for the frequencies (ω_ℓ k,ω'_ℓ k) of the random Fourier features e^ iω_ℓ kz̅_ℓ and e^ iω'_ℓ k· x is derived. The derivation is based on the corresponding generalization error to approximate function values f(x). The generalization error turns out to be smaller than the estimate f̂^2_L^1(ℝ^d)/(LK) of the generalization error for random Fourier features with one hidden layer and the same total number of nodes LK, in the case the L^∞-norm of f is much less than the L^1-norm of its Fourier transform f̂. This understanding of an optimal distribution for random features is used to construct a new training method for a deep residual network that shows promising results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/15/2019

On the Generalization Properties of Minimum-norm Solutions for Over-parameterized Neural Network Models

We study the generalization properties of minimum-norm solutions for thr...
research
10/25/2022

A Deep Fourier Residual Method for solving PDEs using Neural Networks

When using Neural Networks as trial functions to numerically solve PDEs,...
research
07/21/2020

Adaptive random Fourier features with Metropolis sampling

The supervised learning problem to determine a neural network approximat...
research
02/04/2021

Wind Field Reconstruction with Adaptive Random Fourier Features

We investigate the use of spatial interpolation methods for reconstructi...
research
03/06/2019

A Priori Estimates of the Population Risk for Residual Networks

Optimal a priori estimates are derived for the population risk of a regu...
research
10/30/2018

Pseudo-Bayesian Learning with Kernel Fourier Transform as Prior

We revisit Rahimi and Recht (2007)'s kernel random Fourier features (RFF...
research
06/15/2020

Weighted Optimization: better generalization by smoother interpolation

We provide a rigorous analysis of how implicit bias towards smooth inter...

Please sign up or login with your details

Forgot password? Click here to reset