Full error analysis for the training of deep neural networks

09/30/2019
by   Christan Beck, et al.
0

Deep learning algorithms have been applied very successfully in recent years to a range of problems out of reach for classical solution paradigms. Nevertheless, there is no completely rigorous mathematical error and convergence analysis which explains the success of deep learning algorithms. The error of a deep learning algorithm can in many situations be decomposed into three parts, the approximation error, the generalization error, and the optimization error. In this work we estimate for a certain deep learning algorithm each of these three errors and combine these three error estimates to obtain an overall error analysis for the deep learning algorithm under consideration. In particular, we thereby establish convergence with a suitable convergence speed for the overall error of the deep learning algorithm under consideration. Our convergence speed analysis is far from optimal and the convergence speed that we establish is rather slow, increases exponentially in the dimensions, and, in particular, suffers from the curse of dimensionality. The main contribution of this work is, instead, to provide a full error analysis (i) which covers each of the three different sources of errors usually emerging in deep learning algorithms and (ii) which merges these three sources of errors into one overall error estimate for the considered deep learning algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/03/2020

Overall error analysis for the training of deep neural networks via stochastic gradient descent with random initialisation

In spite of the accomplishments of deep learning based algorithms in num...
research
12/15/2020

Strong overall error analysis for the training of artificial neural networks via random initializations

Although deep learning based approximation algorithms have been applied ...
research
06/23/2023

A new approach to generalisation error of machine learning algorithms: Estimates and convergence

In this work we consider a model problem of deep neural learning, namely...
research
10/28/2022

Convergence analysis of a quasi-Monte Carlo-based deep learning algorithm for solving partial differential equations

Deep learning methods have achieved great success in solving partial dif...
research
12/01/2020

Quantum-Inspired Classical Algorithm for Slow Feature Analysis

Recently, there has been a surge of interest for quantum computation for...
research
02/07/2021

An Analytic Layer-wise Deep Learning Framework with Applications to Robotics

Deep learning has achieved great success in many applications, but it ha...
research
05/26/2020

Enhancing accuracy of deep learning algorithms by training with low-discrepancy sequences

We propose a deep supervised learning algorithm based on low-discrepancy...

Please sign up or login with your details

Forgot password? Click here to reset