Statistical theory for image classification using deep convolutional neural networks with cross-entropy loss

11/27/2020
by   Michael Kohler, et al.
0

Convolutional neural networks learned by minimizing the cross-entropy loss are nowadays the standard for image classification. Till now, the statistical theory behind those networks is lacking. We analyze the rate of convergence of the misclassification risk of the estimates towards the optimal misclassification risk. Under suitable assumptions on the smoothness and structure of the aposteriori probability it is shown that these estimates achieve a rate of convergence which is independent of the dimension of the image. The study shed light on the good performance of CNNs learned by cross-entropy loss and partly explains their success in practical applications.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/03/2020

On the rate of convergence of image classifiers based on convolutional neural networks

Image classifiers based on convolutional neural networks are defined, an...
research
08/28/2023

Entropy-based Guidance of Deep Neural Networks for Accelerated Convergence and Improved Performance

Neural networks have dramatically increased our capacity to learn from l...
research
07/29/2019

Multi-Frame Cross-Entropy Training for Convolutional Neural Networks in Speech Recognition

We introduce Multi-Frame Cross-Entropy training (MFCE) for convolutional...
research
02/09/2021

Enhancing Audio Augmentation Methods with Consistency Learning

Data augmentation is an inexpensive way to increase training data divers...
research
10/20/2017

Unified Backpropagation for Multi-Objective Deep Learning

A common practice in most of deep convolutional neural architectures is ...
research
05/11/2022

Analysis of convolutional neural network image classifiers in a rotationally symmetric model

Convolutional neural network image classifiers are defined and the rate ...
research
09/20/2022

On a waiting-time result of Kontoyiannis: mixing or decoupling?

We introduce conditions of lower decoupling to the study of waiting-time...

Please sign up or login with your details

Forgot password? Click here to reset