A Mathematical Theory of Deep Convolutional Neural Networks for Feature Extraction

12/19/2015
by Thomas Wiatowski, et al.

Deep convolutional neural networks have led to breakthrough results in numerous practical machine learning tasks, such as classification of images in the ImageNet data set, control-policy learning for playing Atari games or the board game Go, and image captioning. Many of these applications first perform feature extraction and then feed the results into a trainable classifier. The mathematical analysis of deep convolutional neural networks for feature extraction was initiated by Mallat, 2012. Specifically, Mallat considered so-called scattering networks, based on a wavelet transform followed by the modulus non-linearity in each network layer, and proved translation invariance (asymptotically in the wavelet scale parameter) and deformation stability of the corresponding feature extractor. This paper complements Mallat's results by developing a theory that encompasses general convolutional transforms, or in more technical parlance, general semi-discrete frames (including Weyl-Heisenberg filters, curvelets, shearlets, ridgelets, wavelets, and learned filters), general Lipschitz-continuous non-linearities (e.g., rectified linear units, shifted logistic sigmoids, hyperbolic tangents, and modulus functions), and general Lipschitz-continuous pooling operators emulating, e.g., sub-sampling and averaging. In addition, all of these elements can differ from one network layer to the next. For the resulting feature extractor we prove a translation invariance result that is vertical in nature, in the sense that the features become progressively more translation-invariant with increasing network depth, and we establish deformation sensitivity bounds that apply to signal classes such as band-limited functions, cartoon functions, and Lipschitz functions.
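To make the layer structure described in the abstract concrete, here is a minimal sketch of a feature extractor whose layers each apply a convolutional transform, a Lipschitz-continuous non-linearity, and a pooling step. It is an illustration under stated assumptions and not the paper's construction: the names `circ_conv`, `layer`, `extract_features`, `FILTERS`, and `LOWPASS` are hypothetical, the signals are discrete and periodic, the filters are arbitrary toy choices that are not checked against any frame condition, the non-linearity is the modulus, and pooling is plain sub-sampling.

```python
import numpy as np

# Toy filters (assumptions for illustration only, not taken from the paper).
FILTERS = [np.array([0.5, -0.5]), np.array([0.25, -0.5, 0.25])]
LOWPASS = np.array([0.5, 0.5])  # used to generate the output features


def circ_conv(f, g):
    """Circular convolution of signal f with filter g (zero-padded to len(f))."""
    g_padded = np.zeros(len(f), dtype=complex)
    g_padded[: len(g)] = g
    return np.fft.ifft(np.fft.fft(f) * np.fft.fft(g_padded))


def layer(signals, filters, nonlinearity=np.abs, pool=2):
    """One layer: convolve with each filter, apply the non-linearity, sub-sample."""
    return [
        nonlinearity(circ_conv(f, g))[::pool]
        for f in signals
        for g in filters
    ]


def extract_features(f, filters, lowpass, depth=2, pool=2):
    """Collect low-pass filtered outputs of every layer, from layer 0 to `depth`."""
    signals = [np.asarray(f, dtype=complex)]
    features = []
    for _ in range(depth):
        features += [np.real(circ_conv(s, lowpass)) for s in signals]
        signals = layer(signals, filters, pool=pool)
    # Features of the deepest layer.
    features += [np.real(circ_conv(s, lowpass)) for s in signals]
    return features
```

With two filters and `depth=2`, the sketch propagates 1, 2, and 4 signals in layers 0, 1, and 2, so `extract_features` returns 7 feature vectors. The modulus is 1-Lipschitz, which is the kind of non-linearity the abstract refers to; swapping in `lambda z: np.maximum(np.real(z), 0)` would give a rectified-linear variant under the same assumptions.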


research
04/21/2015

Deep Convolutional Neural Networks Based on Semi-Discrete Frames

Deep convolutional neural networks have led to breakthrough results in p...
research
05/26/2016

Discrete Deep Feature Extraction: A Theory and New Architectures

First steps towards a mathematical theory of deep convolutional neural n...
research
04/29/2016

Deep Convolutional Neural Networks on Cartoon Functions

Wiatowski and Bölcskei, 2015, proved that deformation stability and vert...
research
04/12/2017

Energy Propagation in Deep Convolutional Neural Networks

Many practical machine learning tasks employ very deep convolutional neu...
research
07/10/2017

Topology Reduction in Deep Convolutional Feature Extraction Networks

Deep convolutional neural networks (CNNs) used in practice employ potent...
research
07/30/2023

Deep Convolutional Neural Networks with Zero-Padding: Feature Extraction and Learning

This paper studies the performance of deep convolutional neural networks...
research
05/23/2022

Stability of the scattering transform for deformations with minimal regularity

Within the mathematical analysis of deep convolutional neural networks, ...
