Depth Selection for Deep ReLU Nets in Feature Extraction and Generalization

04/01/2020
by   Zhi Han, et al.

Deep learning is recognized for its ability to discover deep features for representation learning and pattern recognition without relying on elegant feature engineering techniques that draw on human ingenuity and prior knowledge. It has therefore triggered enormous research activity in machine learning and pattern recognition. One of the most important challenges of deep learning is to determine the relationship between a feature and the depth of deep neural networks (deep nets for short), and thereby to reflect the necessity of depth. Our purpose is to quantify this feature-depth correspondence in feature extraction and generalization. We present the adaptivity of features to depths and vice versa by showing a depth-parameter trade-off in extracting both single features and composite features. Based on these results, we prove that implementing classical empirical risk minimization on deep nets can achieve the optimal generalization performance for numerous learning tasks. Our theoretical results are verified by a series of numerical experiments, including toy simulations and a real application of earthquake seismic intensity prediction.

research
12/16/2019

Realization of spatial sparseness by deep ReLU nets with massive data

The great success of deep learning poses urgent challenges for understan...
research
08/31/2010

Pattern Recognition in Collective Cognitive Systems: Hybrid Human-Machine Learning (HHML) By Heterogeneous Ensembles

The ubiquitous role of the cyber-infrastructures, such as the WWW, provi...
research
06/05/2018

Machine Learning for Yield Curve Feature Extraction: Application to Illiquid Corporate Bonds (Preliminary Draft)

This paper studies the application of machine learning in extracting the...
research
03/03/2017

On the Behavior of Convolutional Nets for Feature Extraction

Deep neural networks are representation learning techniques. During trai...
research
08/15/2021

Pattern Inversion as a Pattern Recognition Method for Machine Learning

Artificial neural networks use a lot of coefficients that take a great d...
research
04/28/2021

A Study of the Mathematics of Deep Learning

"Deep Learning"/"Deep Neural Nets" is a technological marvel that is now...
research
06/17/2020

Using Wavelets and Spectral Methods to Study Patterns in Image-Classification Datasets

Deep learning models extract, before a final classification layer, featu...
