Adaptive Signal Variances: CNN Initialization Through Modern Architectures

08/16/2020
by   Takahiko Henmi, et al.
8

Deep convolutional neural networks (CNN) have achieved the unwavering confidence in its performance on image processing tasks. The CNN architecture constitutes a variety of different types of layers including the convolution layer and the max-pooling layer. CNN practitioners widely understand the fact that the stability of learning depends on how to initialize the model parameters in each layer. Nowadays, no one doubts that the de facto standard scheme for initialization is the so-called Kaiming initialization that has been developed by He et al. The Kaiming scheme was derived from a much simpler model than the currently used CNN structure having evolved since the emergence of the Kaiming scheme. The Kaiming model consists only of the convolution and fully connected layers, ignoring the max-pooling layer and the global average pooling layer. In this study, we derived the initialization scheme again not from the simplified Kaiming model, but precisely from the modern CNN architectures, and empirically investigated how the new initialization method performs compared to the de facto standard ones that are widely used today.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

page 7

research
04/21/2016

Robust Audio Event Recognition with 1-Max Pooling Convolutional Neural Networks

We present in this paper a simple, yet efficient convolutional neural ne...
research
10/09/2018

Convolutional Neural Networks In Convolution

Currently, increasingly deeper neural networks have been applied to impr...
research
10/25/2016

Maxmin convolutional neural networks for image classification

Convolutional neural networks (CNN) are widely used in computer vision, ...
research
04/30/2019

Investigation of Initialization Strategies for the Multiple Instance Adaptive Cosine Estimator

Sensors which use electromagnetic induction (EMI) to excite a response i...
research
03/11/2015

A Novel Hybrid CNN-AIS Visual Pattern Recognition Engine

Machine learning methods are used today for most recognition problems. C...
research
07/30/2020

Deep learning for lithological classification of carbonate rock micro-CT images

In addition to the ongoing development, pre-salt carbonate reservoir cha...
research
02/21/2017

Convolution Aware Initialization

Initialization of parameters in deep neural networks has been shown to h...

Please sign up or login with your details

Forgot password? Click here to reset