Hierarchical Transfer Convolutional Neural Networks for Image Classification

03/30/2018
by   Xishuang Dong, et al.
0

In this paper, we address the issue of how to enhance the generalization performance of convolutional neural networks (CNN) in the early learning stage for image classification. This is motivated by real-time applications that require the generalization performance of CNN to be satisfactory within limited training time. In order to achieve this, a novel hierarchical transfer CNN framework is proposed. It consists of a group of shallow CNNs and a cloud CNN, where the shallow CNNs are trained firstly and then the first layers of the trained shallow CNNs are used to initialize the first layer of the cloud CNN. This method will boost the generalization performance of the cloud CNN significantly, especially during the early stage of training. Experiments using CIFAR-10 and ImageNet datasets are performed to examine the proposed method. Results demonstrate the improvement of testing accuracy is 12 as much as 20 the ImageNet case during the early stage of learning. It is also shown that universal improvements of testing accuracy are obtained across different settings of dropout and number of shallow CNNs.

READ FULL TEXT
research
01/21/2019

Impact of Fully Connected Layers on Performance of Convolutional Neural Networks for Image Classification

The Convolutional Neural Networks (CNNs), in domains like computer visio...
research
08/31/2016

Convolutional Neural Networks for Text Categorization: Shallow Word-level vs. Deep Character-level

This paper reports the performances of shallow word-level convolutional ...
research
10/25/2016

Maxmin convolutional neural networks for image classification

Convolutional neural networks (CNN) are widely used in computer vision, ...
research
12/29/2018

Greedy Layerwise Learning Can Scale to ImageNet

Shallow supervised 1-hidden layer neural networks have a number of favor...
research
12/18/2019

Tangent Space Separability in Feedforward Neural Networks

Hierarchical neural networks are exponentially more efficient than their...
research
11/11/2018

HSD-CNN: Hierarchically self decomposing CNN architecture using class specific filter sensitivity analysis

Conventional Convolutional neural networks (CNN) are trained on large do...
research
02/10/2021

Two Novel Performance Improvements for Evolving CNN Topologies

Convolutional Neural Networks (CNNs) are the state-of-the-art algorithms...

Please sign up or login with your details

Forgot password? Click here to reset