Mixup of Feature Maps in a Hidden Layer for Training of Convolutional Neural Network

06/24/2019
by   Hideki Oki, et al.
1

The deep Convolutional Neural Network (CNN) became very popular as a fundamental technique for image classification and objects recognition. To improve the recognition accuracy for the more complex tasks, deeper networks have being introduced. However, the recognition accuracy of the trained deep CNN drastically decreases for the samples which are obtained from the outside regions of the training samples. To improve the generalization ability for such samples, Krizhevsky et al. proposed to generate additional samples through transformations from the existing samples and to make the training samples richer. This method is known as data augmentation. Hongyi Zhang et al. introduced data augmentation method called mixup which achieves state-of-the-art performance in various datasets. Mixup generates new samples by mixing two different training samples. Mixing of the two images is implemented with simple image morphing. In this paper, we propose to apply mixup to the feature maps in a hidden layer. To implement the mixup in the hidden layer we use the Siamese network or the triplet network architecture to mix feature maps. From the experimental comparison, it is observed that the mixup of the feature maps obtained from the first convolution layer is more effective than the original image mixup.

READ FULL TEXT

page 3

page 9

research
06/14/2020

PatchUp: A Regularization Technique for Convolutional Neural Networks

Large capacity deep learning models are often prone to a high generaliza...
research
02/17/2023

Random Padding Data Augmentation

The convolutional neural network (CNN) learns the same object in differe...
research
08/20/2015

DeepWriterID: An End-to-end Online Text-independent Writer Identification System

Owing to the rapid growth of touchscreen mobile terminals and pen-based ...
research
10/24/2019

Superposition as Data Augmentation using LSTM and HMM in Small Training Sets

Considering audio and image data as having quantum nature (data are repr...
research
11/12/2019

Trainable Spectrally Initializable Matrix Transformations in Convolutional Neural Networks

In this work, we investigate the application of trainable and spectrally...
research
05/23/2022

Discriminative Feature Learning through Feature Distance Loss

Convolutional neural networks have shown remarkable ability to learn dis...
research
04/04/2020

ObjectNet Dataset: Reanalysis and Correction

Recently, Barbu et al introduced a dataset called ObjectNet which includ...

Please sign up or login with your details

Forgot password? Click here to reset