Convolution Neural Network Hyperparameter Optimization Using Simplified Swarm Optimization

03/06/2021
by   Wei-Chang Yeh, et al.
0

Among the machine learning approaches applied in computer vision, Convolutional Neural Network (CNN) is widely used in the field of image recognition. However, although existing CNN models have been proven to be efficient, it is not easy to find a network architecture with better performance. Some studies choose to optimize the network architecture, while others chose to optimize the hyperparameters, such as the number and size of convolutional kernels, convolutional strides, pooling size, etc. Most of them are designed manually, which requires relevant expertise and takes a lot of time. Therefore, this study proposes the idea of applying Simplified Swarm Optimization (SSO) on the hyperparameter optimization of LeNet models while using MNIST, Fashion MNIST, and Cifar10 as validation. The experimental results show that the proposed algorithm has higher accuracy than the original LeNet model, and it only takes a very short time to find a better hyperparameter configuration after training. In addition, we also analyze the output shape of the feature map after each layer, and surprisingly, the results were mostly rectangular. The contribution of the study is to provide users with a simpler way to get better results with the existing model., and this study can also be applied to other CNN architectures.

READ FULL TEXT

page 13

page 16

page 17

research
10/07/2016

Optimization of Convolutional Neural Network using Microcanonical Annealing Algorithm

Convolutional neural network (CNN) is one of the most prominent architec...
research
01/16/2020

Optimization of Convolutional Neural Network Using the Linearly Decreasing Weight Particle Swarm Optimization

Convolutional neural network (CNN) is one of the most frequently used de...
research
12/02/2019

ExperienceThinking: Hyperparameter Optimization with Budget Constraints

The problem of hyperparameter optimization exists widely in the real lif...
research
12/11/2021

Optimization of Residual Convolutional Neural Network for Electrocardiogram Classification

The interpretation of the electrocardiogram (ECG) gives clinical informa...
research
06/10/2023

Optimized Three Deep Learning Models Based-PSO Hyperparameters for Beijing PM2.5 Prediction

Deep learning is a machine learning approach that produces excellent per...
research
01/12/2021

Convolutional Neural Network Simplification with Progressive Retraining

Kernel pruning methods have been proposed to speed up, simplify, and imp...
research
06/08/2023

Ambulance Demand Prediction via Convolutional Neural Networks

Minimizing response times is crucial for emergency medical services to r...

Please sign up or login with your details

Forgot password? Click here to reset