Structure Learning of Deep Neural Networks with Q-Learning

10/31/2018
by   Guoqiang Zhong, et al.
0

Recently, with convolutional neural networks gaining significant achievements in many challenging machine learning fields, hand-crafted neural networks no longer satisfy our requirements as designing a network will cost a lot, and automatically generating architectures has attracted increasingly more attention and focus. Some research on auto-generated networks has achieved promising results. However, they mainly aim at picking a series of single layers such as convolution or pooling layers one by one. There are many elegant and creative designs in the carefully hand-crafted neural networks, such as Inception-block in GoogLeNet, residual block in residual network and dense block in dense convolutional network. Based on reinforcement learning and taking advantages of the superiority of these networks, we propose a novel automatic process to design a multi-block neural network, whose architecture contains multiple types of blocks mentioned above, with the purpose to do structure learning of deep neural networks and explore the possibility whether different blocks can be composed together to form a well-behaved neural network. The optimal network is created by the Q-learning agent who is trained to sequentially pick different types of blocks. To verify the validity of our proposed method, we use the auto-generated multi-block neural network to conduct experiments on image benchmark datasets MNIST, SVHN and CIFAR-10 image classification task with restricted computational resources. The results demonstrate that our method is very effective, achieving comparable or better performance than hand-crafted networks and advanced auto-generated neural networks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/16/2018

BlockQNN: Efficient Block-wise Neural Network Architecture Generation

Convolutional neural networks have gained a remarkable success in comput...
research
10/10/2018

Automatic Configuration of Deep Neural Networks with EGO

Designing the architecture for an artificial neural network is a cumbers...
research
08/26/2021

Scalable and Modular Robustness Analysis of Deep Neural Networks

As neural networks are trained to be deeper and larger, the scalability ...
research
10/03/2020

Cartographic Relief Shading with Neural Networks

Shaded relief is an effective method for visualising terrain on topograp...
research
11/15/2017

AOGNets: Deep AND-OR Grammar Networks for Visual Recognition

This paper presents a method of learning deep AND-OR Grammar (AOG) netwo...
research
08/25/2019

Exploring the Performance of Deep Residual Networks in Crazyhouse Chess

Crazyhouse is a chess variant that incorporates all of the classical che...
research
06/05/2018

EIGEN: Ecologically-Inspired GENetic Approach for Neural Network Structure Searching

Designing the structure of neural networks is considered one of the most...

Please sign up or login with your details

Forgot password? Click here to reset