Neural Network Architecture Search with Differentiable Cartesian Genetic Programming for Regression

07/03/2019
by Marcus Märtens, et al.

The ability to design complex neural network architectures that can be trained effectively by stochastic gradient descent has been key to many achievements in deep learning. However, developing such architectures remains a challenging and resource-intensive process full of trial-and-error iterations, and the relation between a network's topology and its ability to model the data remains poorly understood. We propose to encode neural networks with a differentiable variant of Cartesian Genetic Programming (dCGPANN) and present a memetic algorithm for architecture design: local searches with gradient descent learn the network parameters, while evolutionary operators act on the dCGPANN genes and shape the network architecture towards faster learning. Studying a particular instance of such a learning scheme, we are able to improve the starting feed-forward topology by learning how to rewire and prune links, adapt activation functions, and introduce skip connections for chosen regression tasks. The evolved network architectures require less space for network parameters and, given the same amount of training time, reach a significantly lower error on average.
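The memetic scheme described in the abstract can be illustrated with a small, self-contained sketch. It is not the authors' dCGPANN encoding: it assumes a toy one-hidden-layer regressor whose "genes" are a per-unit activation choice and a binary input-to-hidden connection mask, and it assumes a simple (1+λ)-style loop with inherited weights. All names (`ACTS`, `local_search`, `mutate`, `memetic_search`) and all hyperparameters are hypothetical choices made for illustration.

```python
import numpy as np

# Hypothetical activation table: gene value -> (function, derivative written
# in terms of the activation output a). Not taken from the paper.
ACTS = {
    0: (np.tanh, lambda a: 1.0 - a ** 2),
    1: (lambda z: 1.0 / (1.0 + np.exp(-z)), lambda a: a * (1.0 - a)),
    2: (lambda z: np.maximum(z, 0.0), lambda a: (a > 0).astype(float)),
}

def forward(X, genes, mask, params):
    """Return hidden activations, their derivatives, and predictions."""
    W1, b1, w2, b2 = params
    Z = X @ (W1 * mask) + b1          # masked links are pruned
    A, D = np.empty_like(Z), np.empty_like(Z)
    for j, g in enumerate(genes):     # each hidden unit has its own activation
        f, df = ACTS[int(g)]
        A[:, j] = f(Z[:, j])
        D[:, j] = df(A[:, j])
    return A, D, A @ w2 + b2

def mse(X, y, genes, mask, params):
    return float(np.mean((forward(X, genes, mask, params)[2] - y) ** 2))

def local_search(X, y, genes, mask, params, steps=50, lr=0.05):
    """Local search: plain gradient descent on the weights, topology fixed."""
    W1, b1, w2, b2 = [np.array(p, dtype=float) for p in params]  # copies
    for _ in range(steps):
        A, D, yhat = forward(X, genes, mask, (W1, b1, w2, b2))
        err = 2.0 * (yhat - y) / len(y)   # dLoss/dyhat for the MSE
        dZ = np.outer(err, w2) * D        # backpropagate to pre-activations
        W1 -= lr * (X.T @ dZ) * mask
        b1 -= lr * dZ.sum(axis=0)
        w2 -= lr * (A.T @ err)
        b2 -= lr * err.sum()
    return W1, b1, w2, b2

def mutate(genes, mask, rng):
    """Evolutionary operator: change one activation gene or flip one link."""
    genes, mask = genes.copy(), mask.copy()
    if rng.random() < 0.5:
        genes[rng.integers(len(genes))] = rng.integers(len(ACTS))
    else:
        i, j = rng.integers(mask.shape[0]), rng.integers(mask.shape[1])
        mask[i, j] = 1.0 - mask[i, j]
    return genes, mask

def memetic_search(X, y, hidden=4, generations=10, offspring=4, seed=0):
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    genes, mask = np.zeros(hidden, dtype=int), np.ones((d, hidden))
    params = (rng.normal(0, 0.5, (d, hidden)), np.zeros(hidden),
              rng.normal(0, 0.5, hidden), np.array(0.0))
    params = local_search(X, y, genes, mask, params)
    best = mse(X, y, genes, mask, params)
    history = [best]
    for _ in range(generations):
        for _ in range(offspring):
            g, m = mutate(genes, mask, rng)
            p = local_search(X, y, g, m, params)  # child inherits parent weights
            loss = mse(X, y, g, m, p)
            if loss <= best:                      # keep a child only if no worse
                genes, mask, params, best = g, m, p, loss
        history.append(best)
    return genes, mask, params, history

# Synthetic regression task, made up for this example.
rng = np.random.default_rng(1)
X = rng.uniform(-1.0, 1.0, (64, 2))
y = np.sin(3.0 * X[:, 0]) + 0.5 * X[:, 1]
genes, mask, params, history = memetic_search(X, y)
```

Because a mutated network is kept only when its training error after local search is no worse than the current best, `history` never increases from one generation to the next. In this sketch the comparison favours offspring, which receive additional gradient steps on top of the inherited weights; a fairer variant would also keep training the parent.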

research
08/25/2019

What are Neural Networks made of?

The success of Deep Learning methods is not well understood, though vari...
research
10/08/2019

Differentiable Sparsification for Deep Neural Networks

A deep neural network has relieved the burden of feature engineering by ...
research
06/24/2020

Architopes: An Architecture Modification for Composite Pattern Learning, Increased Expressiveness, and Reduced Training Time

We introduce a simple neural network architecture modification that enab...
research
11/09/2022

Designing Network Design Strategies Through Gradient Path Analysis

Designing a high-efficiency and high-quality expressive network architec...
research
12/02/2019

A Random Matrix Perspective on Mixtures of Nonlinearities for Deep Learning

One of the distinguishing characteristics of modern deep learning system...
research
07/16/2019

Compositional Deep Learning

Neural networks have become an increasingly popular tool for solving man...
research
06/22/2020

Neural networks adapting to datasets: learning network size and topology

We introduce a flexible setup allowing for a neural network to learn bot...
