Peephole: Predicting Network Performance Before Training

12/09/2017
by   Boyang Deng, et al.
0

The quest for performant networks has been a significant force that drives the advancements of deep learning in recent years. While rewarding, improving network design has never been an easy journey. The large design space combined with the tremendous cost required for network training poses a major obstacle to this endeavor. In this work, we propose a new approach to this problem, namely, predicting the performance of a network before training, based on its architecture. Specifically, we develop a unified way to encode individual layers into vectors and bring them together to form an integrated description via LSTM. Taking advantage of the recurrent network's strong expressive power, this method can reliably predict the performances of various network architectures. Our empirical studies showed that it not only achieved accurate predictions but also produced consistent rankings across datasets -- a key desideratum in performance prediction.

READ FULL TEXT

page 3

page 8

research
07/05/2022

Understanding and Improving Group Normalization

Various normalization layers have been proposed to help the training of ...
research
02/10/2020

Taylorized Training: Towards Better Approximation of Neural Network Training at Finite Width

We propose Taylorized training as an initiative towards better understan...
research
09/27/2022

Neural Network Panning: Screening the Optimal Sparse Network Before Training

Pruning on neural networks before training not only compresses the origi...
research
08/25/2019

What are Neural Networks made of?

The success of Deep Learning methods is not well understood, though vari...
research
01/18/2017

On the Performance of Network Parallel Training in Artificial Neural Networks

Artificial Neural Networks (ANNs) have received increasing attention in ...
research
05/25/2021

The Nonlinearity Coefficient - A Practical Guide to Neural Architecture Design

In essence, a neural network is an arbitrary differentiable, parametrize...
research
06/05/2018

Adaptive Importance Learning for Improving Lightweight Image Super-resolution Network

Deep neural networks have achieved remarkable success in single image su...

Please sign up or login with your details

Forgot password? Click here to reset