Neural Network Architecture Optimization through Submodularity and Supermodularity

09/01/2016
by   Junqi Jin, et al.
0

Deep learning models' architectures, including depth and width, are key factors influencing models' performance, such as test accuracy and computation time. This paper solves two problems: given computation time budget, choose an architecture to maximize accuracy, and given accuracy requirement, choose an architecture to minimize computation time. We convert this architecture optimization into a subset selection problem. With accuracy's submodularity and computation time's supermodularity, we propose efficient greedy optimization algorithms. The experiments demonstrate our algorithm's ability to find more accurate models or faster models. By analyzing architecture evolution with growing time budget, we discuss relationships among accuracy, time and architecture, and give suggestions on neural network architecture design.

READ FULL TEXT
research
11/20/2021

Quality and Computation Time in Optimization Problems

Optimization problems are crucial in artificial intelligence. Optimizati...
research
05/22/2018

CascadeCNN: Pushing the performance limits of quantisation

This work presents CascadeCNN, an automated toolflow that pushes the qua...
research
12/01/2017

Probabilistic Adaptive Computation Time

We present a probabilistic model with discrete latent variables that con...
research
06/04/2019

KarNet: An Efficient Boolean Function Simplifier

Many approaches such as Quine-McCluskey algorithm, Karnaugh map solving,...
research
07/30/2018

ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design

Currently, the neural network architecture design is mostly guided by th...
research
09/07/2022

Effects of Archive Size on Computation Time and Solution Quality for Multi-Objective Optimization

An unbounded external archive has been used to store all nondominated so...
research
03/29/2016

Adaptive Computation Time for Recurrent Neural Networks

This paper introduces Adaptive Computation Time (ACT), an algorithm that...

Please sign up or login with your details

Forgot password? Click here to reset