Model Complexity-Accuracy Trade-off for a Convolutional Neural Network

05/09/2017
by Atul Dhingra, et al.

Convolutional Neural Networks (CNNs) have had great success in the recent past, owing to the advent of faster GPUs and memory access. CNNs are powerful because they learn features from data in layers, such that the learned features resemble the structure of the V1 features of the human brain. A major bottleneck is that CNNs are very large and have a very high memory footprint, so they cannot be deployed on devices with limited storage, such as mobile phones and IoT devices. In this work, we study the trade-off between model complexity and accuracy on the MNIST dataset and give a concrete framework for handling this problem, given the worst-case accuracy that a system can tolerate. We reduce model complexity by a factor of 236 and memory footprint by a factor of 19.5 compared to the base model, while still meeting the worst-case accuracy threshold.
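To make the terms in the abstract concrete, here is a minimal sketch of how model complexity (parameter count) and memory footprint could be tallied, and how the smallest model that meets a worst-case accuracy threshold could be chosen. It is not the authors' framework. The LeNet-style layer sizes, the `Candidate` type, and every function name are assumptions introduced for illustration only, and the abstract does not say how the paper measures either quantity.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


def conv_params(kernel: int, in_channels: int, out_channels: int) -> int:
    """Weights plus one bias per output channel for a square-kernel conv layer."""
    return (kernel * kernel * in_channels + 1) * out_channels


def dense_params(in_features: int, out_features: int) -> int:
    """Weights plus one bias per output unit for a fully connected layer."""
    return (in_features + 1) * out_features


def footprint_bytes(num_params: int, bits_per_param: int = 32) -> int:
    """Storage needed for the parameters alone at a given precision."""
    return num_params * bits_per_param // 8


@dataclass(frozen=True)
class Candidate:
    """Hypothetical record for one trained model variant."""
    name: str
    num_params: int
    accuracy: float  # measured on held-out data, in [0, 1]


def smallest_acceptable(
    candidates: Sequence[Candidate], min_accuracy: float
) -> Optional[Candidate]:
    """Return the candidate with the fewest parameters whose accuracy
    is at least min_accuracy, or None if no candidate qualifies."""
    acceptable = [c for c in candidates if c.accuracy >= min_accuracy]
    return min(acceptable, key=lambda c: c.num_params, default=None)


# Assumed LeNet-style network on 28x28x1 MNIST input (NOT the paper's model):
# two 5x5 valid convolutions, each followed by 2x2 pooling, then three dense layers.
base_params = (
    conv_params(5, 1, 6)
    + conv_params(5, 6, 16)
    + dense_params(16 * 4 * 4, 120)
    + dense_params(120, 84)
    + dense_params(84, 10)
)

if __name__ == "__main__":
    print("parameters:", base_params)
    print("bytes at 32-bit precision:", footprint_bytes(base_params))
    print("bytes at 8-bit precision:", footprint_bytes(base_params, 8))
```

In this sketch the footprint depends on both the parameter count and the bits stored per parameter, which is one way the two reduction factors quoted in the abstract could differ from each other.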

research
10/12/2016

Optimizing Memory Efficiency for Deep Convolutional Neural Networks on GPUs

Leveraging large data sets, deep Convolutional Neural Networks (CNNs) ac...
research
02/01/2023

Constant RMR Recoverable Mutex under System-wide Crashes

We design two Recoverable Mutual Exclusion (RME) locks for the system-wi...
research
04/28/2018

Low-memory convolutional neural networks through incremental depth-first processing

We introduce an incremental processing scheme for convolutional neural n...
research
12/21/2015

Quantized Convolutional Neural Networks for Mobile Devices

Recently, convolutional neural networks (CNN) have demonstrated impressi...
research
02/12/2020

Retrain or not retrain? – efficient pruning methods of deep CNN networks

Convolutional neural networks (CNN) play a major role in image processin...
research
08/06/2023

Nucleotide String Indexing using Range Matching

The two most common data-structures for genome indexing, FM-indices and ...
research
11/28/2016

Efficient Convolutional Auto-Encoding via Random Convexification and Frequency-Domain Minimization

The omnipresence of deep learning architectures such as deep convolution...