SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size

02/24/2016
by   Forrest N. Iandola, et al.
0

Recent research on deep neural networks has focused primarily on improving accuracy. For a given accuracy level, it is typically possible to identify multiple DNN architectures that achieve that accuracy level. With equivalent accuracy, smaller DNN architectures offer at least three advantages: (1) Smaller DNNs require less communication across servers during distributed training. (2) Smaller DNNs require less bandwidth to export a new model from the cloud to an autonomous car. (3) Smaller DNNs are more feasible to deploy on FPGAs and other hardware with limited memory. To provide all of these advantages, we propose a small DNN architecture called SqueezeNet. SqueezeNet achieves AlexNet-level accuracy on ImageNet with 50x fewer parameters. Additionally, with model compression techniques we are able to compress SqueezeNet to less than 0.5MB (510x smaller than AlexNet). The SqueezeNet architecture is available for download here: https://github.com/DeepScale/SqueezeNet

READ FULL TEXT
research
03/30/2023

XPert: Peripheral Circuit Neural Architecture Co-search for Area and Energy-efficient Xbar-based Computing

The hardware-efficiency and accuracy of Deep Neural Networks (DNNs) impl...
research
07/20/2022

Automated machine learning for borehole resistivity measurements

Deep neural networks (DNNs) offer a real-time solution for the inversion...
research
07/07/2018

Anytime Neural Prediction via Slicing Networks Vertically

The pioneer deep neural networks (DNNs) have emerged to be deeper or wid...
research
06/02/2022

DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks

Efficient deep neural network (DNN) models equipped with compact operato...
research
11/24/2021

Hidden-Fold Networks: Random Recurrent Residuals Using Sparse Supermasks

Deep neural networks (DNNs) are so over-parametrized that recent researc...
research
11/03/2020

Parameter Efficient Deep Neural Networks with Bilinear Projections

Recent research on deep neural networks (DNNs) has primarily focused on ...
research
03/13/2020

Partial Weight Adaptation for Robust DNN Inference

Mainstream video analytics uses a pre-trained DNN model with an assumpti...

Please sign up or login with your details

Forgot password? Click here to reset