Efficient CNN-LSTM based Image Captioning using Neural Network Compression

12/17/2020
by Harshit Rampal, et al.

Modern neural networks achieve state-of-the-art performance on tasks in computer vision, natural language processing, and related fields. However, they are notorious for their voracious memory and compute appetite, which obstructs their deployment on resource-limited edge devices. To enable edge deployment, researchers have developed pruning and quantization algorithms that compress such networks without compromising their efficacy. These compression algorithms have mostly been studied on standalone CNN and RNN architectures; in this work, we present an unconventional end-to-end compression pipeline for a CNN-LSTM based image captioning model. The model is trained on the Flickr8k dataset using VGG16 or ResNet50 as the encoder and an LSTM as the decoder. We then examine the effects of different compression architectures on the model and design a compression architecture that achieves a 73.1% reduction in inference time and a 7.7% [...] its uncompressed counterpart.
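As a rough illustration of the two compression techniques the abstract names, the sketch below applies magnitude pruning followed by symmetric 8-bit quantization to a flat list of weights. It is not the authors' pipeline: the function names (`magnitude_prune`, `quantize_int8`, `dequantize`), the 50% sparsity level, and the example weights are all assumptions chosen for illustration, and a real CNN-LSTM model would apply these steps per layer with a deep learning framework.

```python
def magnitude_prune(weights, sparsity):
    """Zero out the `sparsity` fraction of weights with the smallest magnitude.

    Hypothetical helper for illustration; not taken from the paper.
    """
    k = int(len(weights) * sparsity)  # number of weights to remove
    if k == 0:
        return list(weights)
    # Magnitude of the k-th smallest weight acts as the pruning threshold.
    threshold = sorted(abs(w) for w in weights)[k - 1]
    pruned, removed = [], 0
    for w in weights:
        if abs(w) <= threshold and removed < k:
            pruned.append(0.0)
            removed += 1
        else:
            pruned.append(w)
    return pruned


def quantize_int8(weights):
    """Symmetric linear quantization of floats to integers in [-127, 127]."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Map quantized integers back to approximate float weights."""
    return [v * scale for v in q]


# Assumed example weights; a real layer would have many more values.
weights = [0.5, -0.02, 0.9, 0.01, -0.7, 0.03]
pruned = magnitude_prune(weights, sparsity=0.5)
q, scale = quantize_int8(pruned)
restored = dequantize(q, scale)
```

Pruning first and quantizing afterwards keeps the zeroed weights exactly zero after quantization, since zero maps to the integer 0 under a symmetric scheme.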

