Image to Bengali Caption Generation Using Deep CNN and Bidirectional Gated Recurrent Unit

12/22/2020
by   Al Momin Faruk, et al.
1

There is very little notable research on generating descriptions of the Bengali language. About 243 million people speak in Bengali, and it is the 7th most spoken language on the planet. The purpose of this research is to propose a CNN and Bidirectional GRU based architecture model that generates natural language captions in the Bengali language from an image. Bengali people can use this research to break the language barrier and better understand each other's perspectives. It will also help many blind people with their everyday lives. This paper used an encoder-decoder approach to generate captions. We used a pre-trained Deep convolutional neural network (DCNN) called InceptonV3image embedding model as the encoder for analysis, classification, and annotation of the dataset's images Bidirectional Gated Recurrent unit (BGRU) layer as the decoder to generate captions. Argmax and Beam search is used to produce the highest possible quality of the captions. A new dataset called BNATURE is used, which comprises 8000 images with five captions per image. It is used for training and testing the proposed model. We obtained BLEU-1, BLEU-2, BLEU-3, BLEU-4 and Meteor is 42.6, 27.95, 23, 66, 16.41, 28.7 respectively.

READ FULL TEXT

page 1

page 4

research
10/24/2021

Bangla Image Caption Generation through CNN-Transformer based Encoder-Decoder Network

Automatic Image Captioning is the never-ending effort of creating syntac...
research
09/28/2016

Variational Autoencoder for Deep Learning of Images, Labels and Captions

A novel variational autoencoder is developed to model images, as well as...
research
08/02/2019

Falls Prediction in eldery people using Gated Recurrent Units

Falls prevention, especially in older people, becomes an increasingly im...
research
10/27/2019

Memeify: A Large-Scale Meme Generation System

Interest in the research areas related to meme propagation and generatio...
research
04/30/2020

memeBot: Towards Automatic Image Meme Generation

Image memes have become a widespread tool used by people for interacting...
research
06/08/2018

Dank Learning: Generating Memes Using Deep Neural Networks

We introduce a novel meme generation system, which given any image can p...
research
06/26/2019

A Deep Decoder Structure Based on WordEmbedding Regression for An Encoder-Decoder Based Model for Image Captioning

Generating textual descriptions for images has been an attractive proble...

Please sign up or login with your details

Forgot password? Click here to reset