Natural language description of images using hybrid recurrent neural network

02/25/2021
by   md-asifuzzaman-jishan, et al.
0

We presented a learning model that generated natural language description of images. The model utilized the connections between natural language and visual data by produced text line based contents from a given image. Our Hybrid Recurrent Neural Network model is based on the intricacies of Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), and Bi-directional Recurrent Neural Network (BRNN) models. We conducted experiments on three benchmark datasets, eg, Flickr8K, Flickr30K, and MS COCO. Our hybrid model utilized LSTM model to encode text line or sentences independent of the object location and BRNN for word representation, this reduced the computational complexities without compromising the accuracy of the descriptor. The model produced better accuracy in retrieving natural language based description on the dataset.

READ FULL TEXT

page 2

page 7

page 9

research
02/25/2021

IMAGETOTEXT: IMAGE CAPTION GENERATION USING HYBRID RECURRENT NEURAL NETWORK

Generating a natural language description from images is an important pr...
research
06/08/2017

Image Captioning with Object Detection and Localization

Automatically generating a natural language description of an image is a...
research
10/09/2019

Kernel-Based Approaches for Sequence Modeling: Connections to Neural Methods

We investigate time-dependent data analysis from the perspective of recu...
research
02/25/2021

Bangla language textual image description by hybrid neural network model

Automatic image captioning task in different language is a challenging t...
research
02/25/2021

Hybrid deep neural network for Bangla automated image descriptor

Automated image to text generation is a computationally challenging comp...
research
08/27/2016

Learning to generalize to new compositions in image understanding

Recurrent neural networks have recently been used for learning to descri...
research
03/17/2022

Knowledge Graph-Enabled Text-Based Automatic Personality Prediction

How people think, feel, and behave, primarily is a representation of the...

Please sign up or login with your details

Forgot password? Click here to reset