NITS-VC System for VATEX Video Captioning Challenge 2020

06/07/2020
by   Alok Singh, et al.
0

Video captioning is process of summarising the content, event and action of the video into a short textual form which can be helpful in many research areas such as video guided machine translation, video sentiment analysis and providing aid to needy individual. In this paper, a system description of the framework used for VATEX-2020 video captioning challenge is presented. We employ an encoder-decoder based approach in which the visual features of the video are encoded using 3D convolutional neural network (C3D) and in the decoding phase two Long Short Term Memory (LSTM) recurrent networks are used in which visual features and input captions are fused separately and final output is generated by performing element-wise product between the output of both LSTMs. Our model is able to achieve BLEU scores of 0.20 and 0.22 on public and private test data sets respectively.

READ FULL TEXT
research
06/15/2016

Bidirectional Long-Short Term Memory for Video Description

Video captioning has been attracting broad research attention in multime...
research
11/17/2016

Multimodal Memory Modelling for Video Captioning

Video captioning which automatically translates video clips into natural...
research
04/04/2019

An End-to-End Baseline for Video Captioning

Building correspondences across different modalities, such as video and ...
research
06/01/2018

Synchronous Prediction of Arousal and Valence Using LSTM Network for Affective Video Content Analysis

The affect embedded in video data conveys high-level semantic informatio...
research
12/02/2021

Controllable Video Captioning with an Exemplar Sentence

In this paper, we investigate a novel and challenging task, namely contr...
research
12/09/2015

Video captioning with recurrent networks based on frame- and video-level features and visual content classification

In this paper, we describe the system for generating textual description...
research
01/26/2018

A Multilayer Convolutional Encoder-Decoder Neural Network for Grammatical Error Correction

We improve automatic correction of grammatical, orthographic, and colloc...

Please sign up or login with your details

Forgot password? Click here to reset