Syntax Customized Video Captioning by Imitating Exemplar Sentences

12/02/2021
by   Yitian Yuan, et al.
0

Enhancing the diversity of sentences to describe video contents is an important problem arising in recent video captioning research. In this paper, we explore this problem from a novel perspective of customizing video captions by imitating exemplar sentence syntaxes. Specifically, given a video and any syntax-valid exemplar sentence, we introduce a new task of Syntax Customized Video Captioning (SCVC) aiming to generate one caption which not only semantically describes the video contents but also syntactically imitates the given exemplar sentence. To tackle the SCVC task, we propose a novel video captioning model, where a hierarchical sentence syntax encoder is firstly designed to extract the syntactic structure of the exemplar sentence, then a syntax conditioned caption decoder is devised to generate the syntactically structured caption expressing video semantics. As there is no available syntax customized groundtruth video captions, we tackle such a challenge by proposing a new training strategy, which leverages the traditional pairwise video captioning data and our collected exemplar sentences to accomplish the model learning. Extensive experiments, in terms of semantic, syntactic, fluency, and diversity evaluations, clearly demonstrate our model capability to generate syntax-varied and semantics-coherent video captions that well imitate different exemplar sentences with enriched diversities.

READ FULL TEXT

page 3

page 7

page 11

page 12

page 13

research
12/02/2021

Controllable Video Captioning with an Exemplar Sentence

In this paper, we investigate a novel and challenging task, namely contr...
research
08/25/2016

Title Generation for User Generated Videos

A great video title describes the most salient event compactly and captu...
research
06/21/2022

Bypass Network for Semantics Driven Image Paragraph Captioning

Image paragraph captioning aims to describe a given image with a sequenc...
research
08/27/2019

Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network

In this paper, we propose to guide the video caption generation with Par...
research
08/31/2019

A Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling

Given the features of a video, recurrent neural network can be used to a...
research
08/12/2019

Sentence Specified Dynamic Video Thumbnail Generation

With the tremendous growth of videos over the Internet, video thumbnails...
research
07/26/2018

Move Forward and Tell: A Progressive Generator of Video Descriptions

We present an efficient framework that can generate a coherent paragraph...

Please sign up or login with your details

Forgot password? Click here to reset