Multi-Task Video Captioning with Video and Entailment Generation

04/24/2017
by Ramakanth Pasunuru, et al.
Video captioning, the task of describing the content of a video, has seen promising improvements in recent years with sequence-to-sequence models, but accurately learning the temporal and logical dynamics involved in the task remains a challenge, especially given the lack of sufficient annotated data. We improve video captioning by sharing knowledge with two related directed-generation tasks: a temporally-directed unsupervised video prediction task to learn richer context-aware video encoder representations, and a logically-directed language entailment generation task to learn better video-entailed caption decoder representations. For this, we present a many-to-many multi-task learning model that shares parameters across the encoders and decoders of the three tasks. We achieve significant improvements and a new state of the art on several standard video captioning datasets, as measured by diverse automatic and human evaluations. We also show mutual multi-task improvements on the entailment generation task.
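The parameter sharing described in the abstract can be illustrated with a minimal sketch. It assumes that captioning shares its video encoder with video prediction and its language decoder with entailment generation, which is how the abstract pairs the tasks. The `Module` and `Task` classes, the update counters, and the round-robin task schedule are all hypothetical stand-ins chosen for illustration; they are not the authors' implementation, and the abstract does not specify how training alternates between tasks.

```python
from dataclasses import dataclass
from itertools import cycle, islice


@dataclass
class Module:
    """Hypothetical stand-in for a trainable encoder or decoder (no real weights)."""
    name: str
    updates: int = 0  # number of training steps that touched this module


@dataclass
class Task:
    """One encoder-decoder task; modules may be shared with other tasks."""
    name: str
    encoder: Module
    decoder: Module

    def train_step(self) -> None:
        # A real step would compute a loss and backpropagate through both
        # modules; here we only record that both were updated.
        self.encoder.updates += 1
        self.decoder.updates += 1


def build_tasks() -> dict:
    # Shared modules (assumed pairing, following the abstract).
    video_encoder = Module("video_encoder")        # video prediction + captioning
    language_decoder = Module("language_decoder")  # captioning + entailment generation
    # Task-specific modules.
    video_decoder = Module("video_decoder")
    language_encoder = Module("language_encoder")
    return {
        "video_prediction": Task("video_prediction", video_encoder, video_decoder),
        "captioning": Task("captioning", video_encoder, language_decoder),
        "entailment_generation": Task(
            "entailment_generation", language_encoder, language_decoder
        ),
    }


def train(tasks: dict, schedule: list, steps: int) -> None:
    # Assumed round-robin alternation over tasks, one step per task in turn.
    for task_name in islice(cycle(schedule), steps):
        tasks[task_name].train_step()


tasks = build_tasks()
train(tasks, ["captioning", "video_prediction", "entailment_generation"], steps=6)
for task in tasks.values():
    print(task.name, task.encoder.name, task.encoder.updates,
          task.decoder.name, task.decoder.updates)
```

Because the shared modules are the same Python objects in two tasks, every step on either task updates them. In this sketch the shared video encoder and language decoder receive twice as many updates as the task-specific modules, which is one way to picture how the auxiliary tasks can shape the captioning model's representations.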
