Multi-View Features and Hybrid Reward Strategies for Vatex Video Captioning Challenge 2019

10/17/2019

∙

This document describes our solution for the VATEX Captioning Challenge 2019, which requires generating descriptions for the videos in both English and Chinese languages. We identified three crucial factors that improve the performance, namely: multi-view features, hybrid reward, and diverse ensemble. Our method achieves the 2nd and the 3rd places on the Chinese and English video captioning tracks, respectively.

READ FULL TEXT

Multi-View Features and Hybrid Reward Strategies for Vatex Video Captioning Challenge 2019

Sign in with Google

Consider DeepAI Pro