-
Multi-modal Deep Analysis for Multimedia
With the rapid development of Internet and multimedia services in the pa...
read it
-
Video Skimming: Taxonomy and Comprehensive Survey
Video skimming, also known as dynamic video summarization, generates a t...
read it
-
Multi-Modal Summary Generation using Multi-Objective Optimization
Significant development of communication technology over the past few ye...
read it
-
Online Multi-modal Person Search in Videos
The task of searching certain people in videos has seen increasing poten...
read it
-
Query-Aware Sparse Coding for Multi-Video Summarization
Given the explosive growth of online videos, it is becoming increasingly...
read it
-
Investigating non-classical correlations between decision fused multi-modal documents
Correlation has been widely used to facilitate various information retri...
read it
-
Improving IT Support by Enhancing Incident Management Process with Multi-modal Analysis
IT support services industry is going through a major transformation wit...
read it
Multi-modal Summarization for Video-containing Documents
Summarization of multimedia data becomes increasingly significant as it is the basis for many real-world applications, such as question answering, Web search, and so forth. Most existing multi-modal summarization works however have used visual complementary features extracted from images rather than videos, thereby losing abundant information. Hence, we propose a novel multi-modal summarization task to summarize from a document and its associated video. In this work, we also build a baseline general model with effective strategies, i.e., bi-hop attention and improved late fusion mechanisms to bridge the gap between different modalities, and a bi-stream summarization strategy to employ text and video summarization simultaneously. Comprehensive experiments show that the proposed model is beneficial for multi-modal summarization and superior to existing methods. Moreover, we collect a novel dataset and it provides a new resource for future study that results from documents and videos.
READ FULL TEXT
Comments
There are no comments yet.