A Survey on Multi-modal Summarization

09/11/2021
by   Anubhav Jangra, et al.
0

The new era of technology has brought us to the point where it is convenient for people to share their opinions over an abundance of platforms. These platforms have a provision for the users to express themselves in multiple forms of representations, including text, images, videos, and audio. This, however, makes it difficult for users to obtain all the key information about a topic, making the task of automatic multi-modal summarization (MMS) essential. In this paper, we present a comprehensive survey of the existing research in the area of MMS.

READ FULL TEXT

page 9

page 10

page 17

page 18

research
09/17/2020

Multi-modal Summarization for Video-containing Documents

Summarization of multimedia data becomes increasingly significant as it ...
research
04/26/2021

GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization

Traditional video summarization methods generate fixed video representat...
research
05/19/2020

Multi-Modal Summary Generation using Multi-Objective Optimization

Significant development of communication technology over the past few ye...
research
06/27/2021

Multi-Modal Chorus Recognition for Improving Song Search

We discuss a novel task, Chorus Recognition, which could potentially ben...
research
05/19/2023

A Topic-aware Summarization Framework with Different Modal Side Information

Automatic summarization plays an important role in the exponential docum...
research
09/21/2019

Video Skimming: Taxonomy and Comprehensive Survey

Video skimming, also known as dynamic video summarization, generates a t...
research
11/13/2021

Memotion Analysis through the Lens of Joint Embedding

Joint embedding (JE) is a way to encode multi-modal data into a vector s...

Please sign up or login with your details

Forgot password? Click here to reset