Supervised Video Summarization via Multiple Feature Sets with Parallel Attention

04/23/2021
by   Junaid Ahmed Ghauri, et al.
0

The assignment of importance scores to particular frames or (short) segments in a video is crucial for summarization, but also a difficult task. Previous work utilizes only one source of visual features. In this paper, we suggest a novel model architecture that combines three feature sets for visual content and motion to predict importance scores. The proposed architecture utilizes an attention mechanism before fusing motion features and features representing the (static) visual content, i.e., derived from an image classification model. Comprehensive experimental evaluations are reported for two well-known datasets, SumMe and TVSum. In this context, we identify methodological issues on how previous work used these benchmark datasets, and present a fair evaluation scheme with appropriate data splits that can be used in future work. When using static and motion features with parallel attention mechanism, we improve state-of-the-art results for SumMe, while being on par with the state of the art for the other dataset.

READ FULL TEXT
research
05/26/2021

Unsupervised Video Summarization via Multi-source Features

Video summarization aims at generating a compact yet representative visu...
research
11/24/2018

Discriminative Feature Learning for Unsupervised Video Summarization

In this paper, we address the problem of unsupervised video summarizatio...
research
01/27/2022

Exploring Global Diversity and Local Context for Video Summarization

Video summarization aims to automatically generate a diverse and concise...
research
10/29/2019

Contrastive Attention Mechanism for Abstractive Sentence Summarization

We propose a contrastive attention mechanism to extend the sequence-to-s...
research
06/02/2020

Transfoming Multi-Concept Attention into Video Summarization

Video summarization is among challenging tasks in computer vision, which...
research
11/27/2019

Automatic Generation of Headlines for Online Math Questions

Mathematical equations are an important part of dissemination and commun...
research
11/23/2016

A dataset and exploration of models for understanding video data through fill-in-the-blank question-answering

While deep convolutional neural networks frequently approach or exceed h...

Please sign up or login with your details

Forgot password? Click here to reset