Exploring global diverse attention via pairwise temporal relation for video summarization

09/23/2020
by Ping Li, et al.

Video summarization is an effective way to facilitate video searching and browsing. Most existing systems employ encoder-decoder recurrent neural networks, which fail to explicitly diversify the generated summary frames and are computationally intensive. In this paper, we propose an efficient convolutional neural network architecture for video SUMmarization via Global Diverse Attention, called SUM-GDA, which adapts the attention mechanism from a global perspective to consider pairwise temporal relations between video frames. In particular, the GDA module has two advantages: 1) it models the relations within paired frames as well as the relations among all pairs, thus capturing global attention across all frames of a video; 2) it reflects the importance of each frame to the whole video, leading to diverse attention over these frames. SUM-GDA is therefore well suited to generating diverse frames that form a satisfactory video summary. Extensive experiments on three data sets, i.e., SumMe, TVSum, and VTW, demonstrate that SUM-GDA and its extension outperform competing state-of-the-art methods. In addition, the proposed models can be run in parallel at significantly lower computational cost, which helps deployment in highly demanding applications.
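To make the idea of attention over all frame pairs concrete, the following is a minimal sketch and not the SUM-GDA formulation itself, which the abstract does not spell out. It assumes plain scaled dot-product attention between every pair of frame features, and it assumes a frame's importance can be approximated by the average attention it receives from all frames. The function names, the random projection matrices `w_q` and `w_k`, and the top-k selection step are all hypothetical and chosen only for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def global_pairwise_attention(features, w_q, w_k):
    """Attention over all frame pairs (illustrative, not the paper's exact module).

    features: (T, d) array, one feature vector per frame.
    w_q, w_k: (d, d) projection matrices (hypothetical parameters).
    Returns the (T, T) attention matrix and a length-T importance vector.
    """
    q = features @ w_q
    k = features @ w_k
    # Entry (i, j) scores how strongly frame i attends to frame j.
    scores = q @ k.T / np.sqrt(q.shape[1])
    attn = softmax(scores, axis=1)  # each row sums to 1
    # Assumption: importance of frame j = mean attention it receives.
    importance = attn.mean(axis=0)
    return attn, importance


def select_frames(importance, budget):
    """Pick the `budget` highest-scoring frames, returned in temporal order."""
    top = np.argsort(importance)[::-1][:budget]
    return np.sort(top)


rng = np.random.default_rng(0)
num_frames, dim = 8, 16
features = rng.normal(size=(num_frames, dim))
w_q = rng.normal(size=(dim, dim)) * 0.1
w_k = rng.normal(size=(dim, dim)) * 0.1

attn, importance = global_pairwise_attention(features, w_q, w_k)
summary = select_frames(importance, budget=3)
print(summary)
```

Because every pairwise score comes out of a single matrix product, all frames are processed at once rather than step by step as in a recurrent model, which is in line with the abstract's claim that the models can be run in parallel.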


research
01/27/2022

Exploring Global Diversity and Local Context for Video Summarization

Video summarization aims to automatically generate a diverse and concise...
research
01/31/2020

Convolutional Hierarchical Attention Network for Query-Focused Video Summarization

Previous approaches for video summarization mainly concentrate on findin...
research
06/02/2020

Transforming Multi-Concept Attention into Video Summarization

Video summarization is among challenging tasks in computer vision, which...
research
07/16/2023

Self-Attention Based Generative Adversarial Networks For Unsupervised Video Summarization

In this paper, we study the problem of producing a comprehensive video s...
research
07/17/2020

SumGraph: Video Summarization via Recursive Graph Modeling

The goal of video summarization is to select keyframes that are visually...
research
04/17/2019

Cycle-SUM: Cycle-consistent Adversarial LSTM Networks for Unsupervised Video Summarization

In this paper, we present a novel unsupervised video summarization model...