Exploring Global Diversity and Local Context for Video Summarization

01/27/2022
by Yingchao Pan, et al.

Video summarization aims to automatically generate a diverse and concise summary, which is useful in large-scale video processing. Most methods adopt a self-attention mechanism across video frames, which fails to model the diversity of those frames. To alleviate this problem, we revisit the pairwise similarity measurement in the self-attention mechanism and find that the existing inner-product affinity leads to discriminative features rather than diversified features. In light of this finding, we propose global diverse attention, which uses the squared Euclidean distance instead of the inner product to compute the affinities. Moreover, we model local contextual information by proposing local contextual attention to remove redundancy in the video. By combining these two attention mechanisms, we develop a video SUMmarization model with a Diversified Contextual Attention scheme, named SUM-DCA. Extensive experiments on benchmark data sets verify the effectiveness and superiority of SUM-DCA in terms of F-score and rank-based evaluation, without any bells and whistles.
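The contrast between inner-product affinity and squared-Euclidean-distance affinity can be illustrated with a minimal NumPy sketch. It is not the SUM-DCA implementation: the abstract does not say how the distances are normalized or how the local context is defined, so the row-wise softmax over distances, the fixed temporal window, and all function names (`squared_euclidean_affinity`, `local_mask`, `attend`) are assumptions made for illustration. Learned query/key projections are omitted.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax; -inf entries receive zero weight.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def inner_product_affinity(q, k):
    # Standard self-attention score: high for similar frames.
    return q @ k.T

def squared_euclidean_affinity(q, k):
    # ||q_i - k_j||^2 = ||q_i||^2 + ||k_j||^2 - 2 q_i . k_j
    q2 = (q ** 2).sum(axis=1, keepdims=True)
    k2 = (k ** 2).sum(axis=1, keepdims=True).T
    d = q2 + k2 - 2.0 * (q @ k.T)
    return np.maximum(d, 0.0)  # clip tiny negatives from rounding

def local_mask(n, window):
    # Assumed form of "local context": frames within `window` steps.
    idx = np.arange(n)
    return np.abs(idx[:, None] - idx[None, :]) <= window

def attend(frames, affinity, mask=None):
    # Assumption: affinities are normalized with a row-wise softmax.
    scores = affinity(frames, frames)
    if mask is not None:
        scores = np.where(mask, scores, -np.inf)
    weights = softmax(scores, axis=1)
    return weights @ frames, weights

# Toy features: frames 0 and 1 are near-duplicates, frame 2 differs.
frames = np.array([[0.0, 0.0], [0.0, 0.1], [5.0, 5.0]])
_, w_dist = attend(frames, squared_euclidean_affinity)
_, w_local = attend(frames, squared_euclidean_affinity, local_mask(3, 1))
```

In this toy setup, the distance-based weights for frame 0 favor the dissimilar frame 2 over the near-duplicate frame 1, while the windowed variant gives frame 0 no weight on frame 2 because it lies outside the window.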

research
09/23/2020

Exploring global diverse attention via pairwise temporal relation for video summarization

Video summarization is an effective way to facilitate video searching an...
research
07/16/2023

Self-Attention Based Generative Adversarial Networks For Unsupervised Video Summarization

In this paper, we study the problem of producing a comprehensive video s...
research
06/02/2020

Transforming Multi-Concept Attention into Video Summarization

Video summarization is among challenging tasks in computer vision, which...
research
04/23/2021

Supervised Video Summarization via Multiple Feature Sets with Parallel Attention

The assignment of importance scores to particular frames or (short) segm...
research
08/19/2020

Query Twice: Dual Mixture Attention Meta Learning for Video Summarization

Video summarization aims to select representative frames to retain high-...
