ERA: Entity Relationship Aware Video Summarization with Wasserstein GAN

09/06/2021
by   Guande Wu, et al.
0

Video summarization aims to simplify large scale video browsing by generating concise, short summaries that diver from but well represent the original video. Due to the scarcity of video annotations, recent progress for video summarization concentrates on unsupervised methods, among which the GAN based methods are most prevalent. This type of methods includes a summarizer and a discriminator. The summarized video from the summarizer will be assumed as the final output, only if the video reconstructed from this summary cannot be discriminated from the original one by the discriminator. The primary problems of this GAN based methods are two folds. First, the summarized video in this way is a subset of original video with low redundancy and contains high priority events/entities. This summarization criterion is not enough. Second, the training of the GAN framework is not stable. This paper proposes a novel Entity relationship Aware video summarization method (ERA) to address the above problems. To be more specific, we introduce an Adversarial Spatio Temporal network to construct the relationship among entities, which we think should also be given high priority in the summarization. The GAN training problem is solved by introducing the Wasserstein GAN and two newly proposed video patch/score sum losses. In addition, the score sum loss can also relieve the model sensitivity to the varying video lengths, which is an inherent problem for most current video analysis tasks. Our method substantially lifts the performance on the target benchmark datasets and exceeds the current leaderboard Rank 1 state of the art CSNet (2.1 3.1 approach will shed some light on the future research of unsupervised video summarization.

READ FULL TEXT

page 2

page 8

research
04/30/2018

DTR-GAN: Dilated Temporal Relational Adversarial Network for Video Summarization

The large amount of videos popping up every day, make it is more and mor...
research
04/17/2019

Cycle-SUM: Cycle-consistent Adversarial LSTM Networks for Unsupervised Video Summarization

In this paper, we present a novel unsupervised video summarization model...
research
09/30/2021

IntentVizor: Towards Generic Query Guided Interactive Video Summarization Using Slow-Fast Graph Convolutional Networks

The target of automatic Video summarization is to create a short skim of...
research
12/08/2019

ILS-SUMM: Iterated Local Search for Unsupervised Video Summarization

In recent years, there has been an increasing interest in building video...
research
07/17/2018

Query-Conditioned Three-Player Adversarial Network for Video Summarization

Video summarization plays an important role in video understanding by se...
research
07/13/2017

Query-Aware Sparse Coding for Multi-Video Summarization

Given the explosive growth of online videos, it is becoming increasingly...
research
11/20/2020

SalSum: Saliency-based Video Summarization using Generative Adversarial Networks

The huge amount of video data produced daily by camera-based systems, su...

Please sign up or login with your details

Forgot password? Click here to reset