SalSum: Saliency-based Video Summarization using Generative Adversarial Networks

11/20/2020
by   George Pantazis, et al.

The huge amount of video data produced daily by camera-based systems, such as surveillance, medical and telecommunication systems, creates a need for effective video summarization (VS) methods. These methods should be capable of creating an overview of the video content. In this paper, we propose a novel VS method based on a Generative Adversarial Network (GAN) model pre-trained with human eye fixations. The main contribution of the proposed method is that it can provide perceptually compatible video summaries by combining both perceived color and spatiotemporal visual attention cues in an unsupervised scheme. Several fusion approaches are considered to provide robustness under uncertainty and to support personalization. The proposed method is evaluated in comparison to state-of-the-art VS approaches on the benchmark dataset VSUMM. The experimental results indicate that SalSum outperforms the state-of-the-art approaches, achieving the highest F-measure score on the VSUMM benchmark.
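
For readers unfamiliar with the reported metric, the F-measure is the harmonic mean of precision and recall. The following is a minimal sketch of that standard formula applied to keyframe summaries. It is not the paper's evaluation code. The function name and arguments are hypothetical, and it assumes the number of automatic keyframes matched to a user summary has already been determined by some frame-matching rule that the abstract does not specify.

```python
def f_measure(num_matched: int, num_auto: int, num_user: int) -> float:
    """Harmonic mean of precision and recall for one automatic summary.

    num_matched: automatic keyframes matched to a user-summary keyframe
                 (the matching rule itself is assumed, not shown here)
    num_auto:    total keyframes in the automatic summary
    num_user:    total keyframes in the user (ground-truth) summary
    """
    if num_auto == 0 or num_user == 0:
        return 0.0
    precision = num_matched / num_auto  # fraction of automatic keyframes that are correct
    recall = num_matched / num_user     # fraction of user keyframes that were recovered
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Example: 3 of 4 automatic keyframes match a 6-keyframe user summary.
score = f_measure(num_matched=3, num_auto=4, num_user=6)
```

When several user summaries exist per video, one common choice is to average this score over them; whether the paper does so is not stated in the abstract.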

research
07/16/2023

Self-Attention Based Generative Adversarial Networks For Unsupervised Video Summarization

In this paper, we study the problem of producing a comprehensive video s...
research
07/17/2018

Query-Conditioned Three-Player Adversarial Network for Video Summarization

Video summarization plays an important role in video understanding by se...
research
11/23/2020

SCGAN: Saliency Map-guided Colorization with Generative Adversarial Network

Given a grayscale photograph, the colorization system estimates a visual...
research
05/26/2021

Unsupervised Video Summarization via Multi-source Features

Video summarization aims at generating a compact yet representative visu...
research
05/24/2021

Unsupervised Video Summarization with a Convolutional Attentive Adversarial Network

With the explosive growth of video data, video summarization, which atte...
research
01/11/2023

VS-Net: Multiscale Spatiotemporal Features for Lightweight Video Salient Document Detection

Video Salient Document Detection (VSDD) is an essential task of practica...
research
09/06/2021

ERA: Entity Relationship Aware Video Summarization with Wasserstein GAN

Video summarization aims to simplify large scale video browsing by gener...
