SELF-VS: Self-supervised Encoding Learning For Video Summarization

03/28/2023
by   Hojjat Mokhtarabadi, et al.

Despite its wide range of applications, video summarization is still held back by the scarcity of extensive datasets, largely due to the labor-intensive and costly nature of frame-level annotations. As a result, existing video summarization methods are prone to overfitting. To mitigate this challenge, we propose a novel self-supervised video representation learning method that uses knowledge distillation to pre-train a transformer encoder. Our method matches the encoder's semantic video representation, which is constructed with respect to frame importance scores, to a representation derived from a CNN trained on video classification. Empirical evaluations on correlation-based metrics, such as Kendall's τ and Spearman's ρ, demonstrate the superiority of our approach over existing state-of-the-art methods in assigning relative scores to the input frames.
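
As a rough illustration of the correlation-based evaluation mentioned in the abstract, the sketch below computes Kendall's τ and Spearman's ρ between predicted frame importance scores and reference annotations. It is a minimal pure-Python sketch and not the authors' evaluation code: the function names and sample scores are hypothetical, and it uses the simple tau-a variant, whereas library routines such as `scipy.stats.kendalltau` default to the tie-adjusted tau-b.

```python
from itertools import combinations


def kendall_tau(x, y):
    """Kendall's tau-a: (concordant - discordant) pairs over all pairs.

    Tied pairs count as neither concordant nor discordant (an assumption
    of this sketch; tau-b would adjust the denominator for ties).
    """
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("inputs must have equal length of at least 2")
    balance = 0
    for i, j in combinations(range(len(x)), 2):
        prod = (x[i] - x[j]) * (y[i] - y[j])
        balance += (prod > 0) - (prod < 0)
    n = len(x)
    return balance / (n * (n - 1) / 2)


def average_ranks(values):
    """Return 1-based ranks, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        mean_rank = (pos + end) / 2 + 1
        for k in range(pos, end + 1):
            ranks[order[k]] = mean_rank
        pos = end + 1
    return ranks


def pearson(a, b):
    """Pearson correlation of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((p - mean_a) * (q - mean_b) for p, q in zip(a, b))
    var_a = sum((p - mean_a) ** 2 for p in a)
    var_b = sum((q - mean_b) ** 2 for q in b)
    return cov / (var_a * var_b) ** 0.5


def spearman_rho(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical per-frame scores: model predictions vs. human annotations.
predicted = [0.10, 0.40, 0.35, 0.80, 0.60]
annotated = [1, 3, 2, 5, 4]
tau = kendall_tau(predicted, annotated)
rho = spearman_rho(predicted, annotated)
```

Both metrics depend only on the relative ordering of the scores, not on their absolute values, which is why they suit a task framed as assigning relative scores to frames.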

research
09/26/2021

A Video Summarization Method Using Temporal Interest Detection and Key Frame Prediction

In this paper, a Video Summarization Method using Temporal Interest Dete...
research
01/07/2022

Video Summarization Based on Video-text Modelling

Modern video summarization methods are based on deep neural networks whi...
research
01/05/2023

EgoDistill: Egocentric Head Motion Distillation for Efficient Video Understanding

Recent advances in egocentric video understanding models are promising, ...
research
09/16/2023

FrameRS: A Video Frame Compression Model Composed by Self supervised Video Frame Reconstructor and Key Frame Selector

In this paper, we present frame reconstruction model: FrameRS. It consis...
research
06/02/2023

Masked Autoencoder for Unsupervised Video Summarization

Summarizing a video requires a diverse understanding of the video, rangi...
research
08/23/2017

CNN-Based Prediction of Frame-Level Shot Importance for Video Summarization

In the Internet, ubiquitous presence of redundant, unedited, raw videos ...
research
11/18/2022

Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization

Video summarization aims to select the most informative subset of frames...
