Masked Autoencoder for Unsupervised Video Summarization

06/02/2023
by   Minho Shim, et al.

Summarizing a video requires a broad understanding of its content, from recognizing scenes to judging whether each frame is essential enough to be included in the summary. Self-supervised learning (SSL) is recognized for its robustness and its flexibility across multiple downstream tasks, but video SSL has not yet shown its value for dense understanding tasks such as video summarization. We claim that an unsupervised autoencoder with sufficient self-supervised learning needs no additional downstream architecture design or weight fine-tuning to serve as a video summarization model. The proposed method estimates the importance score of each frame from the reconstruction score of the autoencoder's decoder. We evaluate the method on major unsupervised video summarization benchmarks to show its effectiveness under various experimental settings.
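The idea of scoring frames by reconstruction can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the function names (toy_reconstruct, frame_importance, select_summary) are hypothetical, the "autoencoder" is a toy stand-in that predicts a masked frame as the mean of the visible frames, and the rule that a higher reconstruction error means a more important frame is one plausible reading of "reconstruction score", not necessarily the authors' definition.

```python
import numpy as np


def toy_reconstruct(features, masked_idx):
    """Hypothetical stand-in for a trained masked autoencoder.

    Predicts the masked frame as the mean of all visible frames.
    A real model would run an encoder and decoder here.
    """
    visible = np.delete(features, masked_idx, axis=0)
    return visible.mean(axis=0)


def frame_importance(features, reconstruct=toy_reconstruct):
    """Score each frame by how poorly it is reconstructed when masked.

    Assumption: frames that cannot be predicted from the rest of the
    video carry more unique content, so they receive a higher score.
    """
    scores = np.empty(len(features))
    for t in range(len(features)):
        predicted = reconstruct(features, t)
        # Mean squared error between the true and predicted frame features.
        scores[t] = np.mean((features[t] - predicted) ** 2)
    return scores


def select_summary(scores, budget):
    """Return the indices of the `budget` highest-scoring frames, in temporal order."""
    top = np.argsort(scores)[::-1][:budget]
    return np.sort(top)


if __name__ == "__main__":
    # Five frames with three-dimensional features; frame 2 is an outlier.
    feats = np.zeros((5, 3))
    feats[2] = 10.0
    scores = frame_importance(feats)
    print(select_summary(scores, budget=1))
```

Because reconstruct is passed in as a parameter, the toy predictor could be swapped for a call into a pretrained model without changing the scoring or selection steps.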

research
03/27/2022

How Severe is Benchmark-Sensitivity in Video Self-Supervised Learning?

Despite the recent success of video self-supervised learning, there is m...
research
04/07/2020

Query-controllable Video Summarization

When video collections become huge, how to explore both within and acros...
research
11/07/2022

On minimal variations for unsupervised representation learning

Unsupervised representation learning aims at describing raw data efficie...
research
03/28/2023

SELF-VS: Self-supervised Encoding Learning For Video Summarization

Despite its wide range of applications, video summarization is still hel...
research
09/16/2023

FrameRS: A Video Frame Compression Model Composed by Self supervised Video Frame Reconstructor and Key Frame Selector

In this paper, we present frame reconstruction model: FrameRS. It consis...
research
02/27/2023

EDMAE: An Efficient Decoupled Masked Autoencoder for Standard View Identification in Pediatric Echocardiography

We propose an efficient decoupled mask autoencoder (EDMAE) for standard ...
research
06/01/2023

Speech Self-Supervised Representation Benchmarking: Are We Doing it Right?

Self-supervised learning (SSL) has recently allowed leveraging large dat...
