Video Summarization with Long Short-term Memory

05/26/2016
by   Ke Zhang, et al.
0

We propose a novel supervised learning technique for summarizing videos by automatically selecting keyframes or key subshots. Casting the problem as a structured prediction problem on sequential data, our main idea is to use Long Short-Term Memory (LSTM), a special type of recurrent neural networks to model the variable-range dependencies entailed in the task of video summarization. Our learning models attain the state-of-the-art results on two benchmark video datasets. Detailed analysis justifies the design of the models. In particular, we show that it is crucial to take into consideration the sequential structures in videos and model them. Besides advances in modeling techniques, we introduce techniques to address the need of a large number of annotated data for training complex learning models. There, our main idea is to exploit the existence of auxiliary annotated video datasets, albeit heterogeneous in visual styles and contents. Specifically, we show domain adaptation techniques can improve summarization by reducing the discrepancies in statistical properties across those datasets.

READ FULL TEXT

page 13

page 14

research
07/14/2017

Simplified Long Short-term Memory Recurrent Neural Networks: part II

This is part II of three-part work. Here, we present a second set of int...
research
11/24/2017

For Your Eyes Only: Learning to Summarize First-Person Videos

With the increasing amount of video data, it is desirable to highlight o...
research
10/06/2015

Unsupervised Extraction of Video Highlights Via Robust Recurrent Auto-encoders

With the growing popularity of short-form video sharing platforms such a...
research
11/28/2020

Movement Tracks for the Automatic Detection of Fish Behavior in Videos

Global warming is predicted to profoundly impact ocean ecosystems. Fish ...
research
05/10/2021

Reconstructive Sequence-Graph Network for Video Summarization

Exploiting the inner-shot and inter-shot dependencies is essential for k...
research
02/19/2018

LSTM stack-based Neural Multi-sequence Alignment TeCHnique (NeuMATCH)

The alignment of heterogeneous sequential data (video to text) is an imp...
research
06/15/2021

On the Evaluation of Sequential Machine Learning for Network Intrusion Detection

Recent advances in deep learning renewed the research interests in machi...

Please sign up or login with your details

Forgot password? Click here to reset