Hierarchical Recurrent Neural Network for Video Summarization

04/28/2019
by   Bin Zhao, et al.
0

Exploiting the temporal dependency among video frames or subshots is very important for the task of video summarization. Practically, RNN is good at temporal dependency modeling, and has achieved overwhelming performance in many video-based tasks, such as video captioning and classification. However, RNN is not capable enough to handle the video summarization task, since traditional RNNs, including LSTM, can only deal with short videos, while the videos in the summarization task are usually in longer duration. To address this problem, we propose a hierarchical recurrent neural network for video summarization, called H-RNN in this paper. Specifically, it has two layers, where the first layer is utilized to encode short video subshots cut from the original video, and the final hidden state of each subshot is input to the second layer for calculating its confidence to be a key subshot. Compared to traditional RNNs, H-RNN is more suitable to video summarization, since it can exploit long temporal dependency among frames, meanwhile, the computation operations are significantly lessened. The results on two popular datasets, including the Combined dataset and VTW dataset, have demonstrated that the proposed H-RNN outperforms the state-of-the-arts.

READ FULL TEXT

page 4

page 6

research
09/22/2021

Hierarchical Multimodal Transformer to Summarize Videos

Although video summarization has achieved tremendous success benefiting ...
research
01/31/2018

Learning Video-Story Composition via Recurrent Neural Network

In this paper, we propose a learning-based method to compose a video-sto...
research
11/18/2019

Action Anticipation with RBF KernelizedFeature Mapping RNN

We introduce a novel Recurrent Neural Network-based algorithm for future...
research
11/18/2019

Action Anticipation with RBF Kernelized Feature Mapping RNN

We introduce a novel Recurrent Neural Network-based algorithm for future...
research
03/12/2022

Recurrence-in-Recurrence Networks for Video Deblurring

State-of-the-art video deblurring methods often adopt recurrent neural n...
research
07/03/2019

Compositional Structure Learning for Sequential Video Data

Conventional sequential learning methods such as Recurrent Neural Networ...
research
11/11/2015

Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning

Recently, deep learning approach, especially deep Convolutional Neural N...

Please sign up or login with your details

Forgot password? Click here to reset