A Memory Network Approach for Story-based Temporal Summarization of 360° Videos

05/08/2018
by   Sangho Lee, et al.
0

We address the problem of story-based temporal summarization of long 360 videos. We propose a novel memory network model named Past-Future Memory Network (PFMN), in which we first compute the scores of 81 normal field of view (NFOV) region proposals cropped from the input 360 video, and then recover a latent, collective summary using the network with two external memories that store the embeddings of previously selected subshots and future candidate subshots. Our major contributions are two-fold. First, our work is the first to address story-based temporal summarization of 360 videos. Second, our model is the first attempt to leverage memory networks for video summarization tasks. For evaluation, we perform three sets of experiments. First, we investigate the view selection capability of our model on the Pano2Vid dataset. Second, we evaluate the temporal summarization with a newly collected 360 video dataset. Finally, we experiment our model's performance in another domain, with image-based storytelling VIST dataset. We verify that our model achieves state-of-the-art performance on all the tasks.

READ FULL TEXT

page 4

page 8

research
10/14/2019

TruNet: Short Videos Generation from Long Videos via Story-Preserving Truncation

In this work, we introduce a new problem, named as story-preserving lon...
research
01/31/2018

A Deep Ranking Model for Spatio-Temporal Highlight Detection from a 360 Video

We address the problem of highlight detection from a 360 degree video by...
research
11/24/2018

Discriminative Feature Learning for Unsupervised Video Summarization

In this paper, we address the problem of unsupervised video summarizatio...
research
07/16/2017

Query-Focused Video Summarization: Dataset, Evaluation, and A Memory Network Based Approach

Recent years have witnessed a resurgence of interest in video summarizat...
research
07/25/2018

Video Storytelling

Bridging vision and natural language is a longstanding goal in computer ...
research
10/05/2016

Recognizing and Presenting the Storytelling Video Structure with Deep Multimodal Networks

This paper presents a novel approach for temporal and semantic segmentat...
research
11/02/2018

Abstractive Summarization of Reddit Posts with Multi-level Memory Networks

We address the problem of abstractive summarization in two directions: p...

Please sign up or login with your details

Forgot password? Click here to reset