Learning Visual Storylines with Skipping Recurrent Neural Networks

04/14/2016
by   Gunnar A. Sigurdsson, et al.

What does a typical visit to Paris look like? Do people first take photos of the Louvre and then the Eiffel Tower? Can we visually model a temporal event like "Paris Vacation" using current frameworks? In this paper, we explore how we can automatically learn the temporal aspects, or storylines, of visual concepts from web data. Previous attempts focus on consecutive image-to-image transitions and are unsuccessful at recovering the long-term underlying story. Our novel Skipping Recurrent Neural Network (S-RNN) model does not attempt to predict each and every data point in the sequence, as classic RNNs do. Rather, S-RNN uses a framework that skips through the images in the photo stream to explore the space of all ordered subsets of the albums via an efficient sampling procedure. This approach reduces the negative impact of strong short-term correlations and recovers the latent story more accurately. We show how our learned storylines can be used to analyze, predict, and summarize photo albums from Flickr. Our experimental results provide strong qualitative and quantitative evidence that S-RNN is significantly better than other candidate methods such as LSTMs at learning long-term correlations and recovering latent storylines. Moreover, we show how storylines can help machines better understand and summarize photo streams by inferring a brief personalized story of each individual album.
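
To make the notion of an "ordered subset" of an album concrete, here is a minimal Python sketch. It is not the S-RNN sampling procedure, which the abstract does not specify. It only draws a uniformly random subset of photos while preserving their original order. The function name `sample_ordered_subset`, the parameter `k`, and the toy album labels are all assumptions made for illustration.

```python
import random


def sample_ordered_subset(album, k, rng=None):
    """Pick k items from album uniformly at random, keeping album order.

    Hypothetical helper for illustration only; the paper's sampler is
    guided by the model, whereas this one is purely uniform.
    """
    rng = rng or random.Random()
    if not 0 <= k <= len(album):
        raise ValueError("k must be between 0 and len(album)")
    # Sample k distinct positions, then sort them so the chosen photos
    # appear in the same order as in the original photo stream.
    indices = sorted(rng.sample(range(len(album)), k))
    return [album[i] for i in indices]


# Toy album: labels stand in for photos (assumed data, not from the paper).
album = ["airport", "hotel", "louvre", "cafe", "eiffel_tower", "seine", "dinner"]
storyline = sample_ordered_subset(album, 3, random.Random(0))
```

Each call skips over most photos and returns a short candidate sequence. A model in the spirit of the abstract would score or learn from many such sequences rather than from every consecutive pair of images.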

research
06/02/2016

Storytelling of Photo Stream with Bidirectional Multi-thread Recurrent Neural Network

Visual storytelling aims to generate human-level narrative language (i.e...
research
02/03/2020

Hide-and-Tell: Learning to Bridge Photo Streams for Visual Storytelling

Visual storytelling is a task of creating a short story based on photo s...
research
08/09/2017

Hierarchically-Attentive RNN for Album Summarization and Storytelling

We address the problem of end-to-end visual storytelling. Given a photo ...
research
01/31/2018

Learning Video-Story Composition via Recurrent Neural Network

In this paper, we propose a learning-based method to compose a video-sto...
research
09/11/2018

Long-Term Occupancy Grid Prediction Using Recurrent Neural Networks

We tackle the long-term prediction of scene evolution in a complex downt...
research
04/24/2017

k-FFNN: A priori knowledge infused Feed-forward Neural Networks

Recurrent neural networks (RNNs) are being extensively used over feed-forw...
research
07/18/2018

General Value Function Networks

In this paper we show that restricting the representation-layer of a Rec...
