Hierarchically-Attentive RNN for Album Summarization and Storytelling

08/09/2017
by   Licheng Yu, et al.
0

We address the problem of end-to-end visual storytelling. Given a photo album, our model first selects the most representative (summary) photos, and then composes a natural language story for the album. For this task, we make use of the Visual Storytelling dataset and a model composed of three hierarchically-attentive Recurrent Neural Nets (RNNs) to: encode the album photos, select representative (summary) photos, and compose the story. Automatic and human evaluations show our model achieves better performance on selection, generation, and retrieval than baselines.

READ FULL TEXT

page 3

page 5

page 6

research
02/03/2020

Hide-and-Tell: Learning to Bridge Photo Streams for Visual Storytelling

Visual storytelling is a task of creating a short story based on photo s...
research
12/03/2019

Knowledge-Enriched Visual Storytelling

Stories are diverse and highly personalized, resulting in a large possib...
research
04/13/2016

Visual Storytelling

We introduce the first dataset for sequential vision-to-language, and ex...
research
04/14/2016

Learning Visual Storylines with Skipping Recurrent Neural Networks

What does a typical visit to Paris look like? Do people first take photo...
research
09/11/2019

What Makes A Good Story? Designing Composite Rewards for Visual Storytelling

Previous storytelling approaches mostly focused on optimizing traditiona...
research
12/01/2022

DeclutterCam: A Photographic Assistant System with Clutter Detection and Removal

Photographs convey the stories of photographers to the audience. However...
research
06/02/2016

Storytelling of Photo Stream with Bidirectional Multi-thread Recurrent Neural Network

Visual storytelling aims to generate human-level narrative language (i.e...

Please sign up or login with your details

Forgot password? Click here to reset