Towards Coherent Visual Storytelling with Ordered Image Attention

08/04/2021
by   Tom Braude, et al.
0

We address the problem of visual storytelling, i.e., generating a story for a given sequence of images. While each sentence of the story should describe a corresponding image, a coherent story also needs to be consistent and relate to both future and past images. To achieve this we develop ordered image attention (OIA). OIA models interactions between the sentence-corresponding image and important regions in other images of the sequence. To highlight the important objects, a message-passing-like algorithm collects representations of those objects in an order-aware manner. To generate the story's sentences, we then highlight important image attention vectors with an Image-Sentence Attention (ISA). Further, to alleviate common linguistic mistakes like repetitiveness, we introduce an adaptive prior. The obtained results improve the METEOR score on the VIST dataset by 1 coherency improvements and shows that OIA and ISA generated stories are more focused, shareable, and image-grounded.

READ FULL TEXT

page 2

page 3

page 8

research
05/28/2018

GLAC Net: GLocal Attention Cascading Networks for Multi-image Cued Story Generation

The task of multi-image cued story generation, such as visual storytelli...
research
01/20/2023

Visual Writing Prompts: Character-Grounded Story Generation with Curated Image Sequences

Current work on image-based story generation suffers from the fact that ...
research
05/30/2018

Using Inter-Sentence Diverse Beam Search to Reduce Redundancy in Visual Storytelling

Visual storytelling includes two important parts: coherence between the ...
research
05/21/2018

Hierarchically Structured Reinforcement Learning for Topically Coherent Visual Story Generation

We propose a hierarchically structured reinforcement learning approach t...
research
06/23/2016

Sort Story: Sorting Jumbled Images and Captions into Stories

Temporal common sense has applications in AI tasks such as QA, multi-doc...
research
11/24/2019

Neural Storyboard Artist: Visualizing Stories with Coherent Image Sequences

A storyboard is a sequence of images to illustrate a story containing mu...
research
10/06/2022

Vision Transformer Based Model for Describing a Set of Images as a Story

Visual Story-Telling is the process of forming a multi-sentence story fr...

Please sign up or login with your details

Forgot password? Click here to reset