Incorporating Textual Evidence in Visual Storytelling

11/21/2019
by   Tianyi Li, et al.
0

Previous work on visual storytelling mainly focused on exploring image sequence as evidence for storytelling and neglected textual evidence for guiding story generation. Motivated by human storytelling process which recalls stories for familiar images, we exploit textual evidence from similar images to help generate coherent and meaningful stories. To pick the images which may provide textual experience, we propose a two-step ranking method based on image object recognition techniques. To utilize textual information, we design an extended Seq2Seq model with two-channel encoder and attention. Experiments on the VIST dataset show that our method outperforms state-of-the-art baseline models without heavy engineering.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/08/2023

Zero-shot Generation of Coherent Storybook from Plain Text Story using Diffusion Models

Recent advancements in large scale text-to-image models have opened new ...
research
01/20/2023

Visual Writing Prompts: Character-Grounded Story Generation with Curated Image Sequences

Current work on image-based story generation suffers from the fact that ...
research
02/02/2016

Are Elephants Bigger than Butterflies? Reasoning about Sizes of Objects

Human vision greatly benefits from the information about sizes of object...
research
07/12/2020

Sparse Graph to Sequence Learning for Vision Conditioned Long Textual Sequence Generation

Generating longer textual sequences when conditioned on the visual infor...
research
01/27/2023

Reading and Reasoning over Chart Images for Evidence-based Automated Fact-Checking

Evidence data for automated fact-checking (AFC) can be in multiple modal...
research
04/03/2023

Enhancing Clinical Evidence Recommendation with Multi-Channel Heterogeneous Learning on Evidence Graphs

Clinical evidence encompasses the associations and impacts between patie...
research
10/26/2020

Reading Between the Lines: Exploring Infilling in Visual Narratives

Generating long form narratives such as stories and procedures from mult...

Please sign up or login with your details

Forgot password? Click here to reset