Every picture tells a story: Image-grounded controllable stylistic story generation

09/04/2022
by   Holy Lovenia, et al.
8

Generating a short story out of an image is arduous. Unlike image captioning, story generation from an image poses multiple challenges: preserving the story coherence, appropriately assessing the quality of the story, steering the generated story into a certain style, and addressing the scarcity of image-story pair reference datasets limiting supervision during training. In this work, we introduce Plug-and-Play Story Teller (PPST) and improve image-to-story generation by: 1) alleviating the data scarcity problem by incorporating large pre-trained models, namely CLIP and GPT-2, to facilitate a fluent image-to-text generation with minimal supervision, and 2) enabling a more style-relevant generation by incorporating stylistic adapters to control the story generation. We conduct image-to-story generation experiments with non-styled, romance-styled, and action-styled PPST approaches and compare our generated stories with those of previous work over three aspects, i.e., story coherence, image-story relevance, and style fitness, using both automatic and human evaluation. The results show that PPST improves story coherence and has better image-story relevance, but has yet to be adequately stylistic.

READ FULL TEXT

page 1

page 8

research
05/18/2021

Stylized Story Generation with Style-Guided Planning

Current storytelling systems focus more ongenerating stories with cohere...
research
10/22/2022

EtriCA: Event-Triggered Context-Aware Story Generation Augmented by Cross Attention

One of the key challenges of automatic story generation is how to genera...
research
12/20/2022

DOC: Improving Long Story Coherence With Detailed Outline Control

We propose the Detailed Outline Control (DOC) framework for improving lo...
research
09/15/2019

Induction and Reference of Entities in a Visual Story

We are enveloped by stories of visual interpretations in our everyday li...
research
08/24/2017

M2D: Monolog to Dialog Generation for Conversational Story Telling

Storytelling serves many different social functions, e.g. stories are us...
research
12/14/2021

Sentiment Dynamics of Success: Fractal Scaling of Story Arcs Predicts Reader Preferences

We explore the correlation between the sentiment arcs of H. C. Andersen'...
research
11/16/2021

Film Trailer Generation via Task Decomposition

Movie trailers perform multiple functions: they introduce viewers to the...

Please sign up or login with your details

Forgot password? Click here to reset