Controllable Neural Story Generation via Reinforcement Learning

09/27/2018
by   Pradyumna Tambwekar, et al.

Open story generation is the problem of automatically creating a story for any domain without retraining. Neural language models can be trained on large corpora across many domains and then used to generate stories. However, stories generated via language models tend to lack direction and coherence. We introduce a policy gradient reinforcement learning approach to open story generation that learns to achieve a given narrative goal state. In this work, the goal is for a story to end with a specific type of event, given in advance. However, a reward based on achieving the given goal is too sparse for effective learning. We use reward shaping to provide the reinforcement learner with a partial reward at every step. We show that our technique can train a model that generates a story that reaches the goal 94% of the time and reduces model perplexity. A human subject evaluation shows that stories generated by our technique are perceived to have significantly higher plausible event ordering and plot coherence over a baseline language modeling technique without perceived degradation of overall quality, enjoyability, or local causality.
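
To make the idea of a policy gradient learner with a shaped, per-step reward concrete, here is a minimal toy sketch. It is not the authors' implementation: the event vocabulary, the tabular policy, the distance-based reward in `shaped_reward`, and all hyperparameters are assumptions chosen only for illustration. It shows a REINFORCE-style update in which every generated event earns a partial reward, instead of a single sparse reward when the story ends on the goal event.

```python
import math
import random

# Hypothetical event vocabulary; the goal is for a story to end on "marry".
EVENTS = ["meet", "travel", "argue", "reconcile", "marry"]
GOAL = 4      # index of the goal event
HORIZON = 4   # number of events generated per story


def softmax(logits):
    """Convert a list of logits into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def shaped_reward(event, goal=GOAL):
    """Assumed partial reward: grows as the event index nears the goal."""
    return 1.0 / (1.0 + abs(goal - event))


def discounted_returns(rewards, gamma=0.9):
    """Return G_t = r_t + gamma * G_{t+1} for every step t."""
    returns, running = [], 0.0
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    return returns[::-1]


def sample_story(logits, rng, start=0):
    """Sample HORIZON events; the policy conditions on the previous event."""
    state, steps = start, []
    for _ in range(HORIZON):
        probs = softmax(logits[state])
        action = rng.choices(range(len(EVENTS)), weights=probs)[0]
        steps.append((state, action, probs))
        state = action
    return steps


def train(episodes=2000, lr=0.1, seed=0):
    """REINFORCE on a tabular softmax policy with per-step shaped rewards."""
    rng = random.Random(seed)
    n = len(EVENTS)
    logits = [[0.0] * n for _ in range(n)]
    for _ in range(episodes):
        steps = sample_story(logits, rng)
        returns = discounted_returns([shaped_reward(a) for _, a, _ in steps])
        for (state, action, probs), g in zip(steps, returns):
            for k in range(n):
                # Gradient of log softmax: indicator(k == action) - p_k.
                grad = (1.0 if k == action else 0.0) - probs[k]
                logits[state][k] += lr * g * grad
    return logits
```

Calling `train()` returns the learned logits table; `sample_story(logits, random.Random(1))` then draws a story from the trained policy. Replacing `shaped_reward` with a function that pays out only when the final event equals `GOAL` gives the sparse-reward variant for comparison.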

research
12/16/2021

Goal-Directed Story Generation: Augmenting Generative Language Models with Reinforcement Learning

The advent of large pre-trained generative language models has provided ...
research
12/16/2022

Neural Story Planning

Automated plot generation is the challenge of generating a sequence of e...
research
01/24/2023

Can Very Large Pretrained Language Models Learn Storytelling With A Few Examples?

While pre-trained language models can generate individually fluent sente...
research
06/05/2017

Event Representations for Automated Story Generation with Deep Neural Nets

Automated story generation is the problem of automatically selecting a s...
research
09/09/2019

Clickbait? Sensational Headline Generation with Auto-tuned Reinforcement Learning

Sensational headlines are headlines that capture people's attention and ...
research
05/28/2023

Reward Collapse in Aligning Large Language Models

The extraordinary capabilities of large language models (LLMs) such as C...
research
09/12/2018

Automatic, Personalized, and Flexible Playlist Generation using Reinforcement Learning

Songs can be well arranged by professional music curators to form a rive...
