Clickbait? Sensational Headline Generation with Auto-tuned Reinforcement Learning

09/09/2019
by   Peng Xu, et al.
0

Sensational headlines are headlines that capture people's attention and generate reader interest. Conventional abstractive headline generation methods, unlike human writers, do not optimize for maximal reader attention. In this paper, we propose a model that generates sensational headlines without labeled data. We first train a sensationalism scorer by classifying online headlines with many comments ("clickbait") against a baseline of headlines generated from a summarization model. The score from the sensationalism scorer is used as the reward for a reinforcement learner. However, maximizing the noisy sensationalism reward will generate unnatural phrases instead of sensational headlines. To effectively leverage this noisy reward, we propose a novel loss function, Auto-tuned Reinforcement Learning (ARL), to dynamically balance reinforcement learning (RL) with maximum likelihood estimation (MLE). Human evaluation shows that 60.8 which is significantly better than the Pointer-Gen baseline and other RL models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/19/2019

A novel repetition normalized adversarial reward for headline generation

While reinforcement learning can effectively improve language generation...
research
12/19/2022

Inverse Reinforcement Learning for Text Summarization

Current state-of-the-art summarization models are trained with either ma...
research
02/08/2020

RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement Learning

This paper presents a deep reinforcement learning algorithm for online a...
research
09/27/2018

Controllable Neural Story Generation via Reinforcement Learning

Open story generation is the problem of automatically creating a story f...
research
04/18/2021

Keyphrase Generation with Fine-Grained Evaluation-Guided Reinforcement Learning

Aiming to generate a set of keyphrases, Keyphrase Generation (KG) is a c...
research
08/31/2019

Deep Reinforcement Learning with Distributional Semantic Rewards for Abstractive Summarization

Deep reinforcement learning (RL) has been a commonly-used strategy for t...
research
09/12/2018

Automatic, Personalized, and Flexible Playlist Generation using Reinforcement Learning

Songs can be well arranged by professional music curators to form a rive...

Please sign up or login with your details

Forgot password? Click here to reset