A Deep Reinforced Model for Abstractive Summarization

05/11/2017
by   Romain Paulus, et al.
0

Attentional, RNN-based encoder-decoder models for abstractive summarization have achieved good performance on short input and output sequences. For longer documents and summaries however these models often include repetitive and incoherent phrases. We introduce a neural network model with a novel intra-attention that attends over the input and continuously generated output separately, and a new training method that combines standard supervised word prediction and reinforcement learning (RL). Models trained only with supervised learning often exhibit "exposure bias" - they assume ground truth is provided at each step during training. However, when standard word prediction is combined with the global sequence prediction training of RL the resulting summaries become more readable. We evaluate this model on the CNN/Daily Mail and New York Times datasets. Our model obtains a 41.16 ROUGE-1 score on the CNN/Daily Mail dataset, an improvement over previous state-of-the-art models. Human evaluation also shows that our model produces higher quality summaries.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/05/2021

Efficient Attentions for Long Document Summarization

The quadratic computational and memory complexities of large Transformer...
research
09/04/2019

An Entity-Driven Framework for Abstractive Summarization

Abstractive summarization systems aim to produce more coherent and conci...
research
04/07/2021

Spotify at TREC 2020: Genre-Aware Abstractive Podcast Summarization

This paper contains the description of our submissions to the summarizat...
research
12/11/2019

Quality of syntactic implication of RL-based sentence summarization

Work on summarization has explored both reinforcement learning (RL) opti...
research
08/13/2021

MeetSum: Transforming Meeting Transcript Summarization using Transformers!

Creating abstractive summaries from meeting transcripts has proven to be...
research
08/26/2021

Alleviating Exposure Bias via Contrastive Learning for Abstractive Text Summarization

Encoder-decoder models have achieved remarkable success in abstractive t...
research
07/03/2020

Abstractive and mixed summarization for long-single documents

The lack of diversity in the datasets available for automatic summarizat...

Please sign up or login with your details

Forgot password? Click here to reset