What comes next? Extractive summarization by next-sentence prediction

01/12/2019
by   Jingyun Liu, et al.
0

Existing approaches to automatic summarization assume that a length limit for the summary is given, and view content selection as an optimization problem to maximize informativeness and minimize redundancy within this budget. This framework ignores the fact that human-written summaries have rich internal structure which can be exploited to train a summarization system. We present NEXTSUM, a novel approach to summarization based on a model that predicts the next sentence to include in the summary using not only the source article, but also the summary produced so far. We show that such a model successfully captures summary-specific discourse moves, and leads to better content selection performance, in addition to automatically predicting how long the target summary should be. We perform experiments on the New York Times Annotated Corpus of summaries, where NEXTSUM outperforms lead and content-model summarization baselines by significant margins. We also show that the lengths of summaries produced by our system correlates with the lengths of the human-written gold standards.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/21/2022

Generating Multiple-Length Summaries via Reinforcement Learning for Unsupervised Sentence Summarization

Sentence summarization shortens given texts while maintaining core conte...
research
05/25/2022

Factorizing Content and Budget Decisions in Abstractive Summarization of Long Documents by Sampling Summary Views

We argue that disentangling content selection from the budget used to co...
research
05/04/2020

Exploring Content Selection in Summarization of Novel Chapters

We present a new summarization task, generating summaries of novel chapt...
research
11/13/2019

Towards Supervised Extractive Text Summarization via RNN-based Sequence Classification

This article briefly explains our submitted approach to the DocEng'19 co...
research
05/20/2022

On the Trade-off between Redundancy and Local Coherence in Summarization

Extractive summarization systems are known to produce poorly coherent an...
research
07/02/2019

Cooperative Generator-Discriminator Networks for Abstractive Summarization with Narrative Flow

We introduce Cooperative Generator-Discriminator Networks (Co-opNet), a ...
research
10/31/2017

Summarizing Dialogic Arguments from Social Media

Online argumentative dialog is a rich source of information on popular b...

Please sign up or login with your details

Forgot password? Click here to reset