Neural Sentence Location Prediction for Summarization

04/22/2018
by   Tanner A. Bohn, et al.
0

A competitive baseline in sentence-level extractive summarization of news articles is the Lead-3 heuristic, where only the first 3 sentences are extracted. The success of this method is due to the tendency for writers to implement progressive elaboration in their work by writing the most important content at the beginning. In this paper, we introduce the Lead-like Recognizer (LeadR) to show how the Lead heuristic can be extended to summarize multi-section documents where it would not usually work well. This is done by introducing a neural model which produces a probability distribution over positions for sentences, so that we can locate sentences with introduction-like qualities. To evaluate the performance of our model, we use the task of summarizing multi-section documents. LeadR outperforms several baselines on this task, including a simple extension of the Lead heuristic designed for the task. Our work suggests that predicted position is a strong feature to use when extracting summaries.

READ FULL TEXT
research
01/21/2022

SciBERTSUM: Extractive Summarization for Scientific Documents

The summarization literature focuses on the summarization of news articl...
research
09/08/2019

Countering the Effects of Lead Bias in News Summarization via Multi-Stage Training and Auxiliary Losses

Sentence position is a strong feature for news summarization, since the ...
research
07/17/2020

SummPip: Unsupervised Multi-Document Summarization with Sentence Graph Compression

Obtaining training data for multi-document summarization (MDS) is time c...
research
10/04/2021

Leveraging Information Bottleneck for Scientific Document Summarization

This paper presents an unsupervised extractive approach to summarize sci...
research
05/29/2021

Demoting the Lead Bias in News Summarization via Alternating Adversarial Learning

In news articles the lead bias is a common phenomenon that usually domin...
research
10/28/2022

Toward Unifying Text Segmentation and Long Document Summarization

Text segmentation is important for signaling a document's structure. Wit...
research
07/25/2018

A Novel ILP Framework for Summarizing Content with High Lexical Variety

Summarizing content contributed by individuals can be challenging, becau...

Please sign up or login with your details

Forgot password? Click here to reset