EDU-level Extractive Summarization with Varying Summary Lengths

10/08/2022
by   Yuping Wu, et al.
0

Extractive models usually formulate text summarization as extracting top-k important sentences from document as summary. Few work exploited extracting finer-grained Elementary Discourse Unit (EDU) and there is little analysis and justification for the extractive unit selection. To fill such a gap, this paper firstly conducts oracle analysis to compare the upper bound of performance for models based on EDUs and sentences. The analysis provides evidences from both theoretical and experimental perspectives to justify that EDUs make more concise and precise summary than sentences without losing salient information. Then, considering this merit of EDUs, this paper further proposes EDU-level extractive model with Varying summary Lengths (EDU-VL) and develops the corresponding learning algorithm. EDU-VL learns to encode and predict probabilities of EDUs in document, and encode EDU-level candidate summaries with different lengths based on various k values and select the best candidate summary in an end-to-end training manner. Finally, the proposed and developed approach is experimented on single and multi-document benchmark datasets and shows the improved performances in comparison with the state-of-the-art models.

READ FULL TEXT
research
08/25/2017

Revisiting the Centroid-based Method: A Strong Baseline for Multi-Document Summarization

The centroid-based model for extractive document summarization is a simp...
research
05/02/2023

DiffuSum: Generation Enhanced Extractive Summarization with Diffusion

Extractive summarization aims to form a summary by directly extracting s...
research
03/26/2019

Document Similarity for Texts of Varying Lengths via Hidden Topics

Measuring similarity between texts is an important task for several appl...
research
12/14/2021

Reinforcing Semantic-Symmetry for Document Summarization

Document summarization condenses a long document into a short version wi...
research
06/13/2019

Unsupervised Neural Single-Document Summarization of Reviews via Learning Latent Discourse Structure and its Ranking

This paper focuses on the end-to-end abstractive summarization of a sing...
research
04/19/2020

Extractive Summarization as Text Matching

This paper creates a paradigm shift with regard to the way we build neur...
research
03/15/2022

Differentiable Multi-Agent Actor-Critic for Multi-Step Radiology Report Summarization

The IMPRESSIONS section of a radiology report about an imaging study is ...

Please sign up or login with your details

Forgot password? Click here to reset