Systematically Exploring Redundancy Reduction in Summarizing Long Documents

11/30/2020
by   Wen Xiao, et al.
0

Our analysis of large summarization datasets indicates that redundancy is a very serious problem when summarizing long documents. Yet, redundancy reduction has not been thoroughly investigated in neural summarization. In this work, we systematically explore and compare different ways to deal with redundancy when summarizing long documents. Specifically, we organize the existing methods into categories based on when and how the redundancy is considered. Then, in the context of these categories, we propose three additional methods balancing non-redundancy and importance in a general and flexible way. In a series of experiments, we show that our proposed methods achieve the state-of-the-art with respect to ROUGE scores on two scientific paper datasets, Pubmed and arXiv, while reducing redundancy significantly.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/18/2022

GoSum: Extractive Summarization of Long Documents by Reinforcement Learning and Graph Organized discourse state

Handling long texts with structural information and excluding redundancy...
research
09/17/2019

Extractive Summarization of Long Documents by Combining Global and Local Context

In this paper, we propose a novel neural single document extractive summ...
research
04/13/2020

AREDSUM: Adaptive Redundancy-Aware Iterative Sentence Ranking for Extractive Document Summarization

Redundancy-aware extractive summarization systems score the redundancy o...
research
01/20/2016

Improved Spoken Document Summarization with Coverage Modeling Techniques

Extractive summarization aims at selecting a set of indicative sentences...
research
01/26/2018

A Formal Definition of Importance for Summarization

Research on summarization has mainly been driven by empirical approaches...
research
05/20/2022

On the Trade-off between Redundancy and Local Coherence in Summarization

Extractive summarization systems are known to produce poorly coherent an...
research
10/25/2017

On Component Redundancy Versus System Redundancy for a k-out-of-n System

Precedence order is a natural type of comparison for random variables in...

Please sign up or login with your details

Forgot password? Click here to reset