GoSum: Extractive Summarization of Long Documents by Reinforcement Learning and Graph Organized discourse state

11/18/2022
by   Junyi Bian, et al.
1

Handling long texts with structural information and excluding redundancy between summary sentences are essential in extractive document summarization. In this work, we propose GoSum, a novel reinforcement-learning-based extractive model for long-paper summarization. GoSum encodes states by building a heterogeneous graph from different discourse levels for each input document. We evaluate the model on two datasets of scientific articles summarization: PubMed and arXiv where it outperforms all extractive summarization models and most of the strong abstractive baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/16/2018

A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents

Neural abstractive summarization models have led to promising results in...
research
02/23/2018

Ranking Sentences for Extractive Summarization with Reinforcement Learning

Single document summarization is the task of producing a shorter version...
research
10/30/2019

Discourse-Aware Neural Extractive Model for Text Summarization

Recently BERT has been adopted in state-of-the-art text summarization mo...
research
12/06/2022

Document-Level Abstractive Summarization

The task of automatic text summarization produces a concise and fluent t...
research
08/19/2017

Measuring the Effect of Discourse Relations on Blog Summarization

The work presented in this paper attempts to evaluate and quantify the u...
research
11/30/2020

Systematically Exploring Redundancy Reduction in Summarizing Long Documents

Our analysis of large summarization datasets indicates that redundancy i...
research
07/19/2021

MemSum: Extractive Summarization of Long Documents using Multi-step Episodic Markov Decision Processes

We introduce MemSum (Multi-step Episodic Markov decision process extract...

Please sign up or login with your details

Forgot password? Click here to reset