Leveraging Information Bottleneck for Scientific Document Summarization

10/04/2021
by   Jiaxin Ju, et al.
0

This paper presents an unsupervised extractive approach to summarize scientific long documents based on the Information Bottleneck principle. Inspired by previous work which uses the Information Bottleneck principle for sentence compression, we extend it to document level summarization with two separate steps. In the first step, we use signal(s) as queries to retrieve the key content from the source document. Then, a pre-trained language model conducts further sentence search and edit to return the final extracted summaries. Importantly, our work can be flexibly extended to a multi-view framework by different signals. Automatic evaluation on three scientific document datasets verifies the effectiveness of the proposed framework. The further human evaluation suggests that the extracted summaries cover more content aspects than previous systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/16/2019

BottleSum: Unsupervised and Self-supervised Sentence Summarization using the Information Bottleneck Principle

The principle of the Information Bottleneck (Tishby et al. 1999) is to p...
research
07/17/2020

SummPip: Unsupervised Multi-Document Summarization with Sentence Graph Compression

Obtaining training data for multi-document summarization (MDS) is time c...
research
06/02/2022

TSTR: Too Short to Represent, Summarize with Details! Intro-Guided Extended Summary Generation

Many scientific papers such as those in arXiv and PubMed data collection...
research
10/19/2020

SciSummPip: An Unsupervised Scientific Paper Summarization Pipeline

The Scholarly Document Processing (SDP) workshop is to encourage more ef...
research
05/14/2021

EASE: Extractive-Abstractive Summarization with Explanations

Current abstractive summarization systems outperform their extractive co...
research
04/22/2018

Neural Sentence Location Prediction for Summarization

A competitive baseline in sentence-level extractive summarization of new...
research
06/26/2019

User-Oriented Summaries Using a PSO Based Scoring Optimization Method

Automatic text summarization tools have a great impact on many fields, s...

Please sign up or login with your details

Forgot password? Click here to reset