SciSummPip: An Unsupervised Scientific Paper Summarization Pipeline

10/19/2020
by   Jiaxin Ju, et al.
0

The Scholarly Document Processing (SDP) workshop is to encourage more efforts on natural language understanding of scientific task. It contains three shared tasks and we participate in the LongSumm shared task. In this paper, we describe our text summarization system, SciSummPip, inspired by SummPip (Zhao et al., 2020) that is an unsupervised text summarization system for multi-document in news domain. Our SciSummPip includes a transformer-based language model SciBERT (Beltagy et al., 2019) for contextual sentence representation, content selection with PageRank (Page et al., 1999), sentence graph construction with both deep and linguistic information, sentence graph clustering and within-graph summary generation. Our work differs from previous method in that content selection and a summary length constraint is applied to adapt to the scientific domain. The experiment results on both training dataset and blind test dataset show the effectiveness of our method, and we empirically verify the robustness of modules used in SciSummPip with BERTScore (Zhang et al., 2019a).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/06/2017

Text Summarization using Abstract Meaning Representation

With an ever increasing size of text present on the Internet, automatic ...
research
03/15/2016

Unsupervised Ranking Model for Entity Coreference Resolution

Coreference resolution is one of the first stages in deep language under...
research
06/02/2021

Enriching Transformers with Structured Tensor-Product Representations for Abstractive Summarization

Abstractive summarization, the task of generating a concise summary of i...
research
03/29/2019

Keyphrase Generation: A Text Summarization Struggle

Authors' keyphrases assigned to scientific articles are essential for re...
research
07/25/2017

The RepEval 2017 Shared Task: Multi-Genre Natural Language Inference with Sentence Representations

This paper presents the results of the RepEval 2017 Shared Task, which e...
research
10/04/2021

Leveraging Information Bottleneck for Scientific Document Summarization

This paper presents an unsupervised extractive approach to summarize sci...
research
06/02/2018

Does the brain represent words? An evaluation of brain decoding studies of language understanding

Language decoding studies have identified word representations which can...

Please sign up or login with your details

Forgot password? Click here to reset