SciBERTSUM: Extractive Summarization for Scientific Documents

01/21/2022
by   Athar Sefid, et al.
0

The summarization literature focuses on the summarization of news articles. The news articles in the CNN-DailyMail are relatively short documents with about 30 sentences per document on average. We introduce SciBERTSUM, our summarization framework designed for the summarization of long documents like scientific papers with more than 500 sentences. SciBERTSUM extends BERTSUM to long documents by 1) adding a section embedding layer to include section information in the sentence vector and 2) applying a sparse attention mechanism where each sentences will attend locally to nearby sentences and only a small number of sentences attend globally to all other sentences. We used slides generated by the authors of scientific papers as reference summaries since they contain the technical details from the paper. The results show the superiority of our model in terms of ROUGE scores.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/22/2018

Neural Sentence Location Prediction for Summarization

A competitive baseline in sentence-level extractive summarization of new...
research
04/24/2018

Data-driven Summarization of Scientific Articles

Data-driven approaches to sequence-to-sequence modelling have been succe...
research
07/03/2020

Abstractive and mixed summarization for long-single documents

The lack of diversity in the datasets available for automatic summarizat...
research
11/16/2020

A Two-Phase Approach for Abstractive Podcast Summarization

Podcast summarization is different from summarization of other data form...
research
08/19/2022

Sparse Optimization for Unsupervised Extractive Summarization of Long Documents with the Frank-Wolfe Algorithm

We address the problem of unsupervised extractive document summarization...
research
10/21/2020

ReSCo-CC: Unsupervised Identification of Key Disinformation Sentences

Disinformation is often presented in long textual articles, especially w...
research
08/25/2020

Extractive Summarizer for Scholarly Articles

We introduce an extractive method that will summarize long scientific pa...

Please sign up or login with your details

Forgot password? Click here to reset