Generating a Structured Summary of Numerous Academic Papers: Dataset and Method

02/09/2023
by Shuaiqi Liu, et al.

Writing a survey paper on a research topic usually requires covering the salient content of numerous related papers, which can be modeled as a multi-document summarization (MDS) task. Existing MDS datasets usually focus on producing an unstructured summary of only a few input documents, while previous work on structured summary generation focuses on summarizing a single document into a multi-section summary. Neither these datasets nor these methods meet the requirements of summarizing numerous academic papers into a structured summary. To address the scarcity of available data, we propose BigSurvey, the first large-scale dataset for generating comprehensive summaries of numerous academic papers on each topic. We collect target summaries from more than seven thousand survey papers and use the abstracts of their 430 thousand reference papers as input documents. To organize the diverse content from dozens of input documents and to process long text sequences efficiently, we propose a summarization method named category-based alignment and sparse transformer (CAST). Experimental results show that CAST outperforms various advanced summarization methods.
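
As an illustration of the task framing described above (many reference abstracts as input, one multi-section summary as target), the following is a minimal sketch. It is not the authors' data format or the CAST method: the class `SurveyExample`, the functions `build_source` and `build_target`, the `<doc>` separator, the per-document word budget, and the section names are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class SurveyExample:
    """Hypothetical container: many input abstracts, one sectioned target summary."""
    topic: str
    input_abstracts: List[str]
    # Maps a section name to that section's summary text (names are assumptions).
    target_sections: Dict[str, str] = field(default_factory=dict)


def build_source(example: SurveyExample,
                 max_words_per_doc: int = 50,
                 sep: str = " <doc> ") -> str:
    """Truncate each abstract to a word budget, then join them into one sequence.

    Truncation is a simple stand-in for keeping long multi-document inputs
    manageable; it is not the sparse-attention approach named in the abstract.
    """
    truncated = [
        " ".join(abstract.split()[:max_words_per_doc])
        for abstract in example.input_abstracts
    ]
    return sep.join(truncated)


def build_target(example: SurveyExample) -> str:
    """Serialize the structured summary as one 'section: text' line per section."""
    return "\n".join(
        f"{name}: {text}" for name, text in example.target_sections.items()
    )


if __name__ == "__main__":
    toy = SurveyExample(
        topic="multi-document summarization",
        input_abstracts=[
            "We study summarization of long documents with sparse attention.",
            "This paper introduces a dataset of scientific abstracts.",
        ],
        target_sections={
            "background": "Summarizing many papers is a multi-document task.",
            "method": "Prior work uses sparse attention and new datasets.",
        },
    )
    print(build_source(toy, max_words_per_doc=5))
    print(build_target(toy))
```

Keeping the target as a mapping from section name to text makes the structure of the summary explicit, which is the property that distinguishes this setting from unstructured MDS.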

research
10/23/2022

How "Multi" is Multi-Document Summarization?

The task of multi-document summarization (MDS) aims at models that, give...
research
02/08/2023

Long Text and Multi-Table Summarization: Dataset and Method

Automatic document summarization aims to produce a concise summary cover...
research
03/31/2023

ConceptEVA: Concept-Based Interactive Exploration and Customization of Document Summaries

With the most advanced natural language processing and artificial intell...
research
04/18/2021

Generating Related Work

Communicating new research ideas involves highlighting similarities and ...
research
04/13/2020

A Divide-and-Conquer Approach to the Summarization of Academic Articles

We present a novel divide-and-conquer method for the summarization of lo...
research
06/02/2022

TSTR: Too Short to Represent, Summarize with Details! Intro-Guided Extended Summary Generation

Many scientific papers such as those in arXiv and PubMed data collection...
research
09/19/2019

How to Write Summaries with Patterns? Learning towards Abstractive Summarization through Prototype Editing

Under special circumstances, summaries should conform to a particular st...
