Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents

05/31/2021
by Rui Meng, et al.

Faceted summarization provides briefings of a document from different perspectives. With the help of a structured outline, readers can quickly comprehend the main points of a long document. However, little research has been conducted on this subject, partly because of the lack of large-scale faceted summarization datasets. In this study, we present FacetSum, a faceted summarization benchmark built on Emerald journal articles that covers a diverse range of domains. Unlike traditional document-summary pairs, FacetSum provides multiple summaries, each targeted at specific sections of a long document, including the purpose, method, findings, and value. Analyses and empirical results on our dataset reveal the importance of bringing structure into summaries. We believe FacetSum will spur further advances in summarization research and foster the development of NLP systems that can leverage the structured information in both long texts and summaries.
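
To make the contrast with a single document-summary pair concrete, the following is a minimal sketch of how one faceted example might be represented. The class name `FacetedExample`, its fields, and the placeholder strings are hypothetical and chosen for illustration only; they are not the released FacetSum schema. Only the four facet names come from the abstract.

```python
from dataclasses import dataclass, field

# The four facets named in the abstract.
FACETS = ("purpose", "method", "findings", "value")


@dataclass
class FacetedExample:
    """Hypothetical record: one long document with one summary per facet."""
    document: str
    summaries: dict = field(default_factory=dict)  # facet name -> summary text

    def missing_facets(self):
        """Return the facets that have no (or an empty) summary."""
        return [f for f in FACETS if not self.summaries.get(f)]

    def to_pairs(self):
        """Flatten into (facet, document, summary) triples, one per available facet."""
        return [
            (f, self.document, self.summaries[f])
            for f in FACETS
            if self.summaries.get(f)
        ]


# Placeholder content, not taken from the dataset.
example = FacetedExample(
    document="Full article text ...",
    summaries={
        "purpose": "Placeholder purpose summary.",
        "findings": "Placeholder findings summary.",
    },
)
```

Flattening into per-facet triples is one possible design choice: it lets a standard single-summary model be trained or evaluated once per facet, while the record itself keeps the summaries grouped by document.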

research
02/08/2023

Long Text and Multi-Table Summarization: Dataset and Method

Automatic document summarization aims to produce a concise summary cover...
research
12/11/2022

MORTY: Structured Summarization for Targeted Information Extraction from Scholarly Articles

Information extraction from scholarly articles is a challenging task due...
research
04/13/2021

MS2: Multi-Document Summarization of Medical Studies

To assess the effectiveness of any medical intervention, researchers mus...
research
05/23/2022

SQuALITY: Building a Long-Document Summarization Dataset the Hard Way

Summarization datasets are often assembled either by scraping naturally ...
research
04/27/2020

Screenplay Summarization Using Latent Narrative Structure

Most general-purpose extractive summarization models are trained on news...
research
08/21/2021

Towards Personalized and Human-in-the-Loop Document Summarization

The ubiquitous availability of computing devices and the widespread use ...
research
06/04/2019

HighRES: Highlight-based Reference-less Evaluation of Summarization

There has been substantial progress in summarization research enabled by...
