How "Multi" is Multi-Document Summarization?

10/23/2022
by   Ruben Wolhandler, et al.
0

The task of multi-document summarization (MDS) aims at models that, given multiple documents as input, are able to generate a summary that combines disperse information, originally spread across these documents. Accordingly, it is expected that both reference summaries in MDS datasets, as well as system summaries, would indeed be based on such dispersed information. In this paper, we argue for quantifying and assessing this expectation. To that end, we propose an automated measure for evaluating the degree to which a summary is “disperse”, in the sense of the number of source documents needed to cover its content. We apply our measure to empirically analyze several popular MDS datasets, with respect to their reference summaries, as well as the output of state-of-the-art systems. Our results show that certain MDS datasets barely require combining information from multiple documents, where a single document often covers the full summary content. Overall, we advocate using our metric for assessing and improving the degree to which summarization datasets require combining multi-document information, and similarly how summarization models actually meet this challenge. Our code is available in https://github.com/ariecattan/multi_mds.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/01/2022

Multi-Document Summarization with Centroid-Based Pretraining

In multi-document summarization (MDS), the input is a cluster of documen...
research
03/03/2022

PeerSum: A Peer Review Dataset for Abstractive Multi-document Summarization

We present PeerSum, a new MDS dataset using peer reviews of scientific p...
research
02/09/2023

Generating a Structured Summary of Numerous Academic Papers: Dataset and Method

Writing a survey paper on one research topic usually needs to cover the ...
research
10/05/2020

Corpora Evaluation and System Bias Detection in Multi-document Summarization

Multi-document summarization (MDS) is the task of reflecting key points ...
research
05/19/2021

Analysis of GraphSum's Attention Weights to Improve the Explainability of Multi-Document Summarization

Modern multi-document summarization (MDS) methods are based on transform...
research
03/12/2023

Compressed Heterogeneous Graph for Abstractive Multi-Document Summarization

Multi-document summarization (MDS) aims to generate a summary for a numb...
research
06/07/2023

Absformer: Transformer-based Model for Unsupervised Multi-Document Abstractive Summarization

Multi-document summarization (MDS) refers to the task of summarizing the...

Please sign up or login with your details

Forgot password? Click here to reset