Mining both Commonality and Specificity from Multiple Documents for Multi-Document Summarization

03/05/2023
by   Bing Ma, et al.
0

The multi-document summarization task requires the designed summarizer to generate a short text that covers the important information of original documents and satisfies content diversity. This paper proposes a multi-document summarization approach based on hierarchical clustering of documents. It utilizes the constructed class tree of documents to extract both the sentences reflecting the commonality of all documents and the sentences reflecting the specificity of some subclasses of these documents for generating a summary, so as to satisfy the coverage and diversity requirements of multi-document summarization. Comparative experiments with different variant approaches on DUC'2002-2004 datasets prove the effectiveness of mining both the commonality and specificity of documents for multi-document summarization. Experiments on DUC'2004 and Multi-News datasets show that our approach achieves competitive performance compared to the state-of-the-art unsupervised and supervised approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/19/2022

Read Top News First: A Document Reordering Approach for Multi-Document News Summarization

A common method for extractive multi-document news summarization is to r...
research
09/08/2023

Unsupervised Multi-document Summarization with Holistic Inference

Multi-document summarization aims to obtain core information from a coll...
research
10/07/2017

Multi-Document Summarization using Distributed Bag-of-Words Model

As the number of documents on the web is growing exponentially, multi-do...
research
11/28/2016

Improving Multi-Document Summarization via Text Classification

Developed so far, multi-document summarization has reached its bottlenec...
research
02/10/2023

PDSum: Prototype-driven Continuous Summarization of Evolving Multi-document Sets Stream

Summarizing text-rich documents has been long studied in the literature,...
research
08/06/2015

Privacy-Preserving Multi-Document Summarization

State-of-the-art extractive multi-document summarization systems are usu...
research
04/11/2023

LBMT team at VLSP2022-Abmusu: Hybrid method with text correlation and generative models for Vietnamese multi-document summarization

Multi-document summarization is challenging because the summaries should...

Please sign up or login with your details

Forgot password? Click here to reset