Multi-Document Summarization via Discriminative Summary Reranking

by Xiaojun Wan, et al.

Existing multi-document summarization systems usually rely on a single summarization model (i.e., a summarization method with a specific parameter setting) to extract summaries for document sets on different topics. However, according to our quantitative analysis, no existing summarization model consistently produces high-quality summaries across document sets, and even a model with good overall performance may produce low-quality summaries for some document sets. Conversely, a baseline summarization model may produce high-quality summaries for some document sets. Based on these observations, we treat the summaries produced by different summarization models as candidate summaries, and then explore discriminative reranking techniques to identify high-quality summaries from the candidates for different document sets. We propose to extract a set of candidate summaries for each document set based on an ILP framework, and then leverage Ranking SVM for summary reranking. We develop a variety of features for the reranking process, including word-level, sentence-level, and summary-level features. Evaluation results on the benchmark DUC datasets validate the efficacy and robustness of the proposed approach.
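The reranking step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a linear scorer trained on a pairwise hinge loss by plain subgradient descent (the objective that a linear Ranking SVM optimizes, here without a dedicated solver), and the two features per candidate (a coverage score and a redundancy score), the quality values, and all function names are hypothetical toy choices made for illustration only.

```python
import random

def pairwise_differences(groups):
    """Build (better - worse) feature differences within each document set."""
    diffs = []
    for candidates in groups:  # one group of candidate summaries per document set
        for feats_a, quality_a in candidates:
            for feats_b, quality_b in candidates:
                if quality_a > quality_b:
                    diffs.append([a - b for a, b in zip(feats_a, feats_b)])
    return diffs

def train_ranker(groups, epochs=200, lr=0.1, reg=0.01, seed=0):
    """Fit linear weights with a regularised pairwise hinge loss (subgradient descent)."""
    rng = random.Random(seed)
    diffs = pairwise_differences(groups)
    w = [0.0] * len(diffs[0])
    for _ in range(epochs):
        rng.shuffle(diffs)
        for d in diffs:
            margin = sum(wi * di for wi, di in zip(w, d))
            for i in range(len(w)):
                # The hinge term contributes only while the pair violates the margin.
                grad = reg * w[i] - (d[i] if margin < 1.0 else 0.0)
                w[i] -= lr * grad
    return w

def rerank(w, candidates):
    """Return the index of the candidate feature vector with the highest score."""
    scores = [sum(wi * xi for wi, xi in zip(w, feats)) for feats in candidates]
    return max(range(len(scores)), key=scores.__getitem__)

# Toy training data (hypothetical): ([coverage, redundancy], quality score).
train_groups = [
    [([0.9, 0.1], 0.40), ([0.5, 0.3], 0.30), ([0.2, 0.6], 0.20)],
    [([0.7, 0.2], 0.35), ([0.4, 0.5], 0.25), ([0.3, 0.1], 0.28)],
]
w = train_ranker(train_groups)

# Candidate summaries for an unseen document set, described by the same features.
new_candidates = [[0.3, 0.5], [0.8, 0.1], [0.5, 0.4]]
best = rerank(w, new_candidates)
print("selected candidate index:", best)
```

Pairs are formed only within a document set, since the goal is to pick the best candidate for each set rather than to compare summaries across topics.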





Centroid-based summarization of multiple documents: sentence extraction, utility-based evaluation, and user studies

We present a multi-document summarizer, called MEAD, which generates sum...

Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents

Faceted summarization provides briefings of a document from different pe...

Controllable Abstractive Summarization

Current models for document summarization ignore user preferences such a...

Improving Faithfulness in Abstractive Summarization with Contrast Candidate Generation and Selection

Despite significant progress in neural abstractive summarization, recent...

Transfer Learning for Abstractive Summarization at Controllable Budgets

Summarizing a document within an allocated budget while maintaining its ...

RetrievalSum: A Retrieval Enhanced Framework for Abstractive Summarization

Existing summarization systems mostly generate summaries purely relying ...

Specificity-Based Sentence Ordering for Multi-Document Extractive Risk Summarization

Risk mining technologies seek to find relevant textual extractions that ...