Improving the Similarity Measure of Determinantal Point Processes for Extractive Multi-Document Summarization

05/31/2019
by Sangwoo Cho, et al.

The most important obstacles facing multi-document summarization include excessive redundancy in source descriptions and the looming shortage of training data. These obstacles prevent encoder-decoder models from being used directly, but optimization-based methods such as determinantal point processes (DPPs) are known to handle them well. In this paper we seek to strengthen a DPP-based method for extractive multi-document summarization by presenting a novel similarity measure inspired by capsule networks. The approach measures redundancy between a pair of sentences based on both surface form and semantic information. We show that our DPP system with the improved similarity measure performs competitively, outperforming strong summarization baselines on benchmark datasets. Our findings are particularly meaningful for summarizing documents that are written by multiple authors and contain redundant yet lexically diverse expressions.
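
To make the role of the similarity measure concrete, here is a minimal sketch of how a DPP can select a non-redundant subset of sentences. It is not the authors' system. It assumes the commonly used quality-diversity decomposition of the DPP kernel, L_ij = q_i * S_ij * q_j, and a simple greedy search for a high-determinant subset. The quality scores, the similarity matrix, and the function names `build_kernel` and `greedy_dpp` are all hypothetical. In particular, the hand-written similarity matrix stands in for the capsule-network-inspired measure described above.

```python
import numpy as np


def build_kernel(quality, similarity):
    """Build a DPP kernel L with L[i, j] = q_i * S[i, j] * q_j.

    quality:    per-sentence importance scores (hypothetical values).
    similarity: symmetric pairwise similarity matrix with ones on the diagonal.
    """
    q = np.asarray(quality, dtype=float)
    S = np.asarray(similarity, dtype=float)
    return q[:, None] * S * q[None, :]


def greedy_dpp(L, k):
    """Greedily pick up to k items, each time adding the item that gives
    the largest log-determinant of the selected submatrix of L."""
    selected = []
    remaining = list(range(L.shape[0]))
    while remaining and len(selected) < k:
        best_item, best_logdet = None, -np.inf
        for i in remaining:
            idx = selected + [i]
            sign, logdet = np.linalg.slogdet(L[np.ix_(idx, idx)])
            # Only accept candidates that keep the determinant positive.
            if sign > 0 and logdet > best_logdet:
                best_item, best_logdet = i, logdet
        if best_item is None:
            break
        selected.append(best_item)
        remaining.remove(best_item)
    return selected


# Toy input: sentences 0 and 1 are near-duplicates, sentence 2 is different.
quality = [1.0, 0.9, 0.8]
similarity = [
    [1.00, 0.95, 0.10],
    [0.95, 1.00, 0.10],
    [0.10, 0.10, 1.00],
]

L = build_kernel(quality, similarity)
summary = greedy_dpp(L, k=2)
print(summary)  # → [0, 2]
```

In this toy input, sentence 1 has a higher quality score than sentence 2 but is skipped because it is nearly identical to the already selected sentence 0. This shows why the accuracy of the pairwise similarity measure matters: it directly controls which sentences the determinant penalizes as redundant.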
