Massive Multi-Document Summarization of Product Reviews with Weak Supervision

07/22/2020
by   Ori Shapira, et al.

Product review summarization is a Multi-Document Summarization (MDS) task in which the document sets to be summarized are often far larger than in traditional MDS (up to tens of thousands of reviews). We highlight this difference and coin the term "Massive Multi-Document Summarization" (MMDS) to denote an MDS task that involves hundreds of documents or more. Prior work on product review summarization considered small samples of the reviews, mainly because of the difficulty of handling massive document sets. We show that summarizing small samples can lose important information and yield misleading evaluation results. We propose a schema for summarizing a massive set of reviews on top of a standard summarization algorithm. Since writing the large volumes of reference summaries that advanced neural network models require is impractical, our solution relies on weak supervision. Finally, we propose an evaluation scheme that is based on multiple crowdsourced reference summaries and aims to capture the massive review collection. We show that an initial implementation of our schema significantly improves over several baselines in ROUGE scores and exhibits strong coherence in a manual linguistic quality assessment.


research
03/03/2022

PeerSum: A Peer Review Dataset for Abstractive Multi-document Summarization

We present PeerSum, a new MDS dataset using peer reviews of scientific p...
research
04/24/2018

Towards a Neural Network Approach to Abstractive Multi-Document Summarization

Till now, neural abstractive summarization methods have achieved great s...
research
11/03/2020

WSL-DS: Weakly Supervised Learning with Distant Supervision for Query Focused Multi-Document Abstractive Summarization

In the Query Focused Multi-Document Summarization (QF-MDS) task, a set o...
research
09/13/2022

Document-aware Positional Encoding and Linguistic-guided Encoding for Abstractive Multi-document Summarization

One key challenge in multi-document summarization is to capture the rela...
research
10/18/2018

A Temporally Sensitive Submodularity Framework for Timeline Summarization

Timeline summarization (TLS) creates an overview of long-running events ...
research
01/31/2023

Do Multi-Document Summarization Models Synthesize?

Multi-document summarization entails producing concise synopses of colle...
research
10/05/2020

Corpora Evaluation and System Bias Detection in Multi-document Summarization

Multi-document summarization (MDS) is the task of reflecting key points ...
