Unsupervised Opinion Summarization with Noising and Denoising

04/21/2020
by Reinald Kim Amplayo, et al.

The supervised training of high-capacity models on large datasets containing hundreds of thousands of document-summary pairs is critical to the recent success of deep learning techniques for abstractive summarization. Unfortunately, in most domains (other than news) such training data is not available and cannot be easily sourced. In this paper we enable the use of supervised learning for the setting where there are only documents available (e.g., product or business reviews) without ground truth summaries. We create a synthetic dataset from a corpus of user reviews by sampling a review, pretending it is a summary, and generating noisy versions thereof which we treat as pseudo-review input. We introduce several linguistically motivated noise generation functions and a summarization model which learns to denoise the input and generate the original review. At test time, the model accepts genuine reviews and generates a summary containing salient opinions, treating those that do not reach consensus as noise. Extensive automatic and human evaluation shows that our model brings substantial improvements over both abstractive and extractive baselines.
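The synthetic-data idea described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the linguistically motivated noise functions, so this sketch substitutes a toy token-level noise (random token drops and replacements). The names `noise_tokens` and `make_synthetic_example` and the parameters `p_drop`, `p_swap`, and `n_noisy` are hypothetical and chosen only for illustration.

```python
import random


def noise_tokens(tokens, vocab, rng, p_drop=0.1, p_swap=0.1):
    """Toy token-level noise (an assumption, not the paper's noise functions)."""
    noisy = []
    for tok in tokens:
        r = rng.random()
        if r < p_drop:
            continue  # drop this token
        if r < p_drop + p_swap:
            noisy.append(rng.choice(vocab))  # replace with a random corpus word
        else:
            noisy.append(tok)  # keep the token unchanged
    # Fall back to the original tokens if everything was dropped.
    return noisy or list(tokens)


def make_synthetic_example(reviews, n_noisy=3, seed=0):
    """Sample one review as a pseudo-summary and build noisy pseudo-reviews."""
    rng = random.Random(seed)
    # Vocabulary used for replacements: every whitespace token in the corpus.
    vocab = sorted({tok for review in reviews for tok in review.split()})
    summary = rng.choice(reviews)  # the sampled review plays the summary role
    tokens = summary.split()
    pseudo_reviews = [
        " ".join(noise_tokens(tokens, vocab, rng)) for _ in range(n_noisy)
    ]
    # A denoising model would be trained to map "input" back to "target".
    return {"input": pseudo_reviews, "target": summary}


if __name__ == "__main__":
    corpus = [
        "great battery life and a sharp screen",
        "the staff were friendly but the food was cold",
        "fast shipping , sturdy build , fair price",
    ]
    example = make_synthetic_example(corpus, n_noisy=3, seed=0)
    print(example["target"])
    for pseudo_review in example["input"]:
        print("  ", pseudo_review)
```

Under these assumptions, each training pair maps several corrupted copies of a review back to the original. This mirrors the test-time behaviour described in the abstract, where the model reads genuine reviews and treats opinions that do not reach consensus as noise.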


