Unsupervised Multi-Document Opinion Summarization as Copycat-Review Generation

11/06/2019
by   Arthur Bražinskas, et al.
14

Summarization of opinions is the process of automatically creating text summaries that reflect subjective information expressed in input documents, such as product reviews. While most previous research in opinion summarization has focused on the extractive setting, i.e. selecting fragments of the input documents to produce a summary, we let the model generate novel sentences and hence produce fluent text. Supervised abstractive summarization methods typically rely on large quantities of document-summary pairs which are expensive to acquire. In contrast, we consider the unsupervised setting, in other words, we do not use any summaries in training. We define a generative model for a multi-product review collection. Intuitively, we want to design such a model that, when generating a new review given a set of other reviews of the product, we can control the `amount of novelty' going into the new review or, equivalently, vary the degree of deviation from the input reviews. At test time, when generating summaries, we force the novelty to be minimal, and produce a text reflecting consensus opinions. We capture this intuition by defining a hierarchical variational autoencoder model. Both individual reviews and products they correspond to are associated with stochastic latent codes, and the review generator ('decoder') has direct access to the text of input reviews through the pointer-generator mechanism. In experiments on Amazon and Yelp data, we show that in this model by setting at test time the review's latent code to its mean, we produce fluent and coherent summaries.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/30/2020

Few-Shot Learning for Abstractive Multi-Document Opinion Summarization

Opinion summarization is an automatic creation of text reflecting subjec...
research
05/24/2023

Meta-review Generation with Checklist-guided Iterative Introspection

Opinions in the scientific domain can be divergent, leading to controver...
research
11/27/2022

Unsupervised Opinion Summarisation in the Wasserstein Space

Opinion summarisation synthesises opinions expressed in a group of docum...
research
07/26/2023

Automatically Evaluating Opinion Prevalence in Opinion Summarization

When faced with a large number of product reviews, it is not clear that ...
research
04/30/2020

Self-Supervised and Controlled Multi-Document Opinion Summarization

We address the problem of unsupervised abstractive summarization of coll...
research
04/21/2020

Unsupervised Opinion Summarization with Noising and Denoising

The supervised training of high-capacity models on large datasets contai...
research
12/14/2020

Unsupervised Opinion Summarization with Content Planning

The recent success of deep learning techniques for abstractive summariza...

Please sign up or login with your details

Forgot password? Click here to reset