Self-Supervised and Controlled Multi-Document Opinion Summarization

04/30/2020
by   Hady Elsahar, et al.
0

We address the problem of unsupervised abstractive summarization of collections of user generated reviews with self-supervision and control. We propose a self-supervised setup that consider an individual document as a target summary for a set of similar documents. This setting makes training simpler than previous approaches by relying only on standard log-likelihood loss. We address the problem of hallucinations through the use of control codes, to steer the generation towards more coherent and relevant summaries.Finally, we extend the Transformer architecture to allow for multiple reviews as input. Our benchmarks on two datasets against graph-based and recent neural abstractive unsupervised models show that our proposed method generates summaries with a superior quality and relevance.This is confirmed in our human evaluation which focuses explicitly on the faithfulness of generated summaries We also provide an ablation study, which shows the importance of the control setup in controlling hallucinations and achieve high sentiment and topic alignment of the summaries with the input reviews.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/12/2018

Unsupervised Neural Multi-document Abstractive Summarization

Abstractive summarization has been studied using neural sequence transdu...
research
11/06/2019

Unsupervised Multi-Document Opinion Summarization as Copycat-Review Generation

Summarization of opinions is the process of automatically creating text ...
research
06/08/2020

Read what you need: Controllable Aspect-based Opinion Summarization of Tourist Reviews

Manually extracting relevant aspects and opinions from large volumes of ...
research
12/14/2020

Unsupervised Opinion Summarization with Content Planning

The recent success of deep learning techniques for abstractive summariza...
research
04/30/2020

Few-Shot Learning for Abstractive Multi-Document Opinion Summarization

Opinion summarization is an automatic creation of text reflecting subjec...
research
12/07/2020

An Enhanced MeanSum Method For Generating Hotel Multi-Review Summarizations

Multi-document summaritazion is the process of taking multiple texts as ...
research
09/16/2019

BottleSum: Unsupervised and Self-supervised Sentence Summarization using the Information Bottleneck Principle

The principle of the Information Bottleneck (Tishby et al. 1999) is to p...

Please sign up or login with your details

Forgot password? Click here to reset