BottleSum: Unsupervised and Self-supervised Sentence Summarization using the Information Bottleneck Principle

09/16/2019
by Peter West, et al.

The principle of the Information Bottleneck (Tishby et al. 1999) is to produce a summary of information X optimized to predict some other relevant information Y. In this paper, we propose a novel approach to unsupervised sentence summarization by mapping the Information Bottleneck principle to a conditional language modelling objective: given a sentence, our approach seeks a compressed sentence that can best predict the next sentence. Our iterative algorithm under the Information Bottleneck objective searches gradually shorter subsequences of the given sentence while maximizing the probability of the next sentence conditioned on the summary. Using only pretrained language models with no direct supervision, our approach can efficiently perform extractive sentence summarization over a large corpus. Building on our unsupervised extractive summarization (BottleSumEx), we then present a new approach to self-supervised abstractive summarization (BottleSumSelf), where a transformer-based language model is trained on the output summaries of our unsupervised method. Empirical results demonstrate that our extractive method outperforms other unsupervised models on multiple automatic metrics. In addition, we find that our self-supervised abstractive model outperforms unsupervised baselines (including our own) by human evaluation along multiple attributes.
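
The extractive search described above can be illustrated with a minimal sketch. It is not the authors' implementation: the scorer `toy_relevance` is a hypothetical word-overlap stand-in for the pretrained language model's probability of the next sentence given the summary, and the parameters `beam`, `max_delete`, and `min_len` are assumptions chosen for illustration. The sketch only shows the overall shape of the idea: repeatedly delete short spans of words, keep candidates that do not reduce the next-sentence score, and prefer shorter candidates.

```python
from typing import Callable, List, Tuple

Tokens = Tuple[str, ...]


def toy_relevance(summary: Tokens, next_sentence: Tokens) -> float:
    """Hypothetical stand-in for p(next sentence | summary).

    Returns the fraction of next-sentence words that also occur in the
    summary. A real system would query a pretrained language model here.
    """
    if not next_sentence:
        return 0.0
    vocab = set(summary)
    return sum(word in vocab for word in next_sentence) / len(next_sentence)


def shorter_candidates(tokens: Tokens, max_delete: int) -> List[Tokens]:
    """All non-empty subsequences made by deleting 1..max_delete adjacent words."""
    out = []
    for span in range(1, max_delete + 1):
        for start in range(0, len(tokens) - span + 1):
            cand = tokens[:start] + tokens[start + span:]
            if cand:
                out.append(cand)
    return out


def compress(
    sentence: str,
    next_sentence: str,
    score: Callable[[Tokens, Tokens], float] = toy_relevance,
    beam: int = 3,        # assumed beam width
    max_delete: int = 2,  # assumed maximum span deleted per step
    min_len: int = 1,     # assumed minimum summary length in words
) -> str:
    """Search gradually shorter subsequences that stay predictive of next_sentence."""
    source = tuple(sentence.split())
    nxt = tuple(next_sentence.split())
    frontier = [source]
    best, best_score = source, score(source, nxt)
    seen = {source}
    while frontier:
        scored = []
        for current in frontier:
            cur_score = score(current, nxt)
            for cand in shorter_candidates(current, max_delete):
                if len(cand) < min_len or cand in seen:
                    continue
                seen.add(cand)
                cand_score = score(cand, nxt)
                # Keep only deletions that do not reduce relevance.
                if cand_score >= cur_score:
                    scored.append((cand_score, cand))
        # Higher score first; among equal scores, shorter candidates first.
        scored.sort(key=lambda item: (-item[0], len(item[1])))
        frontier = [cand for _, cand in scored[:beam]]
        for cand_score, cand in scored[:beam]:
            if (cand_score, -len(cand)) > (best_score, -len(best)):
                best, best_score = cand, cand_score
    return " ".join(best)


if __name__ == "__main__":
    print(compress("the storm hit the coast hard last night",
                   "the coast was flooded"))
```

Because the result is built only by deleting words, it is always a subsequence of the input sentence, which matches the extractive setting. Swapping `toy_relevance` for a language-model score is the step that would bring the sketch closer to the method the abstract describes.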
