Unsupervised Extractive Summarization using Pointwise Mutual Information

02/11/2021
by   Vishakh Padmakumar, et al.
0

Unsupervised approaches to extractive summarization usually rely on a notion of sentence importance defined by the semantic similarity between a sentence and the document. We propose new metrics of relevance and redundancy using pointwise mutual information (PMI) between sentences, which can be easily computed by a pre-trained language model. Intuitively, a relevant sentence allows readers to infer the document content (high PMI with the document), and a redundant sentence can be inferred from the summary (high PMI with the summary). We then develop a greedy sentence selection algorithm to maximize relevance and minimize redundancy of extracted sentences. We show that our method outperforms similarity-based methods on datasets in a range of domains including news, medical journal articles, and personal anecdotes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/24/2023

Improving Sentence Similarity Estimation for Unsupervised Extractive Summarization

Unsupervised extractive summarization aims to extract salient sentences ...
research
05/08/2012

Document summarization using positive pointwise mutual information

The degree of success in document summarization processes depends on the...
research
11/16/2020

A Two-Phase Approach for Abstractive Podcast Summarization

Podcast summarization is different from summarization of other data form...
research
09/16/2020

Unsupervised Summarization by Jointly Extracting Sentences and Keywords

We present RepRank, an unsupervised graph-based ranking model for extrac...
research
05/31/2019

Improving the Similarity Measure of Determinantal Point Processes for Extractive Multi-Document Summarization

The most important obstacles facing multi-document summarization include...
research
02/02/2019

Query-oriented text summarization based on hypergraph transversals

Existing graph- and hypergraph-based algorithms for document summarizati...
research
10/12/2012

Quick Summary

Quick Summary is an innovate implementation of an automatic document sum...

Please sign up or login with your details

Forgot password? Click here to reset