Document summarization using positive pointwise mutual information

05/08/2012
by   Aji S, et al.
0

The degree of success in document summarization processes depends on the performance of the method used in identifying significant sentences in the documents. The collection of unique words characterizes the major signature of the document, and forms the basis for Term-Sentence-Matrix (TSM). The Positive Pointwise Mutual Information, which works well for measuring semantic similarity in the Term-Sentence-Matrix, is used in our method to assign weights for each entry in the Term-Sentence-Matrix. The Sentence-Rank-Matrix generated from this weighted TSM, is then used to extract a summary from the document. Our experiments show that such a method would outperform most of the existing methods in producing summaries from large documents.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/11/2021

Unsupervised Extractive Summarization using Pointwise Mutual Information

Unsupervised approaches to extractive summarization usually rely on a no...
research
02/24/2023

Improving Sentence Similarity Estimation for Unsupervised Extractive Summarization

Unsupervised extractive summarization aims to extract salient sentences ...
research
07/22/2020

Exploratory Search with Sentence Embeddings

Exploratory search aims to guide users through a corpus rather than pinp...
research
07/16/2019

STRASS: A Light and Effective Method for Extractive Summarization Based on Sentence Embeddings

This paper introduces STRASS: Summarization by TRAnsformation Selection ...
research
11/27/2021

An analysis of document graph construction methods for AMR summarization

Meaning Representation (AMR) is a graph-based semantic representation fo...
research
09/22/2021

Investigating Entropy for Extractive Document Summarization

Automatic text summarization aims to cut down readers time and cognitive...
research
09/16/2020

Unsupervised Summarization by Jointly Extracting Sentences and Keywords

We present RepRank, an unsupervised graph-based ranking model for extrac...

Please sign up or login with your details

Forgot password? Click here to reset