Unity in Diversity: Learning Distributed Heterogeneous Sentence Representation for Extractive Summarization

12/25/2019
by   Abhishek Kumar Singh, et al.
0

Automated multi-document extractive text summarization is a widely studied research problem in the field of natural language understanding. Such extractive mechanisms compute in some form the worthiness of a sentence to be included into the summary. While the conventional approaches rely on human crafted document-independent features to generate a summary, we develop a data-driven novel summary system called HNet, which exploits the various semantic and compositional aspects latent in a sentence to capture document independent features. The network learns sentence representation in a way that, salient sentences are closer in the vector space than non-salient sentences. This semantic and compositional feature vector is then concatenated with the document-dependent features for sentence ranking. Experiments on the DUC benchmark datasets (DUC-2001, DUC-2002 and DUC-2004) indicate that our model shows significant performance gain of around 1.5-2 points in terms of ROUGE score compared with the state-of-the-art baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/25/2019

Hybrid MemNet for Extractive Summarization

Extractive text summarization has been an extensive research problem in ...
research
12/14/2021

Reinforcing Semantic-Symmetry for Document Summarization

Document summarization condenses a long document into a short version wi...
research
04/07/2023

GEMINI: Controlling the Sentence-level Writing Style for Abstractive Text Summarization

Human experts write summaries using different techniques, including rewr...
research
09/20/2019

Learning to Create Sentence Semantic Relation Graphs for Multi-Document Summarization

Linking facts across documents is a challenging task, as the language us...
research
06/12/2017

Extract with Order for Coherent Multi-Document Summarization

In this work, we aim at developing an extractive summarizer in the multi...
research
10/09/2021

Extending Multi-Text Sentence Fusion Resources via Pyramid Annotations

NLP models that compare or consolidate information across multiple docum...
research
11/19/2020

Fact-level Extractive Summarization with Hierarchical Graph Mask on BERT

Most current extractive summarization models generate summaries by selec...

Please sign up or login with your details

Forgot password? Click here to reset