Learning to Create Sentence Semantic Relation Graphs for Multi-Document Summarization

09/20/2019
by   Diego Antognini, et al.
0

Linking facts across documents is a challenging task, as the language used to express the same information in a sentence can vary significantly, which complicates the task of multi-document summarization. Consequently, existing approaches heavily rely on hand-crafted features, which are domain-dependent and hard to craft, or additional annotated data, which is costly to gather. To overcome these limitations, we present a novel method, which makes use of two types of sentence embeddings: universal embeddings, which are trained on a large unrelated corpus, and domain-specific embeddings, which are learned during training. To this end, we develop SemSentSum, a fully data-driven model able to leverage both types of sentence embeddings by building a sentence semantic relation graph. SemSentSum achieves competitive results on two types of summary, consisting of 665 bytes and 100 words. Unlike other state-of-the-art models, neither hand-crafted features nor additional annotated data are necessary, and the method is easily adaptable for other tasks. To our knowledge, we are the first to use multiple sentence embeddings for the task of multi-document summarization.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/20/2017

Graph-based Neural Multi-Document Summarization

We propose a neural multi-document summarization (MDS) system that incor...
research
12/25/2019

Unity in Diversity: Learning Distributed Heterogeneous Sentence Representation for Extractive Summarization

Automated multi-document extractive text summarization is a widely studi...
research
03/29/2017

Automatic Argumentative-Zoning Using Word2vec

In comparison with document summarization on the articles from social me...
research
05/27/2020

Catching Attention with Automatic Pull Quote Selection

Pull quotes are an effective component of a captivating news article. Th...
research
10/04/2019

Neural Language Priors

The choice of sentence encoder architecture reflects assumptions about h...
research
02/22/2019

Improving Multilingual Sentence Embedding using Bi-directional Dual Encoder with Additive Margin Softmax

In this paper, we present an approach to learn multilingual sentence emb...
research
05/14/2018

Unsupervised Abstractive Meeting Summarization with Multi-Sentence Compression and Budgeted Submodular Maximization

We introduce a novel graph-based framework for abstractive meeting speec...

Please sign up or login with your details

Forgot password? Click here to reset