Learning with fuzzy hypergraphs: a topical approach to query-oriented text summarization

06/22/2019
by Hadrien Van Lierde, et al.

Existing graph-based methods for extractive document summarization represent the sentences of a corpus as the nodes of a graph or hypergraph whose edges depict lexical similarity between sentences. Such approaches fail to capture semantic similarity between sentences that express similar information but share few words and are thus lexically dissimilar. To overcome this issue, we propose to extract semantic similarities from topical representations of sentences. Inspired by the Hierarchical Dirichlet Process, we propose a probabilistic topic model to infer the topic distributions of sentences. Since each topic defines a semantic connection among a group of sentences, with a certain degree of membership for each sentence, we propose a fuzzy hypergraph model in which nodes are sentences and fuzzy hyperedges are topics. To produce an informative summary, we extract a set of sentences from the corpus by simultaneously maximizing their relevance to a user-defined query, their centrality in the fuzzy hypergraph, and their coverage of the topics present in the corpus. Building on the theory of submodular functions, we formulate a polynomial-time algorithm to solve the associated optimization problem. The paper includes a thorough comparative analysis with other graph-based summarization systems. The results show that our method outperforms these systems in terms of the content coverage of the summaries.
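The selection step described above can be illustrated with a small sketch. It is not the algorithm from the paper: it assumes a simplified objective (a square-root topic coverage term plus a weighted relevance term, with the centrality term omitted) and uses the standard cost-scaled greedy heuristic for monotone submodular maximization under a length budget. The function name `greedy_summary`, the weight `lam`, and the toy data are all hypothetical.

```python
import math


def greedy_summary(membership, relevance, lengths, budget, lam=1.0):
    """Greedily select sentence indices under a total length budget.

    membership[i][t]: fuzzy membership of sentence i in topic (hyperedge) t.
    relevance[i]: non-negative query relevance of sentence i.
    lengths[i]: positive length of sentence i (e.g. in words).
    The objective below is an assumed stand-in, not the paper's exact one.
    """
    n_topics = len(membership[0])
    selected = []
    covered = [0.0] * n_topics  # accumulated membership mass per topic
    used = 0

    def objective(cov, sel):
        # sqrt is concave, so extra mass on an already covered topic
        # yields diminishing returns (a monotone submodular term).
        coverage = sum(math.sqrt(c) for c in cov)
        return coverage + lam * sum(relevance[i] for i in sel)

    while True:
        current = objective(covered, selected)
        best, best_ratio = None, 0.0
        for i in range(len(membership)):
            if i in selected or used + lengths[i] > budget:
                continue
            new_cov = [c + m for c, m in zip(covered, membership[i])]
            gain = objective(new_cov, selected + [i]) - current
            ratio = gain / lengths[i]  # marginal gain per unit of length
            if ratio > best_ratio:
                best, best_ratio = i, ratio
        if best is None:
            return selected
        selected.append(best)
        covered = [c + m for c, m in zip(covered, membership[best])]
        used += lengths[best]


# Toy example: three sentences, two topics (made-up numbers).
membership = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9]]
relevance = [0.5, 0.4, 0.3]
lengths = [10, 10, 10]
print(greedy_summary(membership, relevance, lengths, budget=20))
```

In this toy run the second pick is sentence 2 rather than the more query-relevant sentence 1, because sentence 1 mostly repeats the topic already covered by sentence 0. This is the diminishing-returns behaviour that makes a submodular objective a natural fit for balancing relevance against topic coverage.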


