Knowledge-guided Unsupervised Rhetorical Parsing for Text Summarization

10/14/2019
by   Shengluan Hou, et al.
0

Automatic text summarization (ATS) has recently achieved impressive performance thanks to recent advances in deep learning and the availability of large-scale corpora. To make the summarization results more faithful, this paper presents an unsupervised approach that combines rhetorical structure theory, deep neural model and domain knowledge concern for ATS. This architecture mainly contains three components: domain knowledge base construction based on representation learning, attentional encoder-decoder model for rhetorical parsing and subroutine-based model for text summarization. Domain knowledge can be effectively used for unsupervised rhetorical parsing thus rhetorical structure trees for each document can be derived. In the unsupervised rhetorical parsing module, the idea of translation was adopted to alleviate the problem of data scarcity. The subroutine-based summarization model purely depends on the derived rhetorical structure trees and can generate content-balanced results. To evaluate the summary results without golden standard, we proposed an unsupervised evaluation metric, whose hyper-parameters were tuned by supervised learning. Experimental results show that, on a large-scale Chinese dataset, our proposed approach can obtain comparable performances compared with existing methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/03/2019

Attributed Rhetorical Structure Grammar for Domain Text Summarization

This paper presents a new approach of automatic text summarization which...
research
03/18/2020

Selective Attention Encoders by Syntactic Graph Convolutional Networks for Document Summarization

Abstractive text summarization is a challenging task, and one need to de...
research
10/09/2020

Q-learning with Language Model for Edit-based Unsupervised Summarization

Unsupervised methods are promising for abstractive text summarization in...
research
10/21/2019

On Semi-Supervised Multiple Representation Behavior Learning

We propose a novel paradigm of semi-supervised learning (SSL)–the semi-s...
research
10/27/2020

QBSUM: a Large-Scale Query-Based Document Summarization Dataset from Real-world Applications

Query-based document summarization aims to extract or generate a summary...
research
11/02/2020

How Domain Terminology Affects Meeting Summarization Performance

Meetings are essential to modern organizations. Numerous meetings are he...
research
12/06/2021

An unsupervised extractive summarization method based on multi-round computation

Text summarization methods have attracted much attention all the time. I...

Please sign up or login with your details

Forgot password? Click here to reset