Attributed Rhetorical Structure Grammar for Domain Text Summarization

09/03/2019
by Ruqian Lu, et al.

This paper presents a new approach to automatic text summarization that combines domain-oriented text analysis (DoTA) and rhetorical structure theory (RST) in the form of a grammar: the attributed rhetorical structure grammar (ARSG), in which the non-terminal symbols are domain keywords, called domain relations, while the rhetorical relations serve as attributes. We developed machine learning algorithms for learning such a grammar from a corpus of sample domain texts, parsing algorithms for the learned grammar, and adjustable text summarization algorithms for generating domain-specific summaries. Our experiments show that domain knowledge can effectively compensate for the lack of a very large training data set. We also show that the knowledge-based approach can be made more powerful by introducing grammar parsing and RST as an inference engine. To check the feasibility of model transfer, we introduce a technique for mapping a grammar from one domain to others at acceptable cost. We also give a comprehensive comparison of our approach with several others.
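
To make the idea of a grammar whose non-terminals are domain relations and whose attributes are rhetorical relations more concrete, here is a minimal Python sketch. It is not the authors' algorithm: the class names, the `summarize` function, the `depth` knob, and the toy sentences are all hypothetical. It assumes the standard RST distinction between a nucleus (the central span) and satellites (supporting spans), and reads "adjustable" as a depth limit on how many levels of satellites are kept.

```python
from dataclasses import dataclass, field
from typing import List, Union


@dataclass
class Leaf:
    """A terminal: one text span (hypothetical representation)."""
    text: str


@dataclass
class Node:
    """A parse-tree node for one applied production (illustrative only)."""
    domain_relation: str       # non-terminal symbol: a domain keyword
    rhetorical_relation: str   # attribute: an RST relation label
    nucleus: "Tree"            # central span in RST terms
    satellites: List["Tree"] = field(default_factory=list)


Tree = Union[Leaf, Node]


def summarize(tree: Tree, depth: int) -> List[str]:
    """Keep every nucleus; keep satellites only while depth > 0."""
    if isinstance(tree, Leaf):
        return [tree.text]
    kept = summarize(tree.nucleus, depth - 1)
    if depth > 0:
        for satellite in tree.satellites:
            kept.extend(summarize(satellite, depth - 1))
    return kept


# Toy parse tree with made-up domain relations and sentences.
tree = Node(
    domain_relation="Acquisition",
    rhetorical_relation="Elaboration",
    nucleus=Leaf("Company A acquired Company B."),
    satellites=[
        Node(
            domain_relation="Price",
            rhetorical_relation="Evidence",
            nucleus=Leaf("The price was disclosed on Monday."),
            satellites=[Leaf("Analysts had expected a lower figure.")],
        )
    ],
)

short_summary = summarize(tree, 0)  # nucleus only
full_summary = summarize(tree, 2)   # nucleus plus two levels of satellites
```

Raising `depth` lengthens the summary by admitting progressively less central spans, which is one simple way a summary length could be made adjustable under these assumptions.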

