Improved Document Modelling with a Neural Discourse Parser

11/16/2019
by   Fajri Koto, et al.
0

Despite the success of attention-based neural models for natural language generation and classification tasks, they are unable to capture the discourse structure of larger documents. We hypothesize that explicit discourse representations have utility for NLP tasks over longer documents or document sequences, which sequence-to-sequence models are unable to capture. For abstractive summarization, for instance, conventional neural models simply match source documents and the summary in a latent space without explicit representation of text structure or relations. In this paper, we propose to use neural discourse representations obtained from a rhetorical structure theory (RST) parser to enhance document representations. Specifically, document representations are generated for discourse spans, known as the elementary discourse units (EDUs). We empirically investigate the benefit of the proposed approach on two different tasks: abstractive summarization and popularity prediction of online petitions. We find that the proposed approach leads to improvements in all cases.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/16/2018

A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents

Neural abstractive summarization models have led to promising results in...
research
05/30/2019

Hierarchical Transformers for Multi-Document Summarization

In this paper, we develop a neural summarization model which can effecti...
research
06/04/2019

Evaluating Discourse in Structured Text Representations

Discourse structure is integral to understanding a text and is helpful i...
research
11/06/2020

Unleashing the Power of Neural Discourse Parsers – A Context and Structure Aware Approach Using Large Scale Pretraining

RST-based discourse parsing is an important NLP task with numerous downs...
research
05/25/2017

Learning Structured Text Representations

In this paper, we focus on learning structure-aware document representat...
research
02/04/2022

Tracking Discourse Influence in Darknet Forums

This technical report documents our efforts in addressing the tasks set ...
research
06/13/2019

Unsupervised Neural Single-Document Summarization of Reviews via Learning Latent Discourse Structure and its Ranking

This paper focuses on the end-to-end abstractive summarization of a sing...

Please sign up or login with your details

Forgot password? Click here to reset