Predicting Above-Sentence Discourse Structure using Distant Supervision from Topic Segmentation

12/12/2021
by   Patrick Huber, et al.
0

RST-style discourse parsing plays a vital role in many NLP tasks, revealing the underlying semantic/pragmatic structure of potentially complex and diverse documents. Despite its importance, one of the most prevailing limitations in modern day discourse parsing is the lack of large-scale datasets. To overcome the data sparsity issue, distantly supervised approaches from tasks like sentiment analysis and summarization have been recently proposed. Here, we extend this line of research by exploiting distant supervision from topic segmentation, which can arguably provide a strong and oftentimes complementary signal for high-level discourse structures. Experiments on two human-annotated discourse treebanks confirm that our proposal generates accurate tree structures on sentence and paragraph level, consistently outperforming previous distantly supervised models on the sentence-to-document task and occasionally reaching even higher scores on the sentence-to-paragraph level.

READ FULL TEXT
research
11/05/2020

MEGA RST Discourse Treebanks with Structure and Nuclearity from Scalable Distant Sentiment Supervision

The lack of large and diverse discourse treebanks hinders the applicatio...
research
05/23/2023

Topic-driven Distant Supervision Framework for Macro-level Discourse Parsing

Discourse parsing, the task of analyzing the internal rhetorical structu...
research
09/08/2023

RST-style Discourse Parsing Guided by Document-level Content Structures

Rhetorical Structure Theory based Discourse Parsing (RST-DP) explores ho...
research
05/15/2023

Uncovering the Potential of ChatGPT for Discourse Analysis in Dialogue: An Empirical Study

Large Language Models (LLMs) like ChatGPT have proven a great shallow un...
research
09/18/2022

Improving Topic Segmentation by Injecting Discourse Dependencies

Recent neural supervised topic segmentation models achieve distinguished...
research
01/02/2021

Multitask Learning for Class-Imbalanced Discourse Classification

Small class-imbalanced datasets, common in many high-level semantic task...
research
10/30/2019

Predicting Discourse Structure using Distant Supervision from Sentiment

Discourse parsing could not yet take full advantage of the neural NLP re...

Please sign up or login with your details

Forgot password? Click here to reset