Improving Topic Segmentation by Injecting Discourse Dependencies

09/18/2022
by   Linzi Xing, et al.
0

Recent neural supervised topic segmentation models achieve distinguished superior effectiveness over unsupervised methods, with the availability of large-scale training corpora sampled from Wikipedia. These models may, however, suffer from limited robustness and transferability caused by exploiting simple linguistic cues for prediction, but overlooking more important inter-sentential topical consistency. To address this issue, we present a discourse-aware neural topic segmentation model with the injection of above-sentence discourse dependency structures to encourage the model make topic boundary prediction based more on the topical consistency between sentences. Our empirical study on English evaluation datasets shows that injecting above-sentence discourse structures to a neural topic segmenter with our proposed strategy can substantially improve its performances on intra-domain and out-of-domain data, with little increase of model's complexity.

READ FULL TEXT

page 1

page 4

research
12/12/2021

Predicting Above-Sentence Discourse Structure using Distant Supervision from Topic Segmentation

RST-style discourse parsing plays a vital role in many NLP tasks, reveal...
research
12/31/2019

A Hybrid Framework for Topic Structure using Laughter Occurrences

Conversational discourse coherence depends on both linguistic and parali...
research
09/11/2018

A Joint Model of Conversational Discourse and Latent Topics on Microblogs

Conventional topic models are ineffective for topic extraction from micr...
research
05/20/2020

Pretraining with Contrastive Sentence Objectives Improves Discourse Performance of Language Models

Recent models for unsupervised representation learning of text have empl...
research
05/15/2023

Uncovering the Potential of ChatGPT for Discourse Analysis in Dialogue: An Empirical Study

Large Language Models (LLMs) like ChatGPT have proven a great shallow un...
research
02/27/2019

DiscoFuse: A Large-Scale Dataset for Discourse-based Sentence Fusion

Sentence fusion is the task of joining several independent sentences int...
research
10/07/2020

Improving Context Modeling in Neural Topic Segmentation

Topic segmentation is critical in key NLP tasks and recent works favor h...

Please sign up or login with your details

Forgot password? Click here to reset