DeepAI AI Chat
Log In Sign Up

Multitask Learning for Class-Imbalanced Discourse Classification

by   Alexander Spangher, et al.

Small class-imbalanced datasets, common in many high-level semantic tasks like discourse analysis, present a particular challenge to current deep-learning architectures. In this work, we perform an extensive analysis on sentence-level classification approaches for the News Discourse dataset, one of the largest high-level semantic discourse datasets recently published. We show that a multitask approach can improve 7 state-of-the-art benchmarks, due in part to label corrections across tasks, which improve performance for underrepresented classes. We also offer a comparative review of additional techniques proposed to address resource-poor problems in NLP, and show that none of these approaches can improve classification accuracy in such a setting.


page 5

page 16


Predicting Above-Sentence Discourse Structure using Distant Supervision from Topic Segmentation

RST-style discourse parsing plays a vital role in many NLP tasks, reveal...

DiscSense: Automated Semantic Analysis of Discourse Markers

Discourse markers ( by contrast, happily, etc.) are words or phrases th...

Uncovering the Potential of ChatGPT for Discourse Analysis in Dialogue: An Empirical Study

Large Language Models (LLMs) like ChatGPT have proven a great shallow un...

Imbalanced Sentiment Classification Enhanced with Discourse Marker

Imbalanced data commonly exists in real world, espacially in sentiment-r...

Discourse Relation Embeddings: Representing the Relations between Discourse Segments in Social Media

Discourse relations are typically modeled as a discrete class that chara...

Learning Outside the Box: Discourse-level Features Improve Metaphor Identification

Most current approaches to metaphor identification use restricted lingui...

Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced Data

The automatic identification of propaganda has gained significance in re...