Imbalanced Sentiment Classification Enhanced with Discourse Marker

03/28/2019
by   Tao Zhang, et al.
0

Imbalanced data commonly exists in real world, espacially in sentiment-related corpus, making it difficult to train a classifier to distinguish latent sentiment in text data. We observe that humans often express transitional emotion between two adjacent discourses with discourse markers like "but", "though", "while", etc, and the head discourse and the tail discourse 3 usually indicate opposite emotional tendencies. Based on this observation, we propose a novel plug-and-play method, which first samples discourses according to transitional discourse markers and then validates sentimental polarities with the help of a pretrained attention-based model. Our method increases sample diversity in the first place, can serve as a upstream preprocessing part in data augmentation. We conduct experiments on three public sentiment datasets, with several frequently used algorithms. Results show that our method is found to be consistently effective, even in highly imbalanced scenario, and easily be integrated with oversampling method to boost the performance on imbalanced sentiment classification.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/05/2020

From Sentiment Annotations to Sentiment Prediction through Discourse Augmentation

Sentiment analysis, especially for long documents, plausibly requires me...
research
09/04/2015

Better Document-level Sentiment Analysis from RST Discourse Parsing

Discourse structure is the hidden link between surface features and docu...
research
11/05/2020

MEGA RST Discourse Treebanks with Structure and Nuclearity from Scalable Distant Sentiment Supervision

The lack of large and diverse discourse treebanks hinders the applicatio...
research
04/18/2017

Sentiment analysis based on rhetorical structure theory: Learning deep neural networks from discourse trees

Prominent applications of sentiment analysis are countless, covering are...
research
01/02/2021

Multitask Learning for Class-Imbalanced Discourse Classification

Small class-imbalanced datasets, common in many high-level semantic task...
research
05/18/2022

Features of Perceived Metaphoricity on the Discourse Level: Abstractness and Emotionality

Research on metaphorical language has shown ties between abstractness an...

Please sign up or login with your details

Forgot password? Click here to reset