Neural Contrastive Clustering: Fully Unsupervised Bias Reduction for Sentiment Classification

04/22/2022
by Jared Mowery, et al.

Background: Neural networks produce biased classification results due to correlation bias (they learn correlations between their inputs and outputs to classify samples, even when those correlations do not represent cause-and-effect relationships).

Objective: This study introduces a fully unsupervised method of mitigating correlation bias, demonstrated with sentiment classification on COVID-19 social media data.

Methods: Correlation bias in sentiment classification often arises in conversations about controversial topics. Therefore, this study uses adversarial learning to contrast clusters based on sentiment classification labels, with clusters produced by unsupervised topic modeling. This discourages the neural network from learning topic-related features that produce biased classification results.

Results: Compared to a baseline classifier, neural contrastive clustering approximately doubles accuracy on bias-prone sentences for human-labeled COVID-19 social media data, without adversely affecting the classifier's overall F1 score. Despite being a fully unsupervised approach, neural contrastive clustering achieves a larger improvement in accuracy on bias-prone sentences than a supervised masking approach.

Conclusions: Neural contrastive clustering reduces correlation bias in sentiment text classification. Further research is needed to explore generalizing this technique to other neural network architectures and application domains.
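The Methods description contrasts clusters defined by sentiment labels with clusters from topic modeling. As a rough illustration of what "topic-related" predictions look like, the sketch below measures how strongly a set of sentiment labels lines up with topic cluster assignments, using mutual information. This is an assumed diagnostic for illustration only, not the adversarial objective used in the study; the function name `mutual_information` and the sample labels are hypothetical.

```python
import math
from collections import Counter


def mutual_information(sentiment_labels, topic_ids):
    """Mutual information (in bits) between two labelings of the same samples.

    A value near 0 means sentiment labels are independent of topic clusters;
    larger values mean the sentiment labels largely track the topic.
    """
    if len(sentiment_labels) != len(topic_ids):
        raise ValueError("labelings must have the same length")
    n = len(sentiment_labels)
    if n == 0:
        return 0.0

    joint = Counter(zip(sentiment_labels, topic_ids))  # co-occurrence counts
    sentiment_counts = Counter(sentiment_labels)
    topic_counts = Counter(topic_ids)

    mi = 0.0
    for (s, t), count in joint.items():
        p_joint = count / n
        # ratio = p(s, t) / (p(s) * p(t)), written with raw counts
        ratio = count * n / (sentiment_counts[s] * topic_counts[t])
        mi += p_joint * math.log2(ratio)
    return mi


# Hypothetical labels: sentiment follows the topic cluster exactly.
aligned = mutual_information(["pos", "pos", "neg", "neg"], [0, 0, 1, 1])

# Hypothetical labels: sentiment is independent of the topic cluster.
independent = mutual_information(["pos", "neg", "pos", "neg"], [0, 0, 1, 1])
```

In this toy setting, the aligned case has high mutual information (sentiment can be read off the topic alone), while the independent case has none. A classifier whose predictions behave like the aligned case on controversial topics is the kind of model the contrastive approach is meant to discourage.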


