Lex2Sent: A bagging approach to unsupervised sentiment analysis

09/26/2022
by   Kai-Robin Lange, et al.
0

Unsupervised sentiment analysis is traditionally performed by counting those words in a text that are stored in a sentiment lexicon and then assigning a label depending on the proportion of positive and negative words registered. While these "counting" methods are considered to be beneficial as they rate a text deterministically, their classification rates decrease when the analyzed texts are short or the vocabulary differs from what the lexicon considers default. The model proposed in this paper, called Lex2Sent, is an unsupervised sentiment analysis method to improve the classification of sentiment lexicon methods. For this purpose, a Doc2Vec-model is trained to determine the distances between document embeddings and the embeddings of the positive and negative part of a sentiment lexicon. These distances are then evaluated for multiple executions of Doc2Vec on resampled documents and are averaged to perform the classification task. For three benchmark datasets considered in this paper, the proposed Lex2Sent outperforms every evaluated lexicon, including state-of-the-art lexica like VADER or the Opinion Lexicon in terms of classification rate.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/05/2022

Improving Sentiment Analysis By Emotion Lexicon Approach on Vietnamese Texts

The sentiment analysis task has various applications in practice. In the...
research
03/31/2017

Opinion Mining on Non-English Short Text

As the type and the number of such venues increase, automated analysis o...
research
09/28/2017

Sentiment Classification with Word Attention based on Weakly Supervised Learning with a Convolutional Neural Network

In order to maximize the applicability of sentiment analysis results, it...
research
09/29/2021

Classifying Tweet Sentiment Using the Hidden State and Attention Matrix of a Fine-tuned BERTweet Model

This paper introduces a study on tweet sentiment classification. Our tas...
research
11/21/2016

Unsupervised Learning for Lexicon-Based Classification

In lexicon-based classification, documents are assigned labels by compar...
research
04/30/2019

Semi-Unsupervised Lifelong Learning for Sentiment Classification

Lifelong machine learning is a novel machine learning paradigm which con...
research
05/13/2016

Online Optimization Methods for the Quantification Problem

The estimation of class prevalence, i.e., the fraction of a population t...

Please sign up or login with your details

Forgot password? Click here to reset