Text Length Adaptation in Sentiment Classification

09/18/2019
by   Reinald Kim Amplayo, et al.
0

Can a text classifier generalize well for datasets where the text length is different? For example, when short reviews are sentiment-labeled, can these transfer to predict the sentiment of long reviews (i.e., short to long transfer), or vice versa? While unsupervised transfer learning has been well-studied for cross domain/lingual transfer tasks, Cross Length Transfer (CLT) has not yet been explored. One reason is the assumption that length difference is trivially transferable in classification. We show that it is not, because short/long texts differ in context richness and word intensity. We devise new benchmark datasets from diverse domains and languages, and show that existing models from similar tasks cannot deal with the unique challenge of transferring across text lengths. We introduce a strong baseline model called BaggedCNN that treats long texts as bags containing short texts. We propose a state-of-the-art CLT model called Length Transfer Networks (LeTraNets) that introduces a two-way encoding scheme for short and long texts using multiple training mechanisms. We test our models and find that existing models perform worse than the BaggedCNN baseline, while LeTraNets outperforms all models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/18/2023

NollySenti: Leveraging Transfer Learning and Machine Translation for Nigerian Movie Sentiment Classification

Africa has over 2000 indigenous languages but they are under-represented...
research
06/07/2018

Ermes: Emoji-Powered Representation Learning for Cross-Lingual Sentiment Classification

Most existing sentiment analysis approaches heavily rely on a large amou...
research
03/15/2023

Cross-domain Sentiment Classification in Spanish

Sentiment Classification is a fundamental task in the field of Natural L...
research
04/17/2021

Learning to Share by Masking the Non-shared for Multi-domain Sentiment Classification

Multi-domain sentiment classification deals with the scenario where labe...
research
01/04/2020

Adapting Deep Learning for Sentiment Classification of Code-Switched Informal Short Text

Nowadays, an abundance of short text is being generated that uses nonsta...
research
03/26/2021

An Embedding-based Joint Sentiment-Topic Model for Short Texts

Short text is a popular avenue of sharing feedback, opinions and reviews...
research
12/29/2014

Quantifying origin and character of long-range correlations in narrative texts

In natural language using short sentences is considered efficient for co...

Please sign up or login with your details

Forgot password? Click here to reset