
An Embedding-based Joint Sentiment-Topic Model for Short Texts

by Ayan Sengupta, et al.

Short text is a popular avenue for sharing feedback, opinions and reviews on social media, e-commerce platforms, etc. Many companies need to extract meaningful information (which may include thematic content as well as semantic polarity) from such short texts to understand users' behaviour. However, obtaining high-quality, sentiment-associated and human-interpretable themes remains a challenge for short texts. In this paper we develop ELJST, an embedding-enhanced generative joint sentiment-topic model that can discover more coherent and diverse topics from short texts. It uses a Markov Random Field regularizer that can be seen as a generalisation of skip-gram based models. Further, it can leverage higher-order semantic information appearing in word embeddings, such as self-attention weights, in graphical models. Our results show an average improvement of 10 diversification over baselines. Finally, ELJST helps understand users' behaviour at more granular levels that can be explained. All of this can bring significant value to the service and healthcare industries, which often deal with customers.


