Embedding Projection for Targeted Cross-Lingual Sentiment: Model Comparisons and a Real-World Study

06/24/2019
by Jeremy Barnes, et al.

Sentiment analysis benefits from large, hand-annotated resources for training and testing machine learning models, which are often data hungry. While some languages, e.g., English, have a vast array of these resources, most under-resourced languages do not, especially for fine-grained sentiment tasks such as aspect-level or targeted sentiment analysis. To improve this situation, we propose a cross-lingual approach to sentiment analysis that is applicable to under-resourced languages and takes target-level information into account. This model incorporates sentiment information into bilingual distributional representations by jointly optimizing them for semantics and sentiment, and it shows state-of-the-art performance at the sentence level when combined with machine translation. Adapting the model to targeted sentiment analysis on multiple domains shows that it outperforms other projection-based bilingual embedding methods on binary targeted sentiment tasks. Our analysis of ten languages demonstrates that the amount of unlabeled monolingual data has surprisingly little effect on the sentiment results. As expected, projecting from an annotated source language to a target language gives better results when the two languages are similar. Our results therefore suggest that more effort should be spent on creating resources for languages that are less similar to those which are already resource-rich. Finally, a domain mismatch leads to decreased performance, which suggests that resources in any language should ideally cover a variety of domains.
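To make the idea of jointly optimizing for semantics and sentiment more concrete, the following is a minimal sketch and not the authors' implementation. The abstract does not give the form of the objective, so everything here is an assumption made for illustration: the function name `joint_loss`, the projection matrices `M_src` and `M_trg`, the classifier weights `W`, the squared-distance projection term, the softmax cross-entropy sentiment term, and the mixing weight `alpha`.

```python
import numpy as np

def joint_loss(src_vecs, trg_vecs, M_src, M_trg, sent_vecs, labels, W, alpha=0.5):
    """Hypothetical joint objective: alpha * projection + (1 - alpha) * sentiment.

    src_vecs, trg_vecs: embeddings of translation pairs, one pair per row (assumed).
    M_src, M_trg: linear maps into a shared bilingual space (assumed).
    sent_vecs, labels: source-language embeddings with integer sentiment labels.
    W: weights of a linear sentiment classifier on the shared space (assumed).
    """
    # Semantic term: translation pairs should land close together after projection.
    diff = src_vecs @ M_src - trg_vecs @ M_trg
    projection = np.mean(np.sum(diff ** 2, axis=1))

    # Sentiment term: softmax cross-entropy on the projected source vectors.
    logits = (sent_vecs @ M_src) @ W
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    sentiment = -np.mean(log_probs[np.arange(len(labels)), labels])

    return alpha * projection + (1 - alpha) * sentiment
```

In a sketch like this, `alpha` would control how much the shared space is shaped by translation pairs versus sentiment labels. Because the classifier operates on the shared space, it could in principle be applied to projected target-language vectors that carry no labels of their own.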

