Bilingual Sentiment Embeddings: Joint Projection of Sentiment Across Languages

05/23/2018
by   Jeremy Barnes, et al.
0

Sentiment analysis in low-resource languages suffers from a lack of annotated corpora to estimate high-performing models. Machine translation and bilingual word embeddings provide some relief through cross-lingual sentiment approaches. However, they either require large amounts of parallel data or do not sufficiently capture sentiment information. We introduce Bilingual Sentiment Embeddings (BLSE), which jointly represent sentiment information in a source and target language. This model only requires a small bilingual lexicon, a source-language corpus annotated for sentiment, and monolingual word embeddings for each language. We perform experiments on three language combinations (Spanish, Catalan, Basque) for sentence-level cross-lingual sentiment classification and find that our model significantly outperforms state-of-the-art methods on four out of six experimental setups, as well as capturing complementary information to machine translation. Our analysis of the resulting embedding space provides evidence that it represents sentiment information in the resource-poor target language without any annotated data in that language.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/24/2019

Embedding Projection for Targeted Cross-Lingual Sentiment: Model Comparisons and a Real-World Study

Sentiment analysis benefits from large, hand-annotated resources in orde...
research
06/06/2016

Adversarial Deep Averaging Networks for Cross-Lingual Sentiment Classification

In recent years deep neural networks have achieved great success in sent...
research
12/15/2016

Building a robust sentiment lexicon with (almost) no resource

Creating sentiment polarity lexicons is labor intensive. Automatically t...
research
10/26/2022

Sinhala Sentence Embedding: A Two-Tiered Structure for Low-Resource Languages

In the process of numerically modeling natural languages, developing lan...
research
10/31/2019

Neural Cross-Lingual Relation Extraction Based on Bilingual Word Embedding Mapping

Relation extraction (RE) seeks to detect and classify semantic relations...
research
10/09/2017

Deep Learning Paradigm with Transformed Monolingual Word Embeddings for Multilingual Sentiment Analysis

The surge of social media use brings huge demand of multilingual sentime...
research
03/24/2021

When Word Embeddings Become Endangered

Big languages such as English and Finnish have many natural language pro...

Please sign up or login with your details

Forgot password? Click here to reset