Cross-lingual Transfer of Twitter Sentiment Models Using a Common Vector Space

05/15/2020
by   Marko Robnik-Šikonja, et al.

Word embeddings represent words in a numeric space in such a way that semantic relations between words are encoded as distances and directions in the vector space. Cross-lingual word embeddings map words from one language to the vector space of another language, or words from multiple languages to the same vector space where similar words are aligned. Cross-lingual embeddings can be used to transfer machine learning models between languages and thereby compensate for insufficient data in less-resourced languages. We use cross-lingual word embeddings to transfer machine learning prediction models for Twitter sentiment between 13 languages. We focus on two transfer mechanisms using the joint numerical space for many languages as implemented in the LASER library: the transfer of trained models, and expansion of training sets with instances from other languages. Our experiments show that the transfer of models between similar languages is sensible, while dataset expansion did not increase the predictive performance.
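The two transfer mechanisms named in the abstract can be illustrated with a toy sketch. It does not use LASER or real tweets: the embeddings are synthetic NumPy vectors standing in for sentence embeddings in a shared space, and the nearest-centroid classifier, the dimensionality, and all function names (`fake_embeddings`, `fit_centroids`, `predict`) are assumptions made only for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
DIM = 16        # toy dimensionality (assumption; real sentence embeddings are larger)
N_CLASSES = 3   # e.g. negative, neutral, positive

# Hypothetical class locations shared by both languages: this is what a
# common cross-lingual vector space is assumed to provide.
class_means = rng.normal(size=(N_CLASSES, DIM)) * 3.0


def fake_embeddings(n_per_class, noise=1.0):
    """Synthetic stand-in for embedded tweets of one language."""
    X = np.vstack([
        mean + rng.normal(scale=noise, size=(n_per_class, DIM))
        for mean in class_means
    ])
    y = np.repeat(np.arange(N_CLASSES), n_per_class)
    return X, y


def fit_centroids(X, y):
    """Train a nearest-centroid classifier: one mean vector per class."""
    return np.vstack([X[y == c].mean(axis=0) for c in range(N_CLASSES)])


def predict(centroids, X):
    """Assign each vector to the class of its closest centroid."""
    dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    return dists.argmin(axis=1)


# Source language has plenty of labels; target language has only a few.
X_src, y_src = fake_embeddings(200)
X_tgt_train, y_tgt_train = fake_embeddings(20)
X_tgt_test, y_tgt_test = fake_embeddings(50)

# Mechanism 1: train on the source language only, apply to the target language.
model_transfer = fit_centroids(X_src, y_src)
acc_transfer = (predict(model_transfer, X_tgt_test) == y_tgt_test).mean()

# Mechanism 2: expand the small target training set with source instances.
X_expanded = np.vstack([X_tgt_train, X_src])
y_expanded = np.concatenate([y_tgt_train, y_src])
model_expanded = fit_centroids(X_expanded, y_expanded)
acc_expanded = (predict(model_expanded, X_tgt_test) == y_tgt_test).mean()

print(f"model transfer accuracy:    {acc_transfer:.2f}")
print(f"dataset expansion accuracy: {acc_expanded:.2f}")
```

Because both synthetic "languages" are drawn from the same class locations, the sketch represents the idealized case of a perfectly aligned space. It says nothing about the results reported in the paper, where alignment quality and language similarity affect how well transfer works.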
