Learning Multilingual Word Embeddings in Latent Metric Space: A Geometric Approach

08/27/2018
by Pratik Jawanpuria, et al.

We propose a novel geometric approach for learning bilingual mappings given monolingual embeddings and a bilingual dictionary. Our approach decouples learning the transformation from the source language to the target language into (a) learning rotations for language-specific embeddings to align them to a common space, and (b) learning a similarity metric in the common space to model similarities between the embeddings. We model the bilingual mapping problem as an optimization problem on smooth Riemannian manifolds. We show that our approach outperforms previous approaches on the bilingual lexicon induction and cross-lingual word similarity tasks. Because the rotated embeddings live in a common latent space, the approach extends naturally to representing multiple languages in that space. We also show that these multilingual embeddings can be learned jointly given bilingual dictionaries for multiple language pairs. We demonstrate the effectiveness of the multilingual embeddings in a zero-shot word translation setting, where no source-target bilingual dictionary is available but source-pivot and pivot-target bilingual dictionaries are: word translation using the multilingual embeddings is better than word translation through a pivot language.
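The decoupling described in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not give the parameterization, so everything here is an assumption made for illustration: the rotations are modeled as orthogonal matrices `U` (source) and `V` (target), the similarity metric as a symmetric positive-definite matrix `B`, and the score as the `B`-weighted inner product of the rotated embeddings. The names `random_orthogonal`, `random_spd`, and `similarity` are hypothetical, and the matrices are random rather than learned.

```python
import numpy as np

def random_orthogonal(dim, rng):
    """Random orthogonal matrix: the Q factor of a Gaussian matrix (stand-in for a learned rotation)."""
    q, _ = np.linalg.qr(rng.standard_normal((dim, dim)))
    return q

def random_spd(dim, rng):
    """Random symmetric positive-definite matrix (stand-in for a learned metric)."""
    a = rng.standard_normal((dim, dim))
    return a @ a.T + dim * np.eye(dim)

def similarity(x, y, U, V, B):
    """Rotate x and y into the common space, then compare them under the metric B."""
    return (U.T @ x) @ B @ (V.T @ y)

rng = np.random.default_rng(0)
dim = 4
U = random_orthogonal(dim, rng)   # assumed source-language rotation
V = random_orthogonal(dim, rng)   # assumed target-language rotation
B = random_spd(dim, rng)          # assumed metric in the common space

x = rng.standard_normal(dim)      # toy source-word embedding
y = rng.standard_normal(dim)      # toy target-word embedding

score = similarity(x, y, U, V, B)

# Under these assumptions, the two-step score equals a single bilinear map W = U B V^T.
W = U @ B @ V.T
```

Under these assumptions, adding a language only requires one more orthogonal matrix into the same space, which is one way to read the abstract's claim that multiple languages can share a common representation.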

research
06/05/2020

Filtered Inner Product Projection for Multilingual Embedding Alignment

Due to widespread interest in machine translation and transfer learning,...
research
04/23/2018

Bilingual Embeddings with Random Walks over Multilingual Wordnets

Bilingual word embeddings represent words of two languages in the same s...
research
12/19/2017

Unsupervised Word Mapping Using Structural Similarities in Monolingual Embeddings

Most existing methods of automatic bilingual dictionary induction rely o...
research
05/14/2019

Multilingual Factor Analysis

In this work we approach the task of learning multilingual word represen...
research
11/02/2018

Unsupervised Hyperalignment for Multilingual Word Embeddings

We consider the problem of aligning continuous word representations, lea...
research
04/20/2020

Learning Geometric Word Meta-Embeddings

We propose a geometric framework for learning meta-embeddings of words f...
