Learning Multilingual Word Embeddings in Latent Metric Space: A Geometric Approach

08/27/2018
by   Pratik Jawanpuria, et al.
0

We propose a novel geometric approach for learning bilingual mappings given monolingual embeddings and a bilingual dictionary. Our approach decouples learning the transformation from the source language to the target language into (a) learning rotations for language-specific embeddings to align them to a common space, and (b) learning a similarity metric in the common space to model similarities between the embeddings. We model the bilingual mapping problem as an optimization problem on smooth Riemannian manifolds. We show that our approach outperforms previous approaches on the bilingual lexicon induction and cross-lingual word similarity tasks. We also generalize our framework to represent multiple languages in a common latent space. In particular, the latent space representations for several languages are learned jointly, given bilingual dictionaries for multiple language pairs. We illustrate the effectiveness of joint learning for multiple languages in zero-shot word translation setting.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/27/2018

Learning Multilingual Word Embeddings in a Latent Metric Space: A Geometric Approach

We propose a novel geometric approach for learning bilingual mappings gi...
research
04/20/2020

Learning Geometric Word Meta-Embeddings

We propose a geometric framework for learning meta-embeddings of words f...
research
11/02/2018

Unsupervised Hyperalignment for Multilingual Word Embeddings

We consider the problem of aligning continuous word representations, lea...
research
06/05/2020

Filtered Inner Product Projection for Multilingual Embedding Alignment

Due to widespread interest in machine translation and transfer learning,...
research
12/02/2020

On Extending NLP Techniques from the Categorical to the Latent Space: KL Divergence, Zipf's Law, and Similarity Search

Despite the recent successes of deep learning in natural language proces...
research
03/01/2023

Bootstrapping Parallel Anchors for Relative Representations

The use of relative representations for latent embeddings has shown pote...
research
11/02/2022

Learning an Artificial Language for Knowledge-Sharing in Multilingual Translation

The cornerstone of multilingual neural translation is shared representat...

Please sign up or login with your details

Forgot password? Click here to reset