Unsupervised Word Mapping Using Structural Similarities in Monolingual Embeddings

12/19/2017
by   Hanan Aldarmaki, et al.
0

Most existing methods of automatic bilingual dictionary induction rely on prior alignments between the source and target languages, such as parallel corpora or seed dictionaries. For many language pairs, such supervised alignments are not readily available. We propose an unsupervised approach for learning a bilingual dictionary for a pair of languages given their independently-learned monolingual word embeddings. The proposed method exploits local and global structures in monolingual vector spaces to align them such that similar words are mapped to each other. We show experimentally that the performance of the bilingual alignments learned using the unsupervised method is comparable to supervised bilingual alignments using a seed dictionary.

READ FULL TEXT
research
08/27/2018

Learning Multilingual Word Embeddings in a Latent Metric Space: A Geometric Approach

We propose a novel geometric approach for learning bilingual mappings gi...
research
01/29/2017

Extracting Bilingual Persian Italian Lexicon from Comparable Corpora Using Different Types of Seed Dictionaries

Bilingual dictionaries are very important in various fields of natural l...
research
12/31/2020

Beyond Offline Mapping: Learning Cross Lingual Word Embeddings through Context Anchoring

Recent research on cross-lingual word embeddings has been dominated by u...
research
05/26/2021

Word Embedding Transformation for Robust Unsupervised Bilingual Lexicon Induction

Great progress has been made in unsupervised bilingual lexicon induction...
research
08/04/2016

UsingWord Embeddings for Query Translation for Hindi to English Cross Language Information Retrieval

Cross-Language Information Retrieval (CLIR) has become an important prob...
research
08/31/2020

Discovering Bilingual Lexicons in Polyglot Word Embeddings

Bilingual lexicons and phrase tables are critical resources for modern M...
research
12/21/2016

Inverted Bilingual Topic Models for Lexicon Extraction from Non-parallel Data

Topic models have been successfully applied in lexicon extraction. Howev...

Please sign up or login with your details

Forgot password? Click here to reset