RAPO: An Adaptive Ranking Paradigm for Bilingual Lexicon Induction

10/18/2022
by Zhoujin Tian, et al.

Bilingual lexicon induction (BLI) induces word translations by aligning independently trained word embeddings in two languages. Existing approaches generally focus on minimizing the distances between the words in aligned pairs, but they have limited discriminative capability for distinguishing the relative order of positive and negative candidates. In addition, the mapping function is globally shared by all words, so its performance may be hindered by deviations between the embedding distributions of different languages. In this work, we propose RAPO, a novel ranking-oriented induction model that learns a personalized mapping function for each word. RAPO benefits simultaneously from the unique characteristics of each individual word and from cross-language isomorphism. Extensive experiments on public datasets covering both rich-resource and low-resource languages demonstrate the superiority of our proposal. Our code is publicly available at <https://github.com/Jlfj345wf/RAPO>.
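
To make the baseline setting in the abstract concrete, the following is a minimal sketch of the conventional pipeline it contrasts with: a single, globally shared orthogonal mapping fitted by Procrustes alignment on seed pairs, followed by nearest-neighbor retrieval. It is not RAPO itself. The function names (`procrustes_map`, `induce`, `margin_ranking_loss`), the toy random embeddings, and the margin value are all assumptions made for illustration. The margin loss is a generic ranking objective, included only to show what comparing a positive candidate against a negative one looks like, and may differ from the loss the authors use.

```python
import numpy as np

def procrustes_map(X, Y):
    """Orthogonal W minimizing ||X @ W - Y||_F for row-aligned seed pairs."""
    U, _, Vt = np.linalg.svd(X.T @ Y)
    return U @ Vt

def normalize(M):
    """Scale each row to unit length so dot products are cosine similarities."""
    return M / np.linalg.norm(M, axis=1, keepdims=True)

def induce(X_src, Y_tgt, W):
    """For each source word, return the index of the most similar target word."""
    sims = normalize(X_src @ W) @ normalize(Y_tgt).T
    return sims.argmax(axis=1)

def margin_ranking_loss(pos_sim, neg_sim, margin=0.2):
    """Generic hinge loss: penalize negatives scored within `margin` of positives."""
    return np.maximum(0.0, margin - pos_sim + neg_sim).mean()

# Toy data (assumption): the "target language" is an exact rotation of the source,
# i.e. the two embedding spaces are perfectly isomorphic.
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 8))                   # source embeddings
R, _ = np.linalg.qr(rng.normal(size=(8, 8)))   # hidden orthogonal rotation
Y = X @ R                                      # target embeddings

W = procrustes_map(X, Y)   # one mapping shared by every word
pred = induce(X, Y, W)     # predicted translation index for each source word
```

In this idealized case the shared mapping recovers the rotation exactly. Real embedding spaces are only approximately isomorphic, which is the limitation of a globally shared mapping that the abstract points to.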

research
04/28/2020

LNMap: Departures from Isomorphic Assumption in Bilingual Lexicon Induction Through Non-Linear Mapping in Latent Space

Most of the successful and predominant methods for bilingual lexicon ind...
research
10/14/2019

Mapping Supervised Bilingual Word Embeddings from English to low-resource languages

It is very challenging to work with low-resource languages due to the in...
research
02/21/2020

Refinement of Unsupervised Cross-Lingual Word Embeddings

Cross-lingual word embeddings aim to bridge the gap between high-resourc...
research
03/23/2015

Unsupervised POS Induction with Word Embeddings

Unsupervised word embeddings have been shown to be valuable as features ...
research
04/19/2023

Low-resource Bilingual Dialect Lexicon Induction with Large Language Models

Bilingual word lexicons are crucial tools for multilingual natural langu...
research
08/08/2022

Automatically constructing Wordnet synsets

Manually constructing a Wordnet is a difficult task, needing years of ex...
research
06/02/2017

One-Sided Unsupervised Domain Mapping

In unsupervised domain mapping, the learner is given two unmatched datas...
