Why is unsupervised alignment of English embeddings from different algorithms so hard?

09/01/2018
by   Mareike Hartmann, et al.
0

This paper presents a challenge to the community: Generative adversarial networks (GANs) can perfectly align independent English word embeddings induced using the same algorithm, based on distributional information alone; but fails to do so, for two different embeddings algorithms. Why is that? We believe understanding why, is key to understand both modern word embedding algorithms and the limitations and instability dynamics of GANs. This paper shows that (a) in all these cases, where alignment fails, there exists a linear transform between the two embeddings (so algorithm biases do not lead to non-linear differences), and (b) similar effects can not easily be obtained by varying hyper-parameters. One plausible suggestion based on our initial experiments is that the differences in the inductive biases of the embedding algorithms lead to an optimization landscape that is riddled with local optima, leading to a very small basin of convergence, but we present this more as a challenge paper than a technical contribution.

READ FULL TEXT
research
12/12/2017

Conditional Generative Adversarial Networks for Emoji Synthesis with Word Embedding Manipulation

Emojis have become a very popular part of daily digital communication. T...
research
10/08/2018

Understanding the Origins of Bias in Word Embeddings

The power of machine learning systems not only promises great technical ...
research
12/20/2018

What are the biases in my word embedding?

This paper presents an algorithm for enumerating biases in word embeddin...
research
08/27/2018

Generating Text through Adversarial Training using Skip-Thought Vectors

In the past few years, various advancements have been made in generative...
research
11/15/2022

ALIGN-MLM: Word Embedding Alignment is Crucial for Multilingual Pre-training

Multilingual pre-trained models exhibit zero-shot cross-lingual transfer...
research
11/05/2019

embComp: Visual Interactive Comparison of Vector Embeddings

This work introduces embComp, a novel approach for comparing two embeddi...

Please sign up or login with your details

Forgot password? Click here to reset