Trajectory-Based Meta-Learning for Out-Of-Vocabulary Word Embedding Learning

02/24/2021
by Gordon Buck, et al.

Word embedding learning methods require a large number of occurrences of a word to accurately learn its embedding. However, out-of-vocabulary (OOV) words, which do not appear in the training corpus, emerge frequently in smaller downstream data. Recent work formulated OOV embedding learning as a few-shot regression problem and demonstrated that meta-learning can improve the results. However, the algorithm used, model-agnostic meta-learning (MAML), is known to be unstable and to perform worse when a large number of gradient steps are used for parameter updates. In this work, we propose the use of Leap, a meta-learning algorithm that leverages the entire trajectory of the learning process instead of just its start and end points, and thus ameliorates these two issues. In our experiments on a benchmark OOV embedding learning dataset and in an extrinsic evaluation, Leap performs comparably to or better than MAML. We go on to examine which contexts are most beneficial for learning an OOV embedding, and propose that the choice of contexts may matter more than the meta-learning algorithm employed.
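To make the idea of "leveraging the entire trajectory" more concrete, the following is a minimal toy sketch, not the authors' implementation. It assumes a quadratic task loss, plain gradient descent, and a path length measured jointly over parameter changes and loss changes; the function names (`loss`, `grad`, `trajectory`, `path_length`), the step size, and the step count are all hypothetical choices made for illustration. The point is only that a trajectory-based objective looks at every intermediate step, whereas an objective evaluated solely at the final parameters does not.

```python
import math


def loss(theta, target):
    """Toy task loss (an assumption): half the squared distance to the task optimum."""
    return 0.5 * sum((t - g) ** 2 for t, g in zip(theta, target))


def grad(theta, target):
    """Gradient of the toy quadratic loss with respect to theta."""
    return [t - g for t, g in zip(theta, target)]


def trajectory(theta0, target, steps=5, lr=0.3):
    """Run gradient descent and record (parameters, loss) after every step."""
    theta = list(theta0)
    path = [(list(theta), loss(theta, target))]
    for _ in range(steps):
        g = grad(theta, target)
        theta = [t - lr * gi for t, gi in zip(theta, g)]
        path.append((list(theta), loss(theta, target)))
    return path


def path_length(path):
    """Sum of segment lengths, counting both parameter change and loss change."""
    total = 0.0
    for (th_a, f_a), (th_b, f_b) in zip(path, path[1:]):
        d_theta_sq = sum((b - a) ** 2 for a, b in zip(th_a, th_b))
        total += math.sqrt(d_theta_sq + (f_b - f_a) ** 2)
    return total


target = [1.0, -2.0]
far = trajectory([4.0, 3.0], target)    # initialization far from the task optimum
near = trajectory([1.5, -1.5], target)  # initialization close to the task optimum
print(path_length(far), path_length(near))
```

In this toy setting, an initialization closer to the task optimum yields a shorter path, which is the kind of signal a trajectory-based meta-learner could use to adjust a shared initialization across tasks.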


