Bad Form: Comparing Context-Based and Form-Based Few-Shot Learning in Distributional Semantic Models

10/01/2019
by Jeroen Van Hautte, et al.

Word embeddings are an essential component in a wide range of natural language processing applications. However, distributional semantic models are known to struggle with words for which only a small number of context sentences are available. Several methods have been proposed to obtain higher-quality vectors for such words, leveraging the available context information and, in hybrid approaches, the word forms themselves. We show that the existing evaluation tasks do not suffice to evaluate models that use word-form information, as such models can easily exploit word forms in the training data that are related to word forms in the test data. We introduce 3 new tasks, allowing for a more balanced comparison between models. Furthermore, we show that hyperparameters that have largely been ignored in previous work can consistently improve the performance of both baseline and advanced models, achieving a new state of the art on 4 out of 6 tasks.
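
The concern about related word forms in training and test data can be made concrete with a small sketch. The code below is only an illustration and is not taken from the paper: the function names `char_ngrams` and `form_overlap`, the n-gram range of 3 to 5, and the use of Jaccard overlap are all assumptions chosen for this example. It measures how much of a test word's character n-gram inventory is shared with its closest training word, which is the kind of signal a form-based model could pick up on without using any context.

```python
def char_ngrams(word, n_min=3, n_max=5):
    """Return the set of character n-grams of a word, with boundary markers.

    The 3-to-5 range is an assumption for this sketch, not a setting
    reported in the abstract.
    """
    padded = f"<{word}>"
    return {
        padded[i:i + n]
        for n in range(n_min, n_max + 1)
        for i in range(len(padded) - n + 1)
    }


def form_overlap(test_word, train_words, n_min=3, n_max=5):
    """Find the training word whose n-grams overlap most with the test word.

    Returns (best_word, jaccard_score); (None, 0.0) if nothing overlaps.
    """
    target = char_ngrams(test_word, n_min, n_max)
    best_word, best_score = None, 0.0
    for w in train_words:
        grams = char_ngrams(w, n_min, n_max)
        union = target | grams
        score = len(target & grams) / len(union) if union else 0.0
        if score > best_score:
            best_word, best_score = w, score
    return best_word, best_score


# Hypothetical vocabulary, for illustration only.
train = ["compute", "computer", "banana"]
print(form_overlap("computing", train))  # shares many n-grams with a training word
print(form_overlap("zebra", train))      # shares none
```

A test word with a high overlap score could be embedded reasonably well from its form alone, so a task dominated by such words may overstate how well a hybrid model handles genuinely unseen forms.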

research
08/29/2018

Neural Metaphor Detection in Context

We present end-to-end neural models for detecting metaphorical word use ...
research
07/11/2016

Mapping distributional to model-theoretic semantic spaces: a baseline

Word embeddings have been shown to be useful across state-of-the-art sys...
research
07/20/2017

High-risk learning: acquiring new word vectors from tiny data

Distributional semantics models are known to struggle with small data. I...
research
10/27/2017

One-shot and few-shot learning of word embeddings

Standard deep learning systems require thousands or millions of examples...
research
11/24/2019

Causally Denoise Word Embeddings Using Half-Sibling Regression

Distributional representations of words, also known as word vectors, hav...
research
04/26/2021

Non-Parametric Few-Shot Learning for Word Sense Disambiguation

Word sense disambiguation (WSD) is a long-standing problem in natural la...
research
04/27/2020

Synonyms and Antonyms: Embedded Conflict

Since modern word embeddings are motivated by a distributional hypothesi...