Issues in evaluating semantic spaces using word analogies

06/24/2016
by   Tal Linzen, et al.
0

The offset method for solving word analogies has become a standard evaluation tool for vector-space semantic models: it is considered desirable for a space to represent semantic relations as consistent vector offsets. We show that the method's reliance on cosine similarity conflates offset consistency with largely irrelevant neighborhood structure, and propose simple baselines that should be used to improve the utility of the method in vector space evaluation.

READ FULL TEXT

page 3

page 4

research
04/02/2019

Neural Vector Conceptualization for Word Vector Space Interpretation

Distributed word vector spaces are considered hard to interpret which hi...
research
03/20/2019

Distributed Vector Representations of Folksong Motifs

This article presents a distributed vector representation model for lear...
research
09/01/2020

Document Similarity from Vector Space Densities

We propose a computationally light method for estimating similarities be...
research
08/01/2015

Separated by an Un-common Language: Towards Judgment Language Informed Vector Space Modeling

A common evaluation practice in the vector space models (VSMs) literatur...
research
08/16/2021

IsoScore: Measuring the Uniformity of Vector Space Utilization

The recent success of distributed word representations has led to an inc...
research
03/05/2017

Random vector generation of a semantic space

We show how random vectors and random projection can be implemented in t...
research
10/24/2020

Word2vec Conjecture and A Limitative Result

Being inspired by the success of word2vec <cit.> in capturing analogies,...

Please sign up or login with your details

Forgot password? Click here to reset