Unsupervised Measure of Word Similarity: How to Outperform Co-occurrence and Vector Cosine in VSMs

03/30/2016
by   Enrico Santus, et al.
0

In this paper, we claim that vector cosine, which is generally considered among the most efficient unsupervised measures for identifying word similarity in Vector Space Models, can be outperformed by an unsupervised measure that calculates the extent of the intersection among the most mutually dependent contexts of the target words. To prove it, we describe and evaluate APSyn, a variant of the Average Precision that, without any optimization, outperforms the vector cosine and the co-occurrence on the standard ESL test set, with an improvement ranging between +9.00 chosen top contexts.

READ FULL TEXT

page 1

page 2

research
03/29/2016

What a Nerd! Beating Students and Vector Cosine in the ESL and TOEFL Datasets

In this paper, we claim that Vector Cosine, which is generally considere...
research
08/27/2016

Testing APSyn against Vector Cosine on Similarity Estimation

In Distributional Semantic Models (DSMs), Vector Cosine is widely used t...
research
12/25/2014

Plagiarism Detection on Electronic Text based Assignments using Vector Space Model (ICIAfS14)

Plagiarism is known as illegal use of others' part of work or whole work...
research
12/31/2017

A New Approach for Measuring Sentiment Orientation based on Multi-Dimensional Vector Space

This study implements a vector space model approach to measure the senti...
research
06/10/2023

Using orthogonally structured positive bases for constructing positive k-spanning sets with cosine measure guarantees

Positive spanning sets span a given vector space by nonnegative linear c...
research
09/03/2018

Hypernyms Through Intra-Article Organization in Wikipedia

We introduce a new measure for unsupervised hypernym detection and direc...
research
11/10/2016

Tracing metaphors in time through self-distance in vector spaces

From a diachronic corpus of Italian, we build consecutive vector spaces ...

Please sign up or login with your details

Forgot password? Click here to reset