A Comparison of Semantic Similarity Methods for Maximum Human Interpretability

10/21/2019
by Pinky Sitikhu, et al.
Including semantic information in a similarity measure improves the efficiency of the measure and yields human-interpretable results. This paper presents three methods for computing semantic similarity between short news texts. The methods draw on corpus-based and knowledge-based approaches: cosine similarity using tf-idf vectors, cosine similarity using word embeddings, and soft cosine similarity using word embeddings. Of the three, cosine similarity using tf-idf vectors performed best at finding similarities between short news texts.
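
The three measures named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the smoothed idf weighting, the mean-pooling of word vectors, and the tiny two-dimensional embedding table are all assumptions made for illustration, and a real experiment would use pretrained embeddings and a proper tokenizer.

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Return one sparse {term: weight} dict per tokenized document."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    # Smoothed idf: an assumed weighting, not necessarily the paper's.
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    return [{t: tf * idf[t] for t, tf in Counter(doc).items()} for doc in docs]

def cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm = math.sqrt(sum(w * w for w in a.values())) * \
           math.sqrt(sum(w * w for w in b.values()))
    return dot / norm if norm else 0.0

def dense_cosine(u, v):
    """Cosine similarity between two dense vectors given as lists."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0

def mean_embedding(doc, emb):
    """Average the word vectors of the tokens found in the embedding table."""
    vecs = [emb[t] for t in doc if t in emb]
    dim = len(next(iter(emb.values())))
    if not vecs:
        return [0.0] * dim
    return [sum(col) / len(vecs) for col in zip(*vecs)]

def soft_cosine(a, b, emb):
    """Soft cosine: a^T S b / (sqrt(a^T S a) * sqrt(b^T S b)).

    S[t][u] is 1 for identical terms, otherwise the embedding cosine
    clipped at zero (0 if either term has no embedding).
    """
    def term_sim(t, u):
        if t == u:
            return 1.0
        if t in emb and u in emb:
            return max(0.0, dense_cosine(emb[t], emb[u]))
        return 0.0

    def form(x, y):
        return sum(wx * wy * term_sim(t, u)
                   for t, wx in x.items() for u, wy in y.items())

    denom = math.sqrt(form(a, a)) * math.sqrt(form(b, b))
    return form(a, b) / denom if denom else 0.0

# Hypothetical 2-D embeddings, invented purely for this example.
toy_emb = {
    "president": [1.0, 0.1], "leader": [0.9, 0.2],
    "press": [0.1, 1.0], "media": [0.2, 0.9],
    "greets": [0.7, 0.7], "speaks": [0.6, 0.8],
}

docs = [
    ["president", "greets", "press"],
    ["leader", "speaks", "media"],
    ["president", "greets", "press"],
]

tfidf = tfidf_vectors(docs)
sim_tfidf = cosine(tfidf[0], tfidf[1])    # no shared terms
same_tfidf = cosine(tfidf[0], tfidf[2])   # identical documents
sim_embed = dense_cosine(mean_embedding(docs[0], toy_emb),
                         mean_embedding(docs[1], toy_emb))
sim_soft = soft_cosine(Counter(docs[0]), Counter(docs[1]), toy_emb)
```

In this toy setup the first two documents share no terms, so the tf-idf cosine is zero, while the two embedding-based measures still register similarity through related words. That contrast only shows how the measures differ mechanically; it says nothing about which one performs best on real news texts, which is the question the paper evaluates.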
